DataTorrent is an enterprise-grade software platform that enables businesses to perform any sort of data processing or transformations on structured or unstructured data, all in real-time as the data is streaming into the data center. And it does this at massive scale on any type of Big Data job.
Leveraging Hadoop 2.0, DataTorrent is a YARN-native application platform. It can be installed directly onto an existing Hadoop cluster, connect directly to all in-coming data sources live, and perform any type of processing or transformation of your data in-memory, as it comes streaming in. DataTorrent will handle all of the scaling and fault tolerance of the system, leaving enterprises to focus on just their business logic, which can all be written in Java.