Tools and applications that are YARN Ready have been verified to work within YARN, which means they are able to use the resources of the customer’s Hadoop system to process Hadoop data in-place, without interfering with other YARN Ready tools and applications.
Apache Hadoop YARN is the data operating system for Hadoop 2, responsible for managing access to Hadoop’s critical resources. YARN enables a user to interact with all data in multiple ways simultaneously, making Hadoop a true multi-use data platform and allowing it to take its place in a modern data architecture. Customers building a data lake expect to operate on the data without moving it to other systems, leveraging the processing resources of the data lake. Applications that use YARN fulfill that promise, lowering operational costs while improving quality and time-to-insight.
|Full control||YARN Native||Custom or packaged app where fine grained control of cluster resources is required|
|Batch (Legacy)||MapReduce||Existing MapReduce application written for the 1.x code line|
|Batch or Interactive Query||Tez||Business Intelligence or analytic applications that optimize throughput while reducing latency|
|Online or Realtime Service||Slider||“Always-on” services, e.g. online or streaming applications|