The MapR Hadoop open source batch analytics software package will become available with Informatica’s data integration platform through four different vendors
MapR Technologies and Apache Hadoop, which already partner for their own distribution of the popular open-source batch analytics software package, on 5 March announced that MapR Hadoop will become available with Informatica’s data integration platform.
Sharing resources is the operative term here. This new analytics software is now accessible from four different vendors.
High-performance data integration
Informatica HParser Community Edition – already available for free download from the Informatica site – is now freely available as part of the MapR distribution from the MapR website. Moreover, the MapR distribution is also available through the EMC Greenplum MR Edition and as part of the Cisco Unified Computing System.
By using MapR and Informatica in tandem, enterprises can perform high-performance data integration, replication and messaging and get reports quickly. Together the two products can move numerous terabytes per hour of transaction, interaction and streaming data into MapR, in addition to parsing and processing a range of structured and unstructured data natively in Hadoop – all without scripting.
Key features of the combination of MapR and Informatica include:
Bi-directional data integration with Informatica PowerCentre and Informatica PowerExchange.
Snapshot replication using Informatica FastClone.
Data streaming using Informatica Ultra Messaging.
Parallel parsing and transformation on MapR using Informatica HParser.
“As the big data ecosystem continues to grow more holistic, data management practices are needed to address the challenges of information volume, variety, velocity and complexity,” said Gartner research vice president Merv Adrian.
“Coupling advanced data management practices is the next step organisations need to take as they build a platform that expands the breadth and depth of applications.”