HOW TO: Configure Informatica Big Data 10.2.1 products with Amazon EMR 5.14
Solution
When Informatica Big Data 10.2.1 was released, the Big Data products supported Amazon EMR 5.10. Informatica now supports Amazon EMR 5.14. To use Big Data 10.2.1 products with EMR 5.14, download EBF-12444 from the TSFTP site at /updates/Informatica10/10.2.1/ and apply it. The EBF supports the RedHat (RHEL) distribution of Linux64-X86. It supports the integration of Amazon EMR 5.14 with the following Informatica products:
  • Big Data Management
  • Big Data Streaming
  • Big Data Quality

Download Hive .jar Files

Get .jar files from the Hadoop administrator. The following files are on the master node in the Hadoop cluster:
  • For integration with EMR 5.10, copy emrfs-hadoop-assembly-2.20.0.jar.
  • For integration with EMR 5.14, copy emrfs-hadoop-assembly-2.23.0.jar.
Copy the .jar files to the following directory on each Data Integration Service machine: /<Informatica installation directory>/services/shared/hadoop/EMR_<version number>/lib.

For integration with EMR 5.14, also copy emrfs-hadoop-assembly-2.23.0.jar to the following path: /<Informatica installation directory>/services/shared/hadoop/EMR_<version number>/extras/hive-auxjars.

Note: If you have upgraded from EMR 5.10 to EMR 5.14, the part of the filepath that includes EMR_<version number> remains EMR_5.10. 
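The copy steps above can be sketched as a small script. This is a minimal illustration, not an Informatica-supplied tool: the function names, the `dir_version` parameter, and the example paths are hypothetical, and the sketch assumes the .jar file has already been fetched from the master node to a local directory on the Data Integration Service machine.

```python
import shutil
from pathlib import Path

# EMRFS .jar file names from this article, keyed by EMR version.
EMRFS_JARS = {
    "5.10": "emrfs-hadoop-assembly-2.20.0.jar",
    "5.14": "emrfs-hadoop-assembly-2.23.0.jar",
}


def target_dirs(install_dir, emr_version, dir_version=None):
    """Return the directories that should receive the EMRFS .jar file.

    dir_version (hypothetical parameter) is the version used in the
    EMR_<version number> folder name. After an upgrade from EMR 5.10 to
    EMR 5.14 the folder is still named EMR_5.10, so pass "5.10" there.
    """
    folder = f"EMR_{dir_version or emr_version}"
    base = Path(install_dir) / "services" / "shared" / "hadoop" / folder
    dirs = [base / "lib"]
    if emr_version == "5.14":
        # EMR 5.14 also needs the .jar file in extras/hive-auxjars.
        dirs.append(base / "extras" / "hive-auxjars")
    return dirs


def copy_emrfs_jar(jar_source_dir, install_dir, emr_version, dir_version=None):
    """Copy the EMRFS .jar file for emr_version into each target directory.

    Raises KeyError for an EMR version not listed in EMRFS_JARS and
    FileNotFoundError if a target directory does not exist.
    """
    jar = Path(jar_source_dir) / EMRFS_JARS[emr_version]
    copied = []
    for directory in target_dirs(install_dir, emr_version, dir_version):
        if not directory.is_dir():
            # A missing directory usually means a wrong installation path
            # or folder version, so stop instead of creating it.
            raise FileNotFoundError(f"Target directory not found: {directory}")
        copied.append(Path(shutil.copy2(jar, directory)))
    return copied
```

For example, on a machine upgraded from EMR 5.10 to EMR 5.14, `copy_emrfs_jar("/tmp/emr-jars", "/opt/informatica", "5.14", dir_version="5.10")` would copy emrfs-hadoop-assembly-2.23.0.jar into both the lib and extras/hive-auxjars directories under EMR_5.10. Both paths in that call are placeholders. Run it on each Data Integration Service machine.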


Known Limitation

BDM-20784
When a mapping that runs on the Spark engine includes active transformations, monitoring statistics displayed in the Monitoring tab of the Administrator tool are not accurate.

More Information
For more information about supported versions, see the Product Availability Matrix on the Informatica Customer Portal: https://network.informatica.com/community/informatica-network/product-availability-matrices.

For more information about steps to upgrade Big Data Management and integrate it with Hadoop distributions, see the Big Data Management Hadoop Integration Guide on the Informatica Customer Portal: https://kb.informatica.com/_layouts/ProductDocumentation/Page/ProductDocumentSearch.aspx.

To search for an EBF from the TSFTP site:
  1. Log in to https://network.informatica.com.
  2. Click HotFix Downloads.
  3. Click the EBF Download tab.
  4. Use the search box to search for an EBF. For example, enter EBF-12444.
Download the available EBF archive file, which will have a filename like EBF-<number>.<os>.tar.gz.
For instance, EBF-12444.Linux64-X86.tar.gz.
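The naming pattern above can be checked programmatically, for example to confirm that a downloaded file matches the EBF that was searched for. The following is a minimal sketch; the helper names are hypothetical, and the pattern is assumed only from the EBF-<number>.<os>.tar.gz form given in this article.

```python
import re

# Pattern assumed from the article: EBF-<number>.<os>.tar.gz
_EBF_NAME = re.compile(r"^EBF-(\d+)\.(.+)\.tar\.gz$")


def ebf_archive_name(ebf_number, os_name):
    """Build the expected archive file name for an EBF number and OS."""
    return f"EBF-{ebf_number}.{os_name}.tar.gz"


def parse_ebf_archive_name(filename):
    """Split an EBF archive file name into (number, os).

    Raises ValueError if the name does not follow the assumed pattern.
    """
    match = _EBF_NAME.match(filename)
    if match is None:
        raise ValueError(f"Not an EBF archive name: {filename}")
    return int(match.group(1)), match.group(2)
```

Calling `ebf_archive_name(12444, "Linux64-X86")` yields the file name from the example above, and `parse_ebf_archive_name` reverses it.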

Reference
Applies To
Product: Data Engineering Integration (Big Data Management); Data Engineering Quality (Big Data Quality); Data Engineering Streaming (Big Data Streaming)
Product Version: Big Data Management 10.2.1; Big Data Quality 10.2.1; Big Data Streaming 10.2.1
Last Modified Date: 11/14/2018 10:00 PM
ID: 560632