Informatica 'Data Engineering Integration' (DEI),
earlier known as 'Big Data
Management' (BDM), enables the organization to process large, diverse, and fast-changing data sets so as to get insights into their data. Big Data Management is used to perform big data integration and transformation without writing or maintaining Apache Hadoop code.
In case of versions prior to Informatica 10.2.0, it would be required to run 'BDMConfig.sh' script for downloading the Hadoop cluster configuration site-xml files (core-site.xml, hdfs-site.xml, yarn-site.xml etc..) into Informatica server machine and to create the Hadoop cluster connections. Utility can be found under '$INFA_HOME/tools/BDMUtil' and configuration site-xml files would be downloaded into '$INFA_HOME/services/shared/hadoop/<distribution>/conf' location.
Starting from Informatica 10.2.0 version, 'Cluster Configuration Object' (CCO) got introduced and it could be created from 'Manage > Connections' tab in Informatica Administrator console. CCO would encapsulate all the Hadoop configuration site-xml files in web User Interface (UI), which were earlier downloaded using 'BDMConfig.sh' script. For more information on CCO configuration, refer to the following link of Informatica Administrator guide:
In CCO, once created, value for any of the configurations in the site-xml files can be overridden, by editing it and providing the required value. Overridden configuration value in CCO would be used by pushdown jobs submitted from Informatica.
When there are any service configuration changes in the Hadoop cluster, it would be required to refresh the CCO, so as to make Informatica Hadoop pushdown jobs use the latest configurations. Even after the 'Refresh' operation on CCO, configurations that are overridden by Informatica would still persist in CCO. Overridden configurations can be seen from 'Overridden Properties' section in the CCO.
For more information on 'CCO Refresh', refer 'Refresh the Cluster Configuration' section in the Informatica DEI Administrator guide, accessible from the following location:
Big Data to 'Data Engineering' Product Portfolio Renaming
HOW TO: Create CCO using 'Import
from Archive File' option in Informatica DEI (KB 523286)
HOW TO: Refresh CCO using 'Import from Archive File' option in Informatica DEI? (KB 527855)
HOW TO: Verify if
the CCO changes have been picked up by the scheduled mapping/profiling job in
Informatica? (KB 615479)
HOW TO: Clear overridden Hadoop
service configuration present in CCO of Informatica DEI? (KB 527858)
HOW TO: Check cluster configured
value for the overridden property in CCO of Informatica DEI? (KB 532978)
FAQ: Is it possible to enable 'Auto Refresh' for CCO
in Informatica DEI? (KB 563545)
HOW TO: Refresh a
Cluster Configuration Object using Cluster URL from infacmd in Informatica DEI
HOW TO: Enable
debug logging for Cluster Configuration Object Creation/Refresh in Informatica
DEI (KB 522180)
What can we do to improve this information (2000 or fewer characters)