PowerExchange for Cloud and Big Data Applications

PowerExchange for MongoDB

Posted by: Informatica Platform

Informatica PowerExchange for MongoDB enables you to extract data from and load data to MongoDB. Based on the ODBC standard, the adapter supports connectivity to both on-premises and cloud-based MongoDB deployments.

Overview

Many application developers today use MongoDB for use cases such as big data management, content management and delivery, operational intelligence, and product data management. However, without robust and reliable tools to access its data, MongoDB can become a data silo. PowerExchange for MongoDB is an enterprise-class connector that extracts data from and loads data to MongoDB, with the same reliability that you expect from Informatica products.

MongoDB is the leading NoSQL database today. It is designed for ease of development and scaling, and its flexible schema makes application development truly agile. However, this flexibility can create challenges for IT departments that are used to working with traditional databases and data warehouses. PowerExchange for MongoDB is based on the same proven Informatica data integration software used by thousands of customers worldwide. It provides a single, consistent data integration approach for all types of data, whether structured or unstructured. This helps your IT organization adopt MongoDB across the enterprise quickly and painlessly.

PowerExchange for MongoDB also helps reduce data integration costs. Up to 80% of the work in a Big Data project involves data quality and data integration tasks. By moving from custom-coded data integration logic to proven data integration and productivity tools, you no longer need an army of developers to load and extract data from MongoDB. By taking care of data integration, PowerExchange for MongoDB and the Informatica Platform reduce your data integration costs and let your developers do what they do best: innovate and create better applications.

PowerExchange for MongoDB lets PowerCenter read from and write to MongoDB directly, without depending on intermediate message queues or staging tables. Data is delivered with the kind of reliability that can come only with an integrated platform. There's no code to develop, so integration interfaces can be developed faster and more reliably, whether they require access to message-oriented middleware, mainframe, midrange or relational databases, service-oriented architecture and Web services, or packaged applications. One family of products, using one set of skills, accesses them all.

The advantages of PowerExchange don't stop there. Each member of the PowerExchange family automatically captures relevant metadata from its sources and targets and shares it with the rest of the Informatica platform. Proposed changes can be assessed rapidly and implemented with confidence, and the same metadata can help resolve questions of data lineage that often arise in audit and compliance exercises. This is one more advantage of the Informatica data integration platform.

Features

High-Speed Extract and Load of Data

PowerExchange for MongoDB uses the ODBC interface to extract data from and load data to MongoDB. Based on the ODBC 3.52 standard, the adapter supports connectivity to both on-premises and cloud-based MongoDB deployments. By performing high-speed data ingestion and extraction, the connector delivers great performance even when working with large volumes of data.

Metadata Discovery for Flexible Data Models

Data in MongoDB has a flexible schema. Collections in MongoDB do not enforce a document structure, and while this is great for agile application development, it can lead to challenges when data needs to be extracted for analysis. PowerExchange for MongoDB samples records in a collection to derive a representative schema for that collection. Users can edit the discovered schema to add or remove columns as required for the analysis. Automated Schema Discovery helps ensure that no data is left out of data extracts.
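To make the idea of sampling-based schema discovery concrete, here is a minimal Python sketch. It is an illustration only, not the connector's actual algorithm: the function name `discover_schema`, the `sample_size` parameter, and the sample documents are all hypothetical, and documents are modeled as plain dictionaries.

```python
from collections import defaultdict


def discover_schema(documents, sample_size=100):
    """Infer a representative schema from the first `sample_size` documents.

    Returns a dict that maps each top-level field name to the sorted list of
    Python type names observed for that field across the sample.
    (Hypothetical helper; not part of any Informatica or MongoDB API.)
    """
    observed = defaultdict(set)
    for doc in documents[:sample_size]:
        for field, value in doc.items():
            observed[field].add(type(value).__name__)
    return {field: sorted(types) for field, types in observed.items()}


# Hypothetical sample: documents in one collection with differing fields.
sample = [
    {"_id": 1, "name": "Ada", "age": 36},
    {"_id": 2, "name": "Lin", "email": "lin@example.com"},
    {"_id": 3, "name": "Sam", "age": "unknown"},
]

schema = discover_schema(sample)
for field, types in schema.items():
    print(field, types)
```

In this sketch, a field that appears in only some documents (such as `email`) still becomes a column, and a field with mixed types (such as `age`) is flagged with every type seen, which is the kind of result a user might then edit by hand.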

Embedded Entity Access

MongoDB supports storing nested entities in a single document through Embedded Documents and Arrays. These objects help increase application performance by storing contained and related entities together. PowerExchange for MongoDB supports accessing data from Embedded Documents and Arrays by creating pivoted columns, so that users can easily access data from these objects. Support for Arrays and Nested Documents allows you to meet your data integration needs independent of the application data modeling in MongoDB.
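As a rough illustration of how embedded documents and arrays can be turned into flat columns, consider the Python sketch below. The column naming scheme (dotted paths with array indexes) is an assumption made for this example and may differ from the pivoted columns the connector produces; `flatten` and the `order` document are hypothetical.

```python
def flatten(document, prefix=""):
    """Flatten a nested document into a single-level dict of columns.

    Embedded documents become dotted column names ("customer.name") and
    array elements get their index in the name ("items.0.sku").
    (Hypothetical naming scheme, chosen only for illustration.)
    """
    columns = {}
    for key, value in document.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            # Recurse into an embedded document.
            columns.update(flatten(value, prefix=f"{name}."))
        elif isinstance(value, list):
            # Give each array element its own indexed column(s).
            for index, item in enumerate(value):
                if isinstance(item, dict):
                    columns.update(flatten(item, prefix=f"{name}.{index}."))
                else:
                    columns[f"{name}.{index}"] = item
        else:
            columns[name] = value
    return columns


# Hypothetical order document with an embedded document and two arrays.
order = {
    "_id": 42,
    "customer": {"name": "Ada", "address": {"city": "London"}},
    "items": [{"sku": "A1", "qty": 2}, {"sku": "B7", "qty": 1}],
    "tags": ["gift", "rush"],
}

row = flatten(order)
for column, value in row.items():
    print(column, "=", value)
```

Each nested value ends up addressable by a plain column name, so downstream tooling that expects rows and columns does not need to understand the document's nesting.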

JSON Document Handling

MongoDB stores data in the BSON serialization format, which is a binary form of JSON. PowerExchange for MongoDB provides an option to interact directly with JSON data when extracting data from or loading data to MongoDB. This helps you import JSON data directly into MongoDB or transfer data from one MongoDB collection to another. Combined with the powerful B2B Data Transformation Engine from Informatica, this feature can also help you create complex JSON structures that can be ingested directly into MongoDB as documents, or extract selected JSON elements for analysis.
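The sketch below shows, in plain Python, what extracting selected JSON elements from a document can look like. It uses only the standard `json` module and is not the B2B Data Transformation Engine or the connector itself; `select_elements`, the dotted path syntax, and the sample document are assumptions made for illustration.

```python
import json


def select_elements(json_text, paths):
    """Parse a JSON document and pick out the values at the given dotted paths.

    A path that does not exist in the document maps to None.
    (Hypothetical helper; the path syntax is an assumption of this sketch.)
    """
    document = json.loads(json_text)
    selected = {}
    for path in paths:
        value = document
        for part in path.split("."):
            if isinstance(value, dict) and part in value:
                value = value[part]
            else:
                value = None
                break
        selected[path] = value
    return selected


# Hypothetical JSON document as it might be exported from a collection.
raw = '{"_id": 7, "title": "Q3 report", "author": {"name": "Lin", "team": "Finance"}}'

picked = select_elements(raw, ["title", "author.name", "author.phone"])
print(json.dumps(picked, indent=2))
```

Because the selected elements are returned as an ordinary dictionary, they can be serialized back to JSON with `json.dumps` and passed along unchanged, which mirrors the idea of moving JSON between collections without reshaping it.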

Distributed Data Access through Read Preferences

PowerExchange for MongoDB provides the powerful "Read Preferences" feature, which lets you choose which members of the MongoDB Replica Set to use for data integration. You can distribute data integration jobs to secondary nodes in a replica set, so that they do not affect the performance of the primary node, which is utilized for running the main application load.
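For readers unfamiliar with read preferences, the Python sketch below builds a standard MongoDB connection string that asks drivers to prefer secondary members. It illustrates the MongoDB `readPreference` and `replicaSet` URI options in general, not how the option is set in PowerExchange; the helper `build_uri`, the host names, the database name, and the replica set name `rs0` are hypothetical.

```python
from urllib.parse import urlencode

# Read preference modes defined by MongoDB.
READ_PREFERENCES = {
    "primary",
    "primaryPreferred",
    "secondary",
    "secondaryPreferred",
    "nearest",
}


def build_uri(hosts, database, replica_set, read_preference="secondaryPreferred"):
    """Build a MongoDB connection string with a read preference.

    (Hypothetical helper for illustration; host and database names are
    supplied by the caller.)
    """
    if read_preference not in READ_PREFERENCES:
        raise ValueError(f"unknown read preference: {read_preference}")
    options = urlencode(
        {"replicaSet": replica_set, "readPreference": read_preference}
    )
    return f"mongodb://{','.join(hosts)}/{database}?{options}"


# Hypothetical two-member replica set.
uri = build_uri(
    ["db1.example.com:27017", "db2.example.com:27017"],
    database="sales",
    replica_set="rs0",
)
print(uri)
```

With `secondaryPreferred`, reads go to a secondary member when one is available and fall back to the primary otherwise, which matches the goal of keeping integration reads away from the node serving the main application.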

Informatica products are supported on all major server operating systems, client operating systems, and web browsers. The support provided varies by product, operating system version, and Informatica product version. We recommend that you verify the availability of the product for the configuration that you need before making the purchase. All existing Informatica customers can access details about availability from the Product Availability Matrix (PAM) on the Support Portal. All non-Informatica customers can seek additional information by contacting us at marketplacesupport@informatica.com.

Support

Informatica Global Customer Support (GCS) does not require Informatica to receive Customer Data. “Customer” means the entity receiving Support Services and “Customer Data” means personal data as defined under the EU General Data Protection Regulation (GDPR) or Confidential Customer Information, PII (Personally Identifiable Information), PHI (Protected Health Information), and Personal Financial Information (PFI). In the event Customer Data is mistakenly sent, shared, or uploaded to GCS, Customer must notify Informatica promptly to ensure deletion from GCS systems. See the “Data Confidentiality” section of the Support Guide for more details.