ETL/MDM Developer

Posted: 2 months ago

Title : ETL/MDM Developer
Location : NYC, NY
Duration : 5 months
Interview Type : Phone + Onsite interview


Roles/Requirements ( "ETL/MDM Developer ")

Client has a truly unique opportunity for ETL Developer for MDM project. As part of this team, you will work on the collecting, storing, processing, and analyzing huge sets of data. The primary focus will be to develop the construction and maintenance of our data pipeline, ETL processes and data warehouse. ETL Developer will also be responsible for data quality and understanding the data needs our various source data in order to anticipate and scale our systems.

Candidate must be willing to learn, work well independently, be open to feedback, and enthusiastic, with demonstrated technical aptitude, skills and abilities

Current technology includes (but is not limited to):
ETL – Datastage, Unix Scripting, SQL, PL/SQL, Oracle
Change Data Capture (CDC)
Data Ingestion to preparation to exploration and consumption using Cloud & Big Data Platform
Dimensional and Relation table structures
AWS cloud (S3, EC2, EMR, Redshift, etc.)
Knowledge of Databricks, Snowflake, Attunity, Airflow
Must have good understanding of Master Data Management (MDM) architectures, business processes and various MDM domains like Customer, Product and Organization
Must have hands-on experience with MDM solution development - Data modelling, Data Integration, Data validation, Match and Merge rules and Data stewardship
Knowledge of Web Services, XML, SOAP, REST, integration middleware such as Mulesoft


Roles & responsibilities may include:
Integrate data from a variety of data sources (Data warehouse, Data marts) utilizing on-premises or cloud-based data structures;
Develop and implement streaming, data lake, and analytics big data solutions
Create Applications using Change Data Capture Tools
Application Performance and System Testing Analyst
Responsible for configuration identification & control, and build management of all components of MDM development life cycle. Manage Data & Code configuration across MDM Prod/Non-Prod Ecosystem
Implement Master data management standards, enforce data governance procedures and ensure data integrity across multiple platforms
Design, develop, deploy and support end to end ETL specifications based on business requirements and processes such as source-to-target data mappings, integration workflows, and load processes using IBM Datastage
Developing ETL jobs using various stages such as Sequential, Dataset, Transformer, Copy, Lookup, filter, Join, Merge, Funnel, Sort, Remove Duplicates, Modify and Aggregator etc.
Involve in day to day support and providing solutions/troubleshooting Production Outages/Issues using Datastage tool for business requirements, enhancements and handling service requests


"Ideal” candidates will have the following experience, knowledge, skills or abilities:
Minimum of 5-7 years of IT work experience focused in Data Acquisition and Data Integration using DataStage
Minimum 4-6 years of experience with ORACLE SQL and PL/SQL Package
Experience working with flat files and XML transformations
Prior experience in data warehouse design and modeling. Data Vault 2.0 / dimensional modeling preferred
Analyzing the statistics of the DataStage jobs in director and conducting performance tuning to reduce the time required to load the tables and the loads on computing nodes involved.
Technical experience in IBM MDM InfoSphere and WebSphere platforms
Programming capability in a high-level programming language such as Python, Java or other language related to automating deployment, configuration and system monitoring
Hands-on experience designing and developing MDM solutions leveraging DataStage or Talend or Reltio or Orchestra Networks
Knowledge/Experience on Data Warehousing applications, directly responsible for the Extraction, Staging, Transformation, Pre Loading and Loading of data from multiple sources into Data Warehouse
Application development, including Cloud development experience, preferably using AWS (AWS Services, especially S3, API Gateway, Redshift, Lambda, etc.)
Python, Spark and AWS experience is a big plus.

Must have Skill set :
* ETL – Datastage, Unix Scripting, SQL, PL/SQL, Oracle
* Must have good understanding of Master Data Management (MDM) architectures, business processes and various MDM domains like Customer, Product and Organization
* Must have hands-on experience with MDM solution development - Data modelling, Data Integration, Data validation, Match and Merge rules and Data stewardship

Enterprise Databases
ETL datastage - 6 yrs
Oracle RDBMS - 6 yrs

Programming tools
Oracle PL/SQL - 5 years