IBM
IBM Data Engineering Professional Certificate
IBM

IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Gain the in-demand skills and hands-on experience to get job-ready in less than 5 months. No prior experience required.

Taught in English

Some content may not be translated

IBM Skills Network Team
Muhammad Yahya
Abhishek Gagneja

Instructors: IBM Skills Network Team

17,968 already enrolled

Professional Certificate - 13 course series

Earn a career credential that demonstrates your expertise

4.6

(3,970 reviews)

Beginner level

Recommended experience

5 months at 10 hours a week
Flexible schedule
Learn at your own pace
Earn degree credit

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Details to know

Shareable certificate

Add to your LinkedIn profile

Professional Certificate - 13 course series

Earn a career credential that demonstrates your expertise

4.6

(3,970 reviews)

Beginner level

Recommended experience

5 months at 10 hours a week
Flexible schedule
Learn at your own pace
Earn degree credit

See how employees at top companies are mastering in-demand skills

Placeholder

Advance your career with in-demand skills

  • Receive professional-level training from IBM
  • Demonstrate your technical proficiency
  • Earn an employer-recognized certificate from IBM
Placeholder
$132,000+
median U.S. salary for Data Engineering
¹
59,000+
U.S. job openings in Data Engineering
¹

Get exclusive access to career resources upon completion

  • Soft skills training

    Get free access to IBM’s People and Soft Skills Specialization

  • Resume review

    Improve your resume and LinkedIn with personalized feedback

  • Interview prep

    Practice your skills with interactive tools and mock interviews

  • Career support

    Plan your career move with Coursera’s job search guide

¹Lightcast™ Job Postings Report, United States, 7/1/22-6/30/23. ²Based on program graduate survey responses, United States 2021.

Placeholder

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Placeholder

Professional Certificate - 13 course series

Introduction to Data Engineering

Course 112 hours4.7 (2,298 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Category: Shell Script
Category: Bash (Unix Shell)
Category: Extract Transform and Load (ETL)
Category: Linux
Category: Linux Commands

Python for Data Science, AI & Development

Course 226 hours4.6 (34,899 ratings)

What you'll learn

  • Learn Python - the most popular programming language and for Data Science and Software Development.

  • Apply Python programming logic Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas & Numpy, and developing code using Jupyter Notebooks.

  • Access and web scrape data using APIs and Python libraries like Beautiful Soup.

Skills you'll gain

Category: Cloud Database
Category: Mongodb
Category: Cassandra
Category: NoSQL
Category: Cloudant

Python Project for Data Engineering

Course 39 hours4.6 (555 ratings)

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement webscraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

Category: Data Science
Category: Data Analysis
Category: Python Programming
Category: Numpy
Category: Pandas

Introduction to Relational Databases (RDBMS)

Course 421 hours4.6 (494 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM DB2

Skills you'll gain

Category: Machine Learning
Category: Machine Learning Pipelines
Category: Data Engineer
Category: SparkML
Category: Apache Spark

Databases and SQL for Data Science with Python

Course 520 hours4.6 (19,061 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Skills you'll gain

Category: Python Programming
Category: Relational Databases
Category: SQL
Category: NoSQL
Category: Data Pipelines

Hands-on Introduction to Linux Commands and Shell Scripting

Course 614 hours4.7 (1,051 ratings)

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

Category: Python Programming
Category: Information Engineering
Category: Extract Transform and Load (ETL)
Category: Data Engineer
Category: Web Scraping

Relational Database Administration (DBA)

Course 720 hours4.5 (158 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Category: Extract Transform and Load (ETL)
Category: Data Engineer
Category: Apache Kafka
Category: Apache Airflow
Category: Data Pipelines

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 816 hours4.5 (278 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement an ETL pipeline through shell scripting.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Category: Big Data
Category: SparkSQL
Category: SparkML
Category: Apache Hadoop
Category: Apache Spark

Getting Started with Data Warehousing and BI Analytics

Course 916 hours4.4 (143 ratings)

What you'll learn

  • Explore the architecture, features, and benefits of data warehouses, data marts, and data lakes and identify popular data warehouse system vendors.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • Design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Category: Python Programming
Category: Cloud Databases
Category: Relational Database Management System (RDBMS)
Category: SQL
Category: Jupyter notebooks

Introduction to NoSQL Databases

Course 1017 hours4.6 (239 ratings)

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

Category: Database Security
Category: Database (DBMS)
Category: Database Servers
Category: database administration
Category: Relational Database

Introduction to Big Data with Spark and Hadoop

Course 1118 hours4.4 (316 ratings)

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Category: Data Science
Category: Database (DBMS)
Category: Information Engineering
Category: SQL
Category: NoSQL

Machine Learning with Apache Spark

Course 1214 hours4.6 (42 ratings)

What you'll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.

  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.

  • Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.

  • Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Skills you'll gain

Category: Database (DB) Design
Category: Postgresql
Category: Relational Database Management System (RDBMS)
Category: Database Architecture
Category: MySQL

Data Engineering Capstone Project

Course 1313 hours4.7 (82 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Category: Data Warehousing
Category: Business Intelligence (BI)
Category: cognos analytics
Category: Cube and Rollup
Category: Star and Snowflake Schema

Instructors

IBM Skills Network Team
IBM
53 Courses711,393 learners
Muhammad Yahya
IBM
4 Courses43,193 learners
Abhishek Gagneja
IBM
4 Courses90,961 learners

Offered by

IBM

Why people choose Coursera for their career

New to Data Management? Start here.

Placeholder

Open new doors with Coursera Plus

Unlimited access to 7,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions