At Air Products, our purpose is to bring people together to reimagine what’s possible, collaborate and innovate solutions to the world’s most significant energy and environmental sustainability challenges. Grow with us as we embark on building tomorrow together by being the safest, most diverse and most profitable industrial gas company in the world.
Reimagine What’s Possible
Join Our Global Team as a Data Engineer!
Are you passionate about data and analytics? We're looking for a talented Data Engineer to join our team and help operationalize data pipelines that drive our company's analytics and AI initiatives.
Our data lake empowers data scientists, business analysts, and IT professionals to undertake advanced analytics projects and business reporting. As a Data Engineer, you'll play a crucial role in transforming data models and algorithms into actionable insights, driving the success of our enterprise initiatives.
This role is responsible for creating, managing, and documenting data flows from various sources, using batch, near real-time, and streaming ingestion patterns to deliver high-quality data into our enterprise data lake. You will work closely with data scientists, analysts, and other data consumers to productionize data models and algorithms, enhancing the efficiency of advanced analytics projects. At each stage of the process, you will implement the appropriate data quality, governance, and security steps so that data is ready for use across the enterprise.
If you're ready to make a global impact and take your career to the next level, apply now and become a key player in our data-driven journey!
Principal Accountabilities
- Data Pipeline Development: Design, construct, test, and maintain highly scalable data management systems.
- Data Integration: Integrate structured and unstructured data from multiple sources into a unified data system, using tools such as Qlik Replicate, Spark, Glue, and Python to ensure data quality and consistency.
- Data Warehousing: Build and maintain data warehouses and data lakes to store and retrieve vast amounts of data efficiently.
- Data Processing: Implement data processing frameworks (e.g., Spark) to process large datasets in real time or in batch.
- Automation and Monitoring: Automate manual processes, optimize data delivery, and develop data monitoring systems to ensure data integrity and accuracy.
- Collaboration: Work closely with data scientists, analysts, and other stakeholders to understand data needs and provide technical solutions that meet business requirements.
- Data Governance: Ensure data governance policies are followed, including data security, data privacy, and compliance with regulations.
- Performance Tuning: Optimize the performance of ETL processes, databases, and data pipelines to handle large volumes of data and reduce processing times.
- Project Management: Drive projects from the design phase through delivery and handover.
Qualifications
Education:
- 4-year College Degree required; Bachelor’s Degree in Information Technology or related technical discipline preferred.
Experience:
- Experience with Python and PySpark, building scalable real-time streaming ETL applications and data warehouses.
- Advanced proficiency in PySpark and Python ETL modules.
- Experience with large data sets in a time-sensitive environment.
Technical Skills:
- Proficient with AWS tools (S3, Glue, Lake Formation, Athena, Redshift).
- Experience with infrastructure as code using Terraform.
- Advanced understanding of SQL and NoSQL technologies (e.g., MongoDB/DocumentDB).
- Hands-on experience with Qlik (Attunity) Replicate.
- Experience with Databricks.
Additional Skills:
- Solid understanding of data warehouse design patterns and best practices.
- Ability to develop test plans and stress test platforms.
- Experience with complex job scheduling.
- Strong process development, adherence, and improvement skills.
- Effective analytical, conceptual, and problem-solving skills.
- Organized, disciplined, and task/goal-oriented.
- Ability to prioritize and coordinate work based on high-level goals and strategy.
- Effective team player with a positive attitude.
- Strong oral and written English communication skills.
- Experience coordinating work across multiple teams/resources is a plus.
We are the world’s largest hydrogen producer with over 80 years of industrial gas experience. We are hydrogen and industrial gas experts delivering safe, end-to-end solutions, investing in real, clean energy projects at scale, and driving the industry forward to generate a cleaner future.
At Air Products, we work in an environment where we put safety first, diversity is essential, inclusion is our culture, and each person knows they belong and matter. To learn more, visit About Air Products.