Cummins Inc. Data Engineer - Quality Analytics in Columbus, Indiana
Data Engineer - Quality Analytics
Cummins is a place big enough to coach and develop a global workforce and create the world’s leading clean engine technology. We’re also small enough for you to find your fit and personal passion with a team of dependable, innovative thinkers who are developing their careers within a diverse, inclusive, empowering environment.
Cummins delivers reliable, durable, high-performing products to our global partners. Working in an innovative space, you’ll develop high-tech solutions that will fuel your advanced career skill set and empower you to own your career. Our integrated businesses demand the talents and creativity of individuals with a wide range of skills and experience.
This is an exciting opportunity in Columbus, Indiana for a Data Engineer - Quality Analytics.
Your impact will happen in these and other ways:
Lead projects for design, development and maintenance of a data and analytics platform.
Effectively and efficiently process, store and make data available to analysts and other consumers.
Work with key business stakeholders, IT experts and subject-matter experts to plan, design and deliver optimal analytics and data science solutions.
Work on one or many product teams at a time.
Design and automate deployment of our distributed system for ingesting and transforming data from various types of sources (relational, event-based, unstructured).
Design and implement a framework to continuously monitor and troubleshoot data quality and data integrity issues.
Implement data governance processes and methods for managing metadata, data access and data retention for internal and external users.
Design and provide guidance on building reliable, efficient, scalable and high-quality data pipelines with monitoring and alert mechanisms that combine a variety of sources using ETL/ELT tools or scripting languages.
Design and implement physical data models to define the database structure.
Optimize database performance through efficient indexing and table relationships.
Participate in optimizing, testing, and troubleshooting of data pipelines.
Design, develop and operate large-scale data storage and processing solutions using different distributed and cloud-based platforms for storing data (e.g. Data Lakes, Hadoop, HBase, Cassandra, MongoDB, Accumulo, DynamoDB, others).
Use innovative and modern tools, techniques and architectures to partially or completely automate the most common, repeatable and tedious data preparation and integration tasks in order to minimize manual and error-prone processes and improve productivity.
Assist with renovating the data management infrastructure to drive automation in data integration and management.
Ensure the timeliness and success of critical analytics initiatives by using agile development practices such as DevOps, Scrum and Kanban.
Coach and develop less experienced team members.
Data Extraction - Performs data extract-transform-load (ETL) activities from a variety of sources and transforms the data for consumption by various downstream applications and users using appropriate tools and technologies.
Solution Documentation - Documents information and solutions based on knowledge gained as part of product development activities; communicates to stakeholders with the goal of enabling improved productivity and effective knowledge transfer to others who were not originally part of the initial learning.
Quality Assurance Metrics - Applies the science of measurement to assess whether a solution meets its intended outcomes using the Function's defined best practices, including the Systems Development Life Cycle standards, tools, metrics and key performance indicators, to deliver a quality product.
Solution Validation Testing - Validates a configuration item change or solution using the Function's defined best practices, including the Systems Development Life Cycle (SDLC) standards, tools and metrics, to ensure that it works as designed and meets customer requirements.
System Requirements Engineering - Uses appropriate methods and tools to translate stakeholder needs into verifiable requirements to which designs are developed; establishes acceptance criteria for the system of interest through analysis, allocation and negotiation; tracks the status of requirements throughout the system lifecycle; assesses the impact of changes to system requirements on project scope, schedule, and resources; creates and maintains information linkages to related artifacts.
Problem Solving - Solves problems using a systematic analysis process by leveraging industry standard methodologies to create problem traceability and protect the customer; determines the assignable cause; implements robust, data-based solutions; identifies the systemic root causes and recommended actions to prevent problem reoccurrence.
Data Quality - Identifies, understands and corrects flaws in data to support effective information governance across operational business processes and decision making.
Programming - Creates, writes and tests computer code, test scripts, and build scripts using algorithmic analysis and design, industry standards and tools, version control, and build and test automation to meet business, technical, security, governance and compliance requirements.
Customer Focus - Building strong customer relationships and delivering customer-centric solutions.
Decision Quality - Making good and timely decisions that keep the organization moving forward.
Collaborates - Building partnerships and working collaboratively with others to meet shared objectives.
Communicates Effectively - Developing and delivering multi-mode communications that convey a clear understanding of the unique needs of different audiences.
Education, Licenses, Certifications
College, university or equivalent degree in a relevant technical discipline preferred, or equivalent work experience.
Intermediate experience in a relevant discipline area is required. Knowledge of the latest technologies and trends in data engineering is highly preferred and includes:
Familiarity with analyzing complex business systems, industry requirements, and/or data regulations
Background in processing and managing large data sets
Design and development for a Big Data platform using open source and third-party tools
Spark, Scala/Java, MapReduce, Hive, HBase and Kafka, or equivalent college coursework
SQL query language
Clustered compute cloud-based implementation experience
Experience developing applications requiring large file movement for a Cloud-based environment and other data extraction tools and methods from a variety of sources
Experience in building analytical solutions
Intermediate experience in the following is preferred:
Experience with IoT technology
Experience in Agile software development
Experience in data preparation for Data Science purposes
Working in cross-functional teams with Data Scientists, Product Owners and Quality SMEs
At Cummins, we are an equal opportunity and affirmative action employer dedicated to diversity in the workplace. Our policy is to provide equal employment opportunities to all qualified persons without regard to race, gender, color, disability, national origin, age, religion, union affiliation, sexual orientation, veteran status, citizenship, gender identity and/or expression, or other status protected by law. Cummins validates right to work using E-Verify.
Cummins will provide the Social Security Administration (SSA) and, if necessary, the Department of Homeland Security (DHS), with information from each new employee’s Form I-9 to confirm work authorization. To learn more about E-Verify, including your rights and responsibilities, please visit www.dhs.gov/E-Verify.
Ready to think beyond your desk? Apply for this opportunity and start your career with Cummins today.
Not ready to apply but want to learn more? Join our Talent Community to get the inside track on great jobs and confidentially connect to our recruiting team:
Job: SYSTEMS/INFORMATION TECHNOLOGY
Primary Location: United States-Indiana-Columbus-US, IN, Columbus, 301 Irwin Building
Job Type: Experienced - Exempt / Office
Recruitment Job Type: Exempt - Experienced
Job Posting: Jul 10, 2020, 1:05:57 AM
Unposting Date: Ongoing
Req ID: 2000011S