USNLX Diversity Jobs

USNLX Diversity Careers

Job Information

The Vanguard Group Data Engineer, Specialist in Malvern, Pennsylvania

Data Engineer, Specialist (The Vanguard Group / Malvern, PA) -- Apply technological expertise to address data challenges that business and clients face, using insights to turn challenges into opportunities; design and implement data pipelines that leverage key AWS big data services, including Lambda, Glue, Lake Formation, API Gateway, Redshift, Athena, Elasticsearch Service, and Spark EMR; support the Business in understanding and interpretation of marketing data and help to derive approaches for how data can be used to support effective marketing operations; identify, design, and implement internal process improvements: automating manual processes, optimizing data pipeline performance, re-designing infrastructure for greater scalability and access to information; build/develop ETL (Extract / Transform / Load) processes, design database systems, and develop tools for real-time and offline analytic processing using Python, PySpark, SQL Frameworks; build and maintain scalable data solutions in cloud environment and drive business improvements with innovative Big Data and AWS technologies including Sage maker, S3, EMR, IAM, EC2, Cloud Watch, Cloud Trail, Event Bridge; lead all phases of solution development; explain technical considerations at related meetings, including those with internal clients and less experienced team members; translate business specifications into design specifications and code; responsible for writing complex programs, ad hoc queries, and reports; ensure all code is well structured, includes sufficient documentation, and is easy to maintain and reuse. Requires Master's in Computer Science, Computer Information Systems, Computer Engineering, or closely related IT field and two years of experience in job offered or in IT positions including Software Engineer and/or AWS DevOps Cloud Engineer. Background in education, training or experience must include HDFS, Hadoop Map Reduce, Hive, Presto, Sqoop, Spark, Yarn; AWS EMR, EC2, IAM, S3, Service Catalog, GLUE, Redshift, Data Sync, CloudFormation, Lambda, Athena; Python, Scala Programming; CI/CD, BitBucket, Bamboo; Database Systems, Data Warehousing and ETL tools; Distributed Computing and Massive Parallel Processing. Company operates on hybrid model with three days in office and work-from-home available two days.

DirectEmployers