This job listing has expired and the position may no longer be open for hire.

Analytic Data Engineer, Principal at Blue Shield of CA in Rancho Cordova, California

Posted in Other 26 days ago.

Type: Full Time

Job Description:

At Blue Shield of California we are parents, leaders, students, visionaries, heroes, and providers. Every day we come together, striving to fulfill our mission: to ensure all Californians have access to high-quality health care at a sustainably affordable price. For more than 80 years, Blue Shield of California has been dedicated to transforming health care by making it more accessible, cost-effective, and customer-centric. We are a not-for-profit, independent member of the Blue Cross Blue Shield Association with 6,800 employees, more than $20 billion in annual revenue and 4.3 million members. The company has contributed more than $500 million to Blue Shield of California Foundation since 2002 to have a positive impact on California communities. Blue Shield of California is headquartered in Oakland, California with 18 additional locations including Sacramento, Los Angeles, and San Diego. We're excited to share that Blue Shield of California has received awards and recognition for LGBT diversity, quality improvement, most influential women in corporate America, the Bay Area's top companies in volunteering & giving, and as one of the world's most ethical companies. Here at Blue Shield of California, we're striving to make a positive change across our industry and the communities we live in. Join us!

The Advanced Analytics team at Blue Shield of California is seeking a Data Engineer to join us in transforming healthcare through innovation. We partner with a wide array of stakeholders to build impactful solutions for our members and providers. We are awed by the complex problems before us, and undaunted by the challenge. If there is data and a problem statement, we are building a solution.

The Data Engineer builds, manages and optimizes production data pipelines supporting key data and analytics initiatives. You will be the cornerstone of a robust data stack, integrating diverse sources and serving up data to power machine learning solutions and self-service analytics. You will mine the operational details of adjudicating claims and build a real-time pipeline to feed an AI-enabled recommendation engine. You will follow our members' journey through the healthcare landscape and blend data points across a broad spectrum of applications to create multidimensional records serving interventions. You will scrape surveys and munge data streams to extract measurable insights. From the minutiae of desk-level procedures to macro population trends, there are no problems too big or too small for your keen eyes and open mind.

Responsibilities:

Build, develop, implement and execute extensible, reusable data pipelines consisting of multiple acquisition sources and integration into use-case-driven endpoints.

Maintain and optimize workloads in various deployment stages and data environments to ensure optimal performance as data volume and variety increase.

Lead design activities in partnership with data scientists, analysts and product owners to translate functional requirements into technical specifications for scalable data pipelines.

Oversee management of analytical data assets for exploratory and early-stage analytic usage patterns, and develop recommendations to integrate with production pipelines.

Orchestrate data pipelines using modern tools and techniques to automate repeatable ETL processes, minimize error-prone dependencies and improve integrity of published data assets.

Collaborate with internal IT teams to troubleshoot incidents and coordinate resolutions to minimize disruption of analytic applications.

Monitor data consumption patterns and develop enhancements to ensure pipelines adapt to evolving data schema and analytic use cases.

Collaborate with data consumers to define and catalog use cases to ensure adherence to data governance standards and ethical/legal guidelines.

Qualifications:

A college degree or equivalent in computer science, data management, information systems or a related quantitative field.

A minimum of 10 years of experience in data management disciplines.

Industry experience in the health care sector preferred.

High proficiency working with large, heterogeneous datasets in building/optimizing data pipelines using ELT, data replication, API access, data virtualization, stream data integration, and emerging technologies.

High proficiency with relational databases (Netezza, Oracle, MS SQL), NoSQL databases (MongoDB, Cassandra), and distributed computing platforms.

External hires must pass a background check/drug screen. Qualified applicants with arrest records and/or conviction records will be considered for employment in a manner consistent with Federal, State and local laws, including but not limited to the San Francisco Fair Chance Ordinance. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, protected veteran status or disability status and any other classification protected by Federal, State and local laws.