Skip to content

General Information

Req ID
R020105
State
New York
Work Type
Remote

Description and Requirements

The Data Engineer is responsible for leading and implementing best-in-class data management strategies and practices through various forms of integration; data acquisition/ingestion, data cleansing/refinements, data transformations/conversions, data migrations, data purging, and back-ups. The role will work with a variety of users and stakeholders to understand data requirements and support the data architecture to translate into the data management practice at Healthfirst. The Data Engineer is to author artifacts defining standards and definitions for storing, processing and moving data, including associated processes and business rules. Additionally, the Data Engineer will map the details within these artifacts to business processes, non-functional characteristics, qualitative criteria and technical enablement. The role is responsible to be constantly thinking through the needs of the business to support efficient and error-free processes. The Data Engineer is responsible for finding trends in datasets and developing workflows and algorithms to assist in highlighting strategic raw data as useful to the enterprise. The role is also responsible for creating data acquisition strategy and develop dataset processes in a robust manner according to best practices of standardization. The role will support the Data Science team through existing pipelines and processes to integrate towards best-in-class data management practices seamlessly.
  • Designs and implements standardized data management procedures around data staging, data ingestion, data preparation, data provisioning, and data destruction (e.g., scripts, programs, automation, etc.)
  • Ensures quality of technical solutions as data moves across multiple zones and environments
  • Provides insight into the changing data environment, data processing, data storage and utilization requirements for the company, and offer suggestions for solutions
  • Ensures managed analytic assets to support the company’s strategic goals by creating and verifying data acquisition requirements and strategy
  • Develops, constructs, tests, and maintains architectures
  • Aligns architecture with business requirements and use programming language and tools
  • Identifies ways to improve data reliability, efficiency, and quality
  • Conducts research for industry and business questions
  • Deploys sophisticated analytics programs, machine learning, and statistical methods to efficiently implement solutions
  • Prepares data for predictive and prescriptive modeling and find hidden patterns using data; in support of the Data Science team
  • Uses data to discover tasks that can be automated
  • Creates data monitoring capabilities for each business process and works with data consumers on updates
  • Aligns data architecture to the solution architecture; contributes to overall solution architecture
  • Develops patterns for standardizing the environment technology stack
  • Helps maintain the integrity and security of company data

Minimum Qualifications:

  • Bachelor’s Degree in Computer Engineering or related field
  • 8+ years of experience in data engineering
  • 5+ years of experience in data programming languages, such as java, python or pyspark
  • 5+ years of experience working in a ‘Big Data’ ecosystem process data; includes file systems, data structures/data bases, , automation, security, messaging, movement, etc.
  • 5+ years of experience working in a production cloud infrastructure

Preferred Qualifications:

  • Proven track record of success directing the efforts of data engineers and data analysts with a deadline-driven and fast-paced environment
  • Hands-on experience in leading healthcare data transformation initiatives from on-premise to cloud deployment
  • Demonstrated experience working in an Agile environment as a Data Engineer
  • Hands-on work with AWS, including creating Redshift data structures, accessing them with Spectrum, and storing data in S3
  • Knowledge of SQL and multiple programming languages to optimize data processes and retrieval
  • Proven results using an analytical perspective to identify engineering patterns within complex strategies and ideas, and break them down into engineered code components
  • Experience developing, prototyping, and testing engineered processes, products or services
  • Proven ability to work in distributed systems
  • Proficiency with relational, graph and NoSQL databases
  • Must be able to develop creative solutions to problems
  • Demonstrates critical thinking skills with ability to communicate across functional departments to achieve desired outcomes
  • Excellent interpersonal skills with proven ability to influence with impact across functions and disciplines
  • Ability to work independently and as part of a team
  • Ability to manage multiple projects/deadlines, identifying the necessary steps, and moving forward through completion
  • Skilled in Microsoft Office; including PowerPoint, Word, Excel and Visio

Hiring Range*:

  • Greater New York City Area (NY, NJ, CT residents): $129,900 - $187,680

  • All Other Locations (within approved locations): $114,400 - $170,170

As a candidate for this position, your salary and related elements of compensation will be contingent upon your work experience, education, licenses and certifications, and any other factors Healthfirst deems pertinent to the hiring decision.

In addition to your salary, Healthfirst offers employees a full range of benefits such as, medical, dental and vision coverage, incentive and recognition programs, life insurance, and 401k contributions (all benefits are subject to eligibility requirements). Healthfirst believes in providing a competitive compensation and benefits package wherever its employees work and live.

*The hiring range is defined as the lowest and highest salaries that Healthfirst in “good faith” would pay to a new hire, or for a job promotion, or transfer into this role.

WE ARE AN EQUAL OPPORTUNITY EMPLOYER. Applicants and employees are considered for positions and are evaluated without regard to mental or physical disability, race, color, religion, gender, gender identity, sexual orientation, national origin, age, genetic information, military or veteran status, marital status, mental or physical disability or any other protected Federal, State/Province or Local status unrelated to the performance of the work involved.