Sep 13, 2019

Data and Model Operations Data Scientist, Retail Bank

  • Capital One
  • McLean, VA, USA
Full time Data Science Cloud

Job Description

This role within the Bank Data Sciences team is for individuals who are passionate about engineering excellence of modeling pipelines. The Deposit Forecasting and Pricing team are looking for an experienced feature pipeline and model implementation expert to support the next generation of models in Bank.

More about the role:

  • Productionizing feature pipelines

    • Set feature pipeline standards in the batch, stream, and real-time settings

    • Refactor feature pipelines for faster feature changes and updates

    • Implement data validation framework, and quality tests

  • Productionizing models

    • Implement model objects in cloud-based platforms

    • Develop low latency model scoring

    • Establish model promotion and configuration management mechanisms

    • Enable enhanced model & feature versioning

    • Enable system and integration testing for model deployment

  • Productionizing model monitoring

    • Integrate with alerts and notification mechanisms

    • Collaboratively develop model monitoring tools and reporting

  • Experience in modern distributed computing tools such as Hadoop, Spark, and H2O

  • You should know Python, Scala, or Mosel and are comfortable with working with multiple languages

Twenty years after Capital One was started it’s still led by its founder. Be ready to join a community of the smartest people you’ve ever met, who see the customer first, and want to use their data skills to make a difference.

 Basic Qualifications:
-Bachelor’s Degree plus 2 years of experience in data analytics, or Master’s Degree plus 1 year of experience in data analytics, or PhD
-At least 1 year of experience in open source programming languages for large scale data analysis
-At least 1 year of experience with machine learning
-At least 1 year of experience with relational databases

Preferred Qualifications:

-Master’s Degree or PhD

-At least 1-year experience working with AWS services (emr, lambda, ec2)
-At least 3 years’ experience in Python or Scala
-At least 3 years’ experience with machine learning (sci-kit-learn, tensorflow)
-At least 3 years’ experience with SQL

-At least 1 year experience working with Mosel language

-At least 1-year experience with streaming frameworks in Spark Streaming or Apache Flink

-At least 3 years experience with Apache Spark

-At least 1-year experience with containerization and cluster management methods (e.g. Docker, Kubernetes)

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.


Data Scientist