Job Description
Data Scientist
Job ID 1230-1
Mathematica applies expertise at the intersection of data, methods, policy, and practice to improve well-being around the world. We collaborate closely with public- and private-sector partners to translate big questions into deep insights that improve programs, refine strategies, and enhance understanding using data science and analytics. Our work yields actionable information to guide decisions in wide-ranging policy areas, from health, education, early childhood, and family support to nutrition, employment, disability, and international development. Mathematica offers our employees competitive salaries and a comprehensive benefits package, as well as the advantages of being 100 percent employee owned. As an employee stock owner, you will experience the financial benefits of ESOP holdings that have increased in tandem with the company’s growth and financial strength. You will also be part of an independent, employee-owned firm that is able to define and further our mission, enhance our quality and accountability, and steadily grow our financial strength. Read more about our benefits here: https://www.mathematica.org/career-opportunities/benefits-at-a-glance.
We are looking for junior data scientists to lead and support data processing and analysis tasks, such as building data pipelines, monitoring data quality, developing documentation, applying statistical and data science methods, and creating data visualizations. Our junior data scientists underpin our company's core offerings in program improvement and data science, which yield crucial evidence and information for policy and decision makers. Junior data scientists are on the data scientist career track at Mathematica.
This position focuses on health policy, which includes projects such as:
- Monitoring the impacts of an alternative payment model for primary care in terms of care quality, cost, and health outcomes for diverse beneficiaries, using claims from thousands of primary care practices across the country and predicting future hospital costs and behavior
- Developing and testing how claims and survey data from federal and state-level programs could be used to measure patients’ experience of care, quality of life, care coordination, and long-term outcomes for beneficiaries enrolled in both Medicare and Medicaid
- Creating an interactive data visualization tool to help local policy and decision makers understand how social determinants of health are related to health outcomes in their county, using open source data from public agencies and non-profits
Specifically, our junior data scientists contribute to team-based projects by:
- Co-developing analysis plans with a data scientist
- Leading and managing small teams and tasks with oversight from a data scientist
- Writing and maintaining programming systems in languages such as Python and R to obtain, combine, and transform datasets on cloud, internal, and client servers
- Developing and maintaining documentation
- Implementing quality assurance practices, such as version control and testing
- Conducting analysis and communicating results, such as descriptive statistics, data visualizations, and model diagnostics, to both internal teams and clients
Qualifications:
- Master's degree, or bootcamp, with an excellent academic record, including courses in subjects such as statistics, data science, data analytics, mathematics, operations research, computer science, and/or social science
- Demonstrated interest and/or experience using programming and data science and/or statistics to contribute to projects with a policy/social impact in academic and/or professional settings
- At least three years of experience performing data cleaning and analysis using programming languages such as R or Python in an academic, extracurricular, or professional environment
- Experience executing data science and statistics techniques including machine learning algorithms, network analysis, or natural language processing
- Ability and desire to work both independently and as part of an interdisciplinary team that may be geographically dispersed. This includes being able to learn from resources such as self-guided tutorials, package documentation, and academic articles, and a willingness to keep learning and to contribute to knowledge sharing with team members.
- Experience with reproducible research principles, version control, interactive visualizations, tidyverse, R Shiny, R Markdown, pandas, and/or scikit-learn
- Desired but not required: experience with healthcare datasets (for example, Medicare or Medicaid claims and enrollment data), Bayesian statistics experience (especially with Stan), production-quality machine learning applications, cloud computing environments, and algorithmic fairness and ethics
To apply, please submit:
- A cover letter
- A resume
- Salary expectations
- A programming sample in one of the following languages: Python, R, or Julia (can be provided as a file attachment or a GitHub repository link)
Various federal agencies with whom we contract require that staff successfully undergo a background investigation or security clearance as a condition of working on the project. If you are assigned to such a project, you will be required to obtain the requisite security clearance.
This position offers an anticipated annual base salary range of $60,000 - $121,000. This position may be eligible for a discretionary bonus based on company and individual performance.
Available locations: Washington, DC; Princeton, NJ; Cambridge, MA; Ann Arbor, MI; Woodlawn, MD; Oakland, CA.
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, disability, or any other federal, state, or local protected class.