Data Engineer

Sayari Labs
  • Location
    Washington, D.C.
  • Sector
    Commercial
  • Experience
    Early Career
  • Posted
    Nov 05

Position description

About Sayari Labs:

Sayari is a venture-backed and founder-led global corporate data provider and commercial intelligence platform, serving financial institutions, legal and advisory service providers, multinationals, journalists, and governments. Thousands of analysts and investigators in over 30 countries rely on our products to safely conduct cross-border trade, research front-page news stories, confidently enter new markets, and prevent financial crimes such as corruption and money laundering.

Our company culture is defined by a dedication to our mission of using open data to prevent illicit commercial and financial activity, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you. 

Position Description:

Sayari is looking for a junior to mid-level Data Engineer to join our Data team located in Washington, DC. The Data team is an integral part of our Engineering division and works closely with our Software & Product teams, as well as other key stakeholders across the business.

What You Will Do: 

We need your help to harvest and transform hundreds of millions of structured and unstructured records from over 150 countries and 30 languages into a dynamic and meaningful graph of entities and relationships. 

What You Will Need:

  • Experience developing and deploying containerized applications and services, including orchestration, particularly Kubernetes

  • Two plus years of experience developing in Python (e.g. pandas, NumPy, Scrapy) 

  • At least one year of experience working in a cloud environment (GCP/AWS) 

  • Experience with or interest in learning natural language processing (NLP) techniques 

  • Experience with or interest in learning Apache Spark and/or other components of the Hadoop ecosystem

  • Work with data and analytics experts to strive for greater functionality in our data systems

What We Would Like:

  • Two plus years of machine learning experience deploying on Apache Spark

  • Two plus years of NLP experience

  • Familiarization with open-source NLP packages, especially OpenNLP, OpenNMT, and fastText

Who You Are:

  • Strong process-oriented self-starter, with impeccable organizational skills

  • Experienced in supporting and working with cross-functional teams in a dynamic environment

  • Interested in learning graph databases 

  • Experienced in working with non-English data 

What We Offer: 

  • Limitless growth and learning opportunities 

  • A collaborative and positive culture - your team will be as smart and driven as you

  • A strong commitment to diversity, equity & inclusion 

  • Outstanding competitive compensation & comprehensive benefits package, including full healthcare coverage plans, commuter benefits, 401K matching, generous vacation, and a variety of other benefits. 

How to Apply: 

To apply, please email the documents listed below to [email protected] by COB Sunday, December 1, 2019. 

  • Resume & any salary requirement

  • Optional: Brief note to highlight relevant experience or skills.  

  • Optional: Share links to any public repos of your previous work. 

  • Two references, including at least one former supervisor. (Sayari will not contact references without first checking with the applicant.) 

Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.

Qualifications

What You Will Need:

  • Experience developing and deploying containerized applications and services, including orchestration, particularly Kubernetes

  • Two plus years of experience developing in Python (e.g. pandas, NumPy, Scrapy) 

  • At least one year of experience working in a cloud environment (GCP/AWS) 

  • Experience with or interest in learning natural language processing (NLP) techniques 

  • Experience with or interest in learning Apache Spark and/or other components of the Hadoop ecosystem

  • Work with data and analytics experts to strive for greater functionality in our data systems

What We Would Like:

  • Two plus years of machine learning experience deploying on Apache Spark

  • Two plus years of NLP experience

  • Familiarization with open-source NLP packages, especially OpenNLP, OpenNMT, and fastText

Who You Are:

  • Strong process-oriented self-starter, with impeccable organizational skills

  • Experienced in supporting and working with cross-functional teams in a dynamic environment

  • Interested in learning graph databases 

  • Experienced in working with non-English data 

Application instructions

How to Apply: 

To apply, please email the documents listed below to [email protected] by COB Sunday, December 1, 2019. 

  • Resume & any salary requirement

  • Optional: Brief note to highlight relevant experience or skills.  

  • Optional: Share links to any public repos of your previous work. 

  • Two references, including at least one former supervisor. (Sayari will not contact references without first checking with the applicant.) 

Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.

follow us on Twitter