Data Scientist I at Elsevier in Raleigh, North Carolina


Job Description:

Are you passionate about data? Would you like to use your superpowers for good and make the world a more just place? We are looking for an aspiring data scientist who is devoted to building the most accurate legal data. If you have experience using, training and improving models, as well as presenting and verifying their outcomes, then we are looking for you!

Developing Artificial Intelligence (AI) and Advanced Analytics is the primary mission for LexisNexis, and we are seeing an acceleration of amazing AI work in the legal space. AI technology is evolving quickly, but AI and Advanced Analytics are only as good as the underlying data sets that power them. LexisNexis has millions of the best legal entities in the world! We are looking for an aspiring Data Scientist who is devoted to creating a world-class authority system that has metrics, automation and artificial intelligence built in.


The Data Scientist is responsible for defining generic extraction of data from the web pages we process, which form the core of some of our authorities. The data will be extracted from a range of informational structures such as paragraphs, lists and sentence fragments. The Data Scientist will also work with machine learning to provide more accurate fuzzy matching, as well as data cleansing tasks such as finding duplicates in the authority sets. The accuracy of our primary data entities, such as courts, judges, law firms, attorneys and companies, is vital to our success. The successful candidate must be able to quickly become familiar with LexisNexis' diverse data resources and to leverage the latest NLP technologies to define specific models and logic for legal and corporate entities.


Responsibilities:

>Develop innovative strategies for extracting information from web pages for legal and corporate entities

>Manage, validate and deploy custom extraction models and machine learning models

>Integrate models into existing extraction and processing logic

>Write queries and reports to validate extraction models and processing models

>Perform cost analysis to determine optimal models that balance model success with processing costs

>Document all models and provide APIs to quickly test and validate models and model changes


Qualifications:

>3+ years of experience with AWS products

>In-depth familiarity with at least two NLP models

>In-depth familiarity with machine learning models and decision trees

>In-depth familiarity with SQL

>AWS Certification is a plus

>Expert Python programmer

>Ability to provide guidance to engineers, system engineers and other team members

>Documentation skills for processes and procedures

>Bachelor of Science Degree in Engineering, Mathematics, or Computer Science

LexisNexis, a division of RELX Group, is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. If a qualified individual with a disability or disabled veteran needs a reasonable accommodation to use or access our online system, that individual should contact 1.877.734.1938.
