Co-op: Semantics in Data Management

Biogen

Cambridge, MA
Internship
Paid
  • Responsibilities

    Job Description

    This position is a 6-month Co-op running from January to June 2022.

    Scientific and Clinical Data management and analysis are becoming ever more complex in the era of “Big Data”, “Machine Learning” and “Artificial Intelligence”. Modern approaches in Data Science, Informatics and Natural Language Processing are changing how we approach Data. DANTE stands for “Data, ANalytics and TEchnology”, and the DANTE team at Biogen initiates programs aimed at evolving an R&D Data ecosystem that facilitates inquiry and maximizes the return on R&D data. We aim to create a connected ‘Data Fabric’ that supports interconnected, richly annotated Data through a strategy built on FAIR data principles, unified Semantics, and unfettered access. DANTE focuses on Data Management principles and Semantics management, implementing projects that emphasize the use of Ontologies to promote FAIR data. We focus on Ontology structure, curation and engineering, the technical deployment and integration of Ontologies, and the downstream impact and usability of unified vocabularies in Data capture and analysis systems.

    POSITION DESCRIPTION 

    Ontologies allow for the consistent development and usage of shared vocabularies. When used to support Data management, Ontologies provide an accessible and efficient way to promote FAIR data principles and ultimately yield ‘machine learnable’, analytics-ready data. Representing a lexicon as an Ontology begins with deciding what to include and how to resolve terms and definitions in a way that supports understanding across an organization. Two scientists performing similar experiments often label things differently; in such cases, proposing an enterprise ‘preferred term’ can increase the understanding and usability of our data.

    You will participate in harvesting typical vocabulary and stylistic representations from different scientific workflows by interviewing and collaborating with scientists and experts. You will learn to use specialized software tools to resolve and propose terms, synonyms, and definitions.  

  • Qualifications

    You have a background in Biology, Chemistry, Computer Science or Linguistics, and an interest in Drug Discovery and Pharma Sciences. You may have some exposure to or experience with Genomics and/or Clinical information. First and foremost, you are interested in how language can bring data together and eager to learn about using Ontologies in Science and Health data.

    To participate in the Biogen Internship Program, students must meet the following eligibility criteria: 

    • Legal authorization to work in the U.S. 
    • At least 18 years of age prior to the scheduled start date 
    • Current enrollment in an accredited college or university

     

    EDUCATION 

    Bachelor's or Master's level student majoring in Biology, Chemistry, or Computer Science preferred

    Additional Information

    All your information will be kept confidential according to EEO guidelines.

  • Industry
    Manufacturing