Sorry, this listing is no longer accepting applications. Don’t worry, we have more awesome opportunities and internships for you.

Data Scientist Internship - Natural Language Processing

Cambia Group

Data Scientist Internship - Natural Language Processing

Kenmore, WA
Internship
Paid
  • Responsibilities

    Data Scientist Internship - Natural Language Processing Cambia Health 70 reviews - Seattle, WA Internship Overview Data Scientist Internship - Natural Language Processing Seattle, WA This internship position is scheduled to begin in May/June 2018 Responsibilities & Requirements Cambias Data and Technology Solutions Department delivers innovative data and technology products, services, and solutions that will help drive Cambia to its 2020 vision of person-focused health care transformation. Looking for a passionate, talented and inventive Data Scientist intern to help build industry-leading speech and language solutions. Together with a highly multi-disciplinary team of scientists, engineers, strategic partners and subject domain experts, you will work on building a real product with natural language processing and machine learning at its core. Essential Function of the NLP Data Scientist Internship: * Utilize statistical natural language processing to mine unstructured data and create insights * Build and optimize cutting-edge natural language understanding systems such as conversational agents (chatbots) * Build core in-house NLP components and analytical tools such as document clustering, topic analysis, text classification, named entity recognition, sentiment analysis, and part-of-speech tagging methods for unstructured and semi-structured data * Identify and deploy existing machine learning, natural language processing, and information retrieval techniques and systems for knowledge management and discovery, such as using Electronic Medical Records (EMR) data, progress notes, and discharge summaries to identify admitting diagnosis, reason for consultation, clinical history, etc. * Identify ways to analyze consumers experiences from various communication channels and improve customer satisfaction * Cluster and analyze large amounts of user generated content and process data in large-scale environments in Amazon AWS such as EC2, EMR, MapReduce, and PySpark * Integrate the NLP pipeline into the production environment, ensure its scalability, and leverage knowledge gained into other projects, modeling, and work practices * Design novel algorithms for problem solving, which may include data cleaning, feature selection, statistical modeling, data clustering and classification, text processing, and other machine learning techniques, to solve complex healthcare problems presented by healthcare organizations * Collaborate with different functional teams within Cambia and externally to find solutions to problems in healthcare Key Qualifications and Experience: * Currently enrolled in an undergraduate or graduate degree program focused on Big Data, Computer Science, Data Analytics, Engineering, Math, Statistics, Science or related degree program (preference will be given to graduate students) * Candidates who have completed their degree in the last six months are also encouraged to apply * Strong analytic and problem-solving skills, including the ability to apply quantitative analysis techniques to business situations including forecasting, descriptive statistics, statistical inference, and multivariate modeling techniques * Experience with a good range of NLP techniques, including text processing, tokenization, POS-tagging, parsing, annotation, regular expressions, language modeling, etc. * Ability to develop prototypes by manipulating and analyzing complex, high-volume, high-dimensionality data from various sources * Expertise in producing, processing, evaluating, and utilizing unstructured/semi-structured data * Proficiency in open-source NLP and machine learning toolkits such Stanford CoreNLP, NLTK, Gensim, Mallet, OpenNLP, LingPipe, cTAKES, scikit-learn, NumPy, LIBSVM, MLlib, Theano, TensorFlow, etc. * Solid background in statistical learning and clustering techniques for NLP such as HMM, CRF, SVM, MaxEnt, LDA, LSI, and K-Means * Must have ML/NLP algorithm implementation experience as well as the ability to modify standard algorithms, e.g., change objective functions, work out the math, and implement * Practical ability to visualize data, communicate about data, and utilize data effectively * Proficiency in SQL relational databases and/or NoSQL databases * Ability to think creatively and to work well both as part of a team and as an individual contributor * Eager to learn new algorithms, new application areas, and new tools * Excellent oral and written communication skills to effectively interface and communicate with a broad array of internal and external contacts including leadership * Strong programming skills in at least one object oriented programming language, e.g., Java, Python, C++, Scala, etc. * Fluency with Linux/Unix

    • Required minimum cumulative undergraduate GPA of 3.0 The following skills/experiences/knowledge, a plus: * Expertise in one or more of the following areas: question answering, conversational agents (chatbots), entity/relation extraction, summarization, semantic search, information retrieval, and knowledge bases * Experience and/or motivation to work on modern deep learning approaches to NLP, such as word/paragraph embedding and representation learning * Basic knowledge of core linguistic concepts, such as phonology, morphology, syntax, and semantics * Experience with noisy and/or unstructured textual data, such as tweets and search queries * Knowledge of or experience in building production quality and large-scale deployment of applications related to natural language processing and machine learning * Experience with large-scale data analysis tools in a cloud environment, such as Spark, Hadoop, MapReduce, Hive, Pig, etc. * Experience with open-source search engines like ElasticSearch, Solr or Lucene * Demonstrated knowledge of health plan operations, medical terminologies/ontologies and/or clinical informatics and healthcare systems * Experience with text analysis in clinical and medical domain corpora like Electronic Medical Records (EMR) * Knowledge of REST APIs and visualization tools, such as HTML, CSS, JS, and D3.js * General software development skills (source code management, debugging, testing, deployment, etc.) * Publication in NLP/IR academic conferences/journals or industrial circles, such as ACL, EMNLP, NAACL, EACL, COLING, SIGIR, WWW, etc. About Us At Cambia, we advocate for transforming the health care system. You arent satisfied with the status quo and neither are we. We're looking for individuals who are as passionate as we are about transforming the way people experience health care. We offer a competitive salary and a generous benefits package. We are an equal opportunity employer dedicated to workforce diversity and a drug and tobacco-free workplace. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, age, sex, sexual orientation, gender identity, disability, protected veteran status or any other status protected by law. A drug screen and background check is required. Cambias portfolio of companies spans health care information technology and software development; retail health care; health insurance plans that carry the Blue Cross and Blue Shield brands; pharmacy benefit management; life, disability, dental, vision and other lines of protection; alternative solutions to health care access; and free-standing health and wellness solutions. We have nearly a century of experience in developing and providing health solutions to serve our members. We had our beginnings in the logging communities of the Pacific Northwest as innovators in helping workers afford health care. That pioneering spirit has kept us at the forefront as we build new avenues to improve access to and quality of health care for the future. 9 hours ago - save job - original job Apply On Company Site Other jobs you may like Data Scientist Brillio - Seattle, WA 17 hours ago Data Scientist Amazon.com - Seattle, WA 1 day ago Data Scientist Launch Consulting Group - Bellevue, WA 21 hours ago Easily apply Senior Data Scientist Indeed - Seattle, WA 2 hours ago Data Scientist Klein Hersh International - Seattle, WA 3 days ago * Data Scientist Internship jobs in Seattle, WA * Jobs at Cambia Health in Seattle, WA * Data Scientist Internship salaries in Seattle, WA Cambia Health Cambia Health 70 reviews Cambia Health Solutions, headquartered in Portland, Oregon, is a health solutions company dedicated to transforming health care by creating... Let employers find you Thousands of employers search for candidates on Indeed Upload Your Resume
  • Industry
    Management Consulting