Your responsibilities:
- Build, improve and extend NLP capabilities.
- Research and evaluate new/different approaches to NLP problems.
- Produce deliverable results and take them from development to production in collaboration with our engineers.
You must have:
- Strong Machine Learning background and familiarity with R, WEKA, RapidMiner, etc.
- Expertise in at least 3 of the following: Sentiment Analysis, Entity Extraction, Document Classification, Topic Modeling, Natural Language Understanding (NLU) and Natural Language Generation (NLG).
- Strong understanding of text pre-processing and normalization techniques, such as tokenization, POS tagging and parsing, and of how they work at a low level.
- Strong knowledge of Java or Python, and general software development skills (source code management, debugging, testing, deployment, etc.)
- Expertise in producing, processing, evaluating and utilizing training data.
- MSc/PhD in Computer Science, Computational Linguistics or a related field from a top-tier university
Great to have:
- Good understanding of linguistics and language as a phenomenon.
- Strong interest in, and knowledge of, Artificial Intelligence and its subfields.
- Experience with non-English NLP.
- Experience with Deep Learning and Word Embeddings.
- Experience with open-source NLP toolkits such as CoreNLP, OpenNLP, NLTK, gensim, LingPipe, Mallet, etc.
- Experience with open-source ML/math toolkits such as scikit-learn, MLlib, Theano, NumPy, etc.
- Experience with noisy and/or unstructured textual data (e.g. tweets)
- Published work in academic conferences/journals or industry circles.
To apply, include the following:
- Detailed resume and/or LinkedIn profile
- Links to any research/papers you have been an instrumental part of and are proud of
- Name of instructor/adviser, if any, along with a link to their profile
- Cover letter identifying your five favorite apps on your phone