We are developing an Artificial Intelligence solution that utilises big data and cluster computing (Spark) to solve a problem within the Identity & Access Management (IAM) space. The Data Scientist will be expected to manage, architect (Analytics architecture only) and analyze big data in order to build data-driven insights and high impact data models for roles and entitlement data. The value chain they aid us in creating will help to address the following challenges: distilling information from multiple data sources, acquiring usable and robust data, evaluating the inherent value of data, and formulating methods to derive insights from post-modelled data. This will provide us with the ability to improve our solution’s capabilities over time. Preferred Data Scientist skills would include an understanding of / experience with the following: Python (Scikit Learn, Numpy, Scipy) implementations; Machine Learning techniques; Applied Statistics e.g. utilisation of Association Rules, Naïve Bayes, etc.; Data manipulation and sorting i.e. Features engineering. Advantageous but not essential knowledge would be: Spark and SparkML; Linear algebra for vector calculations; Docker, RestAPI's, Unit Testing.
Required Skills
Location: Role is entirely remote • Years of experience: 3+ • Degree or specialized training is required/preferred: • Top three skills required: Big Data, Scala, Python
Required Experience
Location: Role is entirely remote • Years of experience: 3+ • Degree or specialized training is required/preferred: • Top three skills required: Big Data, Scala, Python