Qualifications
PRIMARY: Spark, Python, Data Engineering, Data Science
- MS/BS degree in computer science or a related discipline
- 6+ years’ experience in large-scale software development
- 1+ year’s experience with Hadoop
- Strong skills in Java, Python, shell scripting, and SQL
- Strong development skills around Hadoop, Spark, MapReduce, and Hive
- Strong understanding of Hadoop internals
- Good understanding of file formats such as JSON, Parquet, and Avro
- Experience with relational databases such as Oracle
- Experience with performance/scalability tuning, algorithms, and computational complexity
- Experience (at least familiarity) with data warehousing, dimensional modeling, and ETL development
- Ability to read and interpret ERDs and relational database schemas
- Proven ability to work with cross-functional teams to deliver effective solutions
NICE TO HAVE:
- Experience with AWS components and services, particularly EMR, S3, and Lambda
- Experience with NoSQL technologies such as HBase, DynamoDB, and Cassandra
- Experience with messaging and complex event-processing systems such as Kafka and Storm
- Experience provisioning RESTful APIs to enable real-time data consumption
- Automated testing and continuous integration/continuous delivery (CI/CD)
- Scala
- Machine learning frameworks
- Statistical analysis with Python, R, or similar
Additional Information
All your information will be kept confidential according to EEO guidelines.
Only candidates who provide a first and last name will be considered.
Resumes should include a link to the candidate’s LinkedIn profile.
Green Card (GC), U.S. citizen (USC), or H-1B candidates only.
ONLY W-2 CANDIDATES WILL BE CONSIDERED; NO CORP-TO-CORP (C2C)!