Data Engineer (Python, SparkData Warehouse, ETL tools, Java)
Job Description
ITLE : DATA ENGINEER
LOCATION : LOS ANGELES, CA
NATURE OF EMPLOYMENT : FULL TIME PERMANENT OR CONTRACT IS ALSO FINE
JOB DESCRIPTION:
· Python and SQL (Ideal candidate will also have experience with R, Scala and Java)
· Proficiency with various operating systems
· Data warehousing and ETL tools
· Container technology such as Docker, RedShift
· Hadoop-based analytics (HBase, Hive)
· Parallel programing: Spark, PySpark, H2O, Dask, etc.
PREFERRED BUT NOT REQUIRED SKILLS:
· Real time analytics tools with streaming data such as Kafka, NiFi, Storm, Spark etc.
EXPERIENCE
· Excellent understanding of data analytics architecture & infrastructure.
· History of providing a support capability to data scientists and strong abilities with data manipulation and preparation.
· Extensive experience building data pipeline for data preparation.
· Knowledge of deep learning framework installation and GPU configuration.
· Experience with machine learning & deep learning model deployment.
· General understanding of machine learning and deep learning.
· Familiarity with data ingestion.
Qualifications
Python, Spark, pyspark, Data Warehouse, ETL tools, Java
Additional Information
R programming langauge is good to have skillset