THE MISSION:
Our daily fight against cyber bad guys requires us to collect and analyze
a lot of data… a LOT of data! And as our customer base continues its
rapid growth, we need faster and more robust tools to help us and our
customers make the best decisions possible.
With your knowledge of Hadoop and Big Data technologies, you will add
your tool-building superpowers to a small team tasked with building out
a DevOps automation environment, one that will step up our Business
Intelligence game and help us protect our customers from cyber intruders.
We offer the chance to be part of an important mission: ending breaches
and protecting our digital way of life. If you are a motivated,
intelligent, creative, and hardworking individual, then this job is for
you!
THE JOB:
- As a Big Data Engineer, you will be an integral member of our Big
  Data & Analytics team, responsible for the design and development of
  our data platform
- Partner with data analysts, product owners, and data scientists to
  better understand requirements, find bottlenecks, and identify
  resolutions
- You will be the subject-matter expert (SME) for all things ‘Big Data’
  and will mentor other team members
- Design and develop architectural models for scalable data processing
  and scalable data storage
- Build data pipelines and ETL processes over heterogeneous sources
- You will build data ingestion from various source systems into Hadoop
  using Kafka, Flume, Sqoop, Spark Streaming, etc.
- You will transform data using data-mapping and data-processing
  capabilities such as MapReduce and Spark SQL; a minimal sketch
  combining ingestion and transformation follows this list
- You will be responsible for ensuring that the platform goes through
  Continuous Integration (CI) and Continuous Deployment (CD) with
  DevOps automation
- Expand and grow data platform capabilities to solve new data problems
  and challenges
- Support Big Data and batch/real-time analytical solutions leveraging
  transformational technologies like Apache Beam; see the second sketch
  after this list
- You will research and assess open-source technologies and components,
  recommending them and integrating them into the design and
  implementation
- You will work with development and QA teams to design ingestion
  pipelines and integration APIs, and to provide Hadoop ecosystem
  services
THE SKILLS:
- 8+ years of experience with the Hadoop ecosystem and Big Data
technologies
- Ability to adapt conventional big-data frameworks and tools to the
  use cases required by the project
- Hands-on experience with the Hadoop ecosystem (HDFS, MapReduce,
  HBase, Hive, Impala, Spark, Kafka, Kudu, Solr)
- Experience building stream-processing systems using solutions such as
  Spark Streaming, Storm, or Flink; a minimal Flink sketch follows this
  list
- Experience with other open-source technologies like Druid,
  Elasticsearch, and Logstash is a plus
- Knowledge of design strategies for developing scalable, resilient,
  always-on data lakes
- Some knowledge of Agile (Scrum) development methodology is a plus
- Strong development and automation skills; must be very comfortable
  reading and writing Scala, Python, or Java code
- Excellent interpersonal and teamwork skills
- Can-do attitude toward problem solving, quality, and execution
- Bachelor of Science in Computer Science or equivalent
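
For candidates wondering what a stream-processing system looks like in
miniature, here is a minimal Flink sketch in Scala: a keyed, windowed
word count over a toy socket source (in production the source would more
likely be Kafka):

    import org.apache.flink.streaming.api.scala._
    import org.apache.flink.streaming.api.windowing.assigners.TumblingProcessingTimeWindows
    import org.apache.flink.streaming.api.windowing.time.Time

    object StreamingWordCount {
      def main(args: Array[String]): Unit = {
        val env = StreamExecutionEnvironment.getExecutionEnvironment

        val counts = env
          .socketTextStream("localhost", 9999)    // toy source for illustration
          .flatMap(_.toLowerCase.split("\\W+"))   // tokenize each line
          .filter(_.nonEmpty)
          .map(word => (word, 1))                 // pair each word with a count
          .keyBy(_._1)                            // partition the stream by word
          .window(TumblingProcessingTimeWindows.of(Time.seconds(10)))
          .sum(1)                                 // sum counts within each window

        counts.print()
        env.execute("streaming-word-count")
      }
    }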
Learn more about Palo Alto Networks and check out our fast facts.