Software Engineer - Data Infrastructure

CardinalHire

San Francisco, CA
Paid
  • Responsibilities

    Company Description
    Our startup builds a cybersecurity investigation platform that enables security teams to know the data that matters, ask the questions they need to ask, and get answers for their use cases at scale in seconds. We are a diverse group of natural language processing (NLP), machine learning, security, product design, and data visualization experts who believe that data-empowered people power the success of an organization. We have deep consumer technology roots that are reflected in our approach to product development.

    Required Skills: Python, Spark, Java, SQL, AWS, Go, Postgres, ETL, GCP, OLAP, Kafka, Hadoop

    Job Description
    We are looking for a Software Engineer - Data Infrastructure to help us get the right data into the right structure, turn it into information, and then process that information into something accessible to humans, so they can gain a level of awareness that wasn’t possible before. As an integral member of our technology team, you will engineer, operate, and optimize machine learning models, ETL pipelines, text search engines, OLAP datastores, and everything in between. Your work will enable our groundbreaking natural language platform and help us develop, scale, and deploy our applications in a variety of contexts. You’ll wear many hats, touch many parts of our system, and have a significant impact on our products.

    General Requirements

    • BS, MS, or PhD in Computer Science, Engineering, or a related discipline, or 3+ years of equivalent technology experience
    • Authorized to work in the United States
    • Use of engineering best practices: high code quality, automated testing, and reusable components
    • Secure cloud development experience on AWS, GCP, or equivalent
    • Expertise in writing efficient, complex database queries
    • Operational experience with OLAP datastores, text search engines, key-value stores, or distributed databases
    • Familiarity with complex database management, replication, and backup
    • 2+ years of software development experience (Go, Python, Java, or equivalent)

    Responsibilities

    • Leveraging existing open-source technologies such as Kafka, Hadoop, Druid, Spark, and PostgreSQL
    • Crafting data normalization models and rules
    • Developing data-driven APIs for machine learning applications
    • Scaling and maintaining cloud databases and data processing pipelines
    • Optimizing database queries for efficient real-time processing
    • Indexing and summarizing large datasets to enable high-performance analytics

    Benefits

    • Transit & parking FSA
    • Health care FSA
    • Short-term & long-term disability insurance
    • Life insurance
    • Dental & vision insurance
    • Health care insurance
    • Open vacation policy
