Sorry, this listing is no longer accepting applications. Don’t worry, we have more awesome opportunities and internships for you.

Full-Stack Python Developer *Remote* Web Scraping and Data Enrichment

Go RIddler

Full-Stack Python Developer *Remote* Web Scraping and Data Enrichment

San Diego, CA
Full Time
Paid
  • Responsibilities

    Job Description

    Job Description

    About Us

    We’re revolutionizing lead generation by building a highly scalable, cost-effective system to scrape, validate, enrich, and segment niche audiences from social platforms and public sources.

    With 326 forms , each generating an average audience list of 75,000 leads , we offer high-quality leads for $1 per lead. Our commission-based sales model rewards developers and team members with a 30% share of each sale , giving you a direct stake in our success.

    If you’re passionate about web scraping, data processing, and building systems that directly impact revenue, we want to hear from you!

    Role Overview

    We’re seeking a Python Developer to build a scalable lead generation system. This is a unique opportunity to work at the intersection of web scraping, data enrichment, and revenue-driving strategies. You’ll play a critical role in developing the tools that enable us to deliver millions of high-quality leads, ensuring compliance and scalability.

    Key Responsibilities

    1. Scraper Development :
    * Build custom scrapers for platforms like Facebook, Twitter, and Instagram using tools like Selenium, Playwright, and Scrapy.
    * Extract critical data like usernames, bios, and engagement metrics while adhering to compliance standards.
    
    1. Data Validation & Enrichment:
    * Validate phone numbers and emails with APIs like Twilio Lookup and ZeroBounce.
    * Enrich data with demographic and behavioral insights using APIs like HypeAuditor, Clearbit, or FullContact.
    
    1. Lead Management :
    * Process raw scraped data into validated and enriched datasets ready for sale.
    * Automate workflows for segmentation by niche, demographics, and engagement metrics.
    
    1. Revenue-Driven Focus :
    * Design algorithms to prioritize high-value leads based on engagement, demographics, and niche relevance.
    * Continuously optimize scraping efficiency to maximize lead generation potential.
    
    1. Compliance and Scalability :
    * Ensure all data collection complies with GDPR and CCPA guidelines.
    * Deploy scalable systems using AWS, Google Cloud, or Azure for parallel scraping and data processing.
    

    Required Skills and Qualifications

    1. Core Skills :
    * Expert in **Python** (5+ years).
    * Strong experience with scraping frameworks: **Selenium** , **Playwright** , **Scrapy** , or **Beautiful Soup**.
    * Hands-on knowledge of proxy rotation and CAPTCHA solving mechanisms (e.g., 2Captcha, ProxyMesh).
    
    1. Data Handling :
    * Proficient with Python libraries: **pandas** , **NumPy** , and **PySpark**.
    * Experience in validating and cleaning data with tools like Twilio and ZeroBounce.
    
    1. Enrichment and Analytics :
    * Familiar with APIs like **HypeAuditor** , **Clearbit** , or **FullContact**.
    * Strong understanding of data segmentation and ranking algorithms.
    
    1. Compliance and Privacy :
    * Deep knowledge of GDPR, CCPA, and ethical scraping practices.
    
    1. Cloud and Deployment :
    * Skilled in deploying scalable systems on **AWS** , **Google Cloud** , or **Azure**.
    * Proficient in **Docker** and orchestration tools like **Kubernetes**.
    

    What You’ll Earn

    • Base Salary : Competitive, based on experience.
    • Commission : Earn 30% of each sale. With a dataset of 326 forms , each with an average list of 75,000 leads , there is a significant earning potential. For example: * Selling 10% of a single form's list (7,500 leads) generates $7,500. * Your 30% commission equals $2,250 for just one partial sale. We received 326 forms in 28 days. The bottleneck is our previous coder.

    Preferred Skills

    • Experience with Tweepy (Twitter API) and Facebook Graph API.
    • Knowledge of Apache Airflow or similar workflow automation tools.
    • Familiarity with ranking systems and machine learning.

    Benefits

    • Flexible remote work options.
    • Uncapped commission structure with a direct impact on earnings.
    • Access to cutting-edge tools and technology.
    • Opportunity to work on a revenue-driving project with a significant market impact.

    How to Apply

    Follow instructions on ZipRecreuiter

    Company Description

    Goriddler.com is a data enrichment, lead provider to verified businesses. Mainly coaches and marketing agencies.

    Company Description

    Goriddler.com is a data enrichment, lead provider to verified businesses. Mainly coaches and marketing agencies.