Big Data Engineer (The Data Pipeline Innovator)

Company: Unreal Gigs
Location: San Francisco
Posted on: November 11, 2024

Job Description:

Are you passionate about handling massive datasets and building the infrastructure that enables complex data analysis and machine learning at scale? Do you excel in creating robust, scalable data pipelines that fuel data-driven decision-making? If you're ready to tackle the challenges of big data, our client has the perfect role for you. We're seeking a Big Data Engineer (aka The Data Pipeline Innovator) to architect and maintain high-performance data systems that empower analytics and support advanced data processing needs.

As a Big Data Engineer at our client, you'll collaborate with data scientists, analysts, and software engineers to design, implement, and optimize big data platforms. Your expertise in data engineering, distributed systems, and cloud infrastructure will be critical to ensuring that our data ecosystem is efficient, reliable, and scalable.

Key Responsibilities:

Design and Build Scalable Data Pipelines:

Architect and implement data pipelines for ETL processes using tools like Apache Spark, Kafka, and Hadoop. You'll create data workflows that handle high-volume, high-velocity data and ensure seamless integration across systems.

Optimize Big Data Storage and Processing:

Develop and manage data storage solutions (e.g., HDFS, S3, Cassandra) that are optimized for performance and cost-efficiency. You'll configure distributed processing systems to support efficient data retrieval and transformation.

Collaborate on Data Strategy and Integration:

Work closely with data scientists, analysts, and other engineers to align big data architecture with analytics goals. You'll ensure data availability and integrity across systems to support business objectives.

Implement Data Quality and Governance Standards:

Develop processes and tools to monitor data quality and enforce data governance policies. You'll ensure data is accurate, reliable, and secure through regular checks and validation processes.

Enhance Data Processing with Automation:

Use tools like Apache Airflow or AWS Glue to automate data workflows and reduce manual processing. You'll implement scripts and automation that streamline data handling and improve efficiency.

Monitor and Troubleshoot Data Systems:

Use monitoring tools to track system performance and address issues proactively. You'll troubleshoot and resolve any bottlenecks or failures to maintain optimal data processing capabilities.

Stay Updated on Big Data Trends and Technologies:

Keep up with advancements in big data technologies and tools. You'll integrate new techniques and platforms that align with business needs and promote innovation.

Required Skills:
        
Big Data Platform Proficiency: Extensive experience with big data technologies such as Apache Spark, Hadoop, Kafka, and Hive. You're skilled at handling high-volume data and distributed processing.

Data Pipeline and ETL Knowledge: Proven ability to design, build, and maintain ETL processes for massive datasets. You can handle both real-time and batch data processing requirements.

Programming and Scripting: Proficiency in programming languages like Python, Java, or Scala for data processing and automation. Experience with SQL for data querying and manipulation is essential.

Cloud Data Services Expertise: Familiarity with cloud platforms such as AWS, GCP, or Azure, including their big data and storage services (e.g., S3, BigQuery, Azure Data Lake).

Data Quality and Governance: Strong understanding of data quality standards and governance practices, with experience in implementing data validation and monitoring frameworks.

Educational Requirements:
        
Bachelor's or Master's degree in Computer Science, Data Engineering, Information Technology, or a related field. Equivalent experience in data engineering or big data management may be considered.

Certifications in big data or cloud technologies (e.g., Cloudera Certified Data Engineer, AWS Certified Big Data - Specialty, Google Professional Data Engineer) are a plus.

Experience Requirements:
        
5+ years of experience in data engineering, with at least 3 years focusing on big data technologies and high-scale data environments.

Experience in distributed systems and large-scale data storage management.

Familiarity with containerization (Docker, Kubernetes) for deploying data processing environments is advantageous.
        
Health and Wellness: Comprehensive medical, dental, and vision insurance plans with low co-pays and premiums.

Paid Time Off: Competitive vacation, sick leave, and 20 paid holidays per year.

Work-Life Balance: Flexible work schedules and telecommuting options.

Professional Development: Opportunities for training, certification reimbursement, and career advancement programs.

Wellness Programs: Access to wellness programs, including gym memberships, health screenings, and mental health resources.

Life and Disability Insurance: Life insurance and short-term/long-term disability coverage.

Employee Assistance Program (EAP): Confidential counseling and support services for personal and professional challenges.

Tuition Reimbursement: Financial assistance for continuing education and professional development.

Community Engagement: Opportunities to participate in community service and volunteer activities.

Recognition Programs: Employee recognition programs to celebrate achievements and milestones.


