
Data Engineer - ETL & Data Pipelines
Paramount
San Francisco, CAThis is a Full Time Job
#WeAreParamount on a mission to unleash the power of content… you in?
We’ve got the brands, we’ve got the stars, we’ve got the power to achieve our mission to entertain the planet – now all we’re missing is… YOU! Becoming a part of Paramount means joining a team of passionate people who not only recognize the power of content but also enjoy a touch of fun and uniqueness. Together, we co-create moments that matter – both for our audiences and our employees – and aim to leave a positive mark on culture.
Summary
The Data Engineering team is hiring a Data Engineer – Data Pipeline & ETL. You will help build and maintain scalable data platforms and ETL/ELT pipelines in a fast-moving environment. In this role, you will build and support batch and real-time data systems powering analytics, ML, and AI applications. You will also grow your expertise in modern data architecture and cloud-native best practices.
Key Responsibilities
Build and Maintain Scalable Data Pipelines
• Design, develop, and maintain scalable batch and streaming data pipelines for large-scale structured and unstructured datasets.
• Build robust ETL/ELT frameworks supporting analytics, BI, experimentation, and machine learning use cases.
• Optimize pipelines for performance, reliability, scalability, and cost efficiency.
• Implement advanced ingestion patterns including CDC, incremental loads, and event-driven processing.
Data Modeling & Data Warehouse Architecture
• Design scalable, dimensional, and hybrid data models optimized for analytics and ML use cases.
• Develop reusable transformation layers (semantic layers) that serve BI, ML, and AI applications.
• Write optimized, production-grade SQL for large-scale analytics workloads.
• Contribute to query optimization, indexing, partitioning, and performance tuning across distributed systems and cloud warehouses.
Modern Data Pipeline Development
• Build and maintain modular data components following established framework patterns.
• Contribute to architectural decisions across streaming systems, data lakes, and warehouses.
Data Quality, Governance & Observability
• Implement automated data validation, anomaly detection, and monitoring frameworks.
• Establish data lineage and metadata standards to support reproducibility in ML workflows.
• Enforce governance, privacy, and security best practices, particularly for sensitive AI datasets.
• Ensure responsible AI data usage and compliance standards.
Required Technical Skills
Advanced Data Pipeline & ETL/ELT Expertise
• 2–4 years of experience building and scaling ETL/ELT pipelines in production environments.
• Experience with workflow orchestration tools such as Airflow, Composer, or similar platforms.
• Strong understanding of distributed data processing concepts.
SQL & Data Modeling for Analytics & ML
• Expert-level SQL skills for large-scale transformation and analytics.
• Experience designing scalable warehouse schemas and ML-ready data layers.
• Strong experience optimizing complex queries across multi-terabyte datasets.
Programming & ML Data Integration
• Proficiency in Python (or similar language) for data processing and ML pipeline integration.
• Experience with distributed processing frameworks such as Spark.
• Familiarity integrating data pipelines with ML platforms such as Vertex AI (preferred), Databricks ML, or equivalent.
Streaming & Event-Driven Systems
• Experience building real-time data pipelines using Kafka, Pub/Sub, or similar technologies.
• Understanding of feature streaming, low-latency data processing, and event-driven architectures.
• Ability to architect and build real-time dashboards using Superset.
Cloud & Modern AI Data Platforms
• Experience designing cloud-native data architectures (GCP preferred).
• Experience with lakehouse architectures and cloud data warehouses.
• Familiarity with vector databases, embeddings pipelines, and AI-serving infrastructure is a plus.
Basic Qualifications
• Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience).
• 2–4 years of experience in data engineering, data pipeline development, or related fields.
• Strong foundation in modern data engineering principles, distributed systems design, and cloud-native architectures.
• Demonstrated ability to design and operate large-scale production data systems.
• Proven track record of technical leadership and cross-functional collaboration.
• Strong problem-solving skills and ability to thrive in complex, fast-paced environments.
• Detail-oriented and committed to engineering excellence and continuous improvement.
#LI-PV1
ADDITIONAL INFORMATION
What We Offer:
• Attractive compensation and comprehensive benefits packages.
• Generous paid time off.
• An exciting and fulfilling opportunity to be part of one of Paramount’s most dynamic teams.
• Opportunities for both on-site and virtual engagement events.
• Unique opportunities to make meaningful connections and build a vibrant community, both inside and outside the workplace.
• Explore life at Paramount: https://www.paramount.com/careers/life-at-paramount