Jobs / Cap***
GCP /Python Data Engineer
Cap*** · Atlanta, GA, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
Atlanta, GA, United StatesHybrid
Remuneration
Not specified
Location
Atlanta, GA, United States
Visa sponsorship
Sponsors visa
Job summary
Choosing Cap*** means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible.
Benefits
Package to all regular, full-time employees.In the U.And Canada, availableAre determined by local policy and eligibility and may include:Medical, dental, and vision coverage (or provincial healthcare coordination in CRetirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)Life and disability insuranceEmployee assistance programsOtherAs provided by local policy and eligibilityThe Company reserves the right to amend or withdraw compensation programs at any
Qualifications
- The ideal candidate will have hands-on experience developing batch and real-time data pipelines, working with large-scale datasets, and enabling analytics and AI/ML use cases.
- Experience with Dataproc (Spark/PySpark) for large-scale processing
- Familiarity with event-driven architectures
- Knowledge of Terraform or Infrastructure as Code
- Understanding of cost optimization (FinOps)
- Nice to Have
- Google Cloud Professional Data Engineer Certification
- Experience supporting AI/ML data pipelines
- The base compensation range for this role in the posted location is: 80420 - 106050
- Cap*** provides compensation range information in accordance with applicable national, state, provincial, and local pay transparency laws.
- The base compensation range listed for this position reflects the minimum and maximum target compensation Cap***, in good faith, believes it may pay for the role at the time of this posting.
- This range may be subject to change as permitted by law.
Responsibilities
- Data Engineering & Pipeline Development
- Design, build, and maintain scalable batch and real-time data pipelines using GCP services such as Dataflow, Dataproc, and Pub/Sub
- Develop and optimize ETL/ELT workflows for structured and unstructured data processing
- Implement event-driven data processing using Cloud Functions and Pub/Sub
- Build and manage data ingestion frameworks for streaming and batch data sources
- Data Storage & Processing
- Design and optimize data lakes and data warehouses using BigQuery and Cloud Storage
- Develop efficient data models to support analytics, reporting, and machine learning workloads
- Optimize performance and cost of data pipelines and queries
- Development & Automation
- Develop reusable and scalable solutions using Python
- Automate workflows and orchestration using Cloud Composer (Airflow)
Degrees
Associate
Industry
AutomotiveEducationEnergyHealthcareInsuranceLogisticsMedia
Company size
EnterpriseSmb