PEPersistent
GCP Data Engineer
Pune ₹5-12 LPA Posted 15 Apr 2025
FULL TIME
Etl Tools
Database Technologies
Big Data Technologies
Job Description
What You'll Do:
- Your role is focused on Design, Development and delivery of solutions involving:
- Data Integration, Processing & Governance
- Data Storage and Computation Frameworks, Performance Optimizations
- Analytics & Visualizations
- Infrastructure & Cloud Computing
- Data Management Platforms
- Implement scalable architectural models for data processing and storage
- Build functionality for data ingestion from multiple heterogeneous sources in batch & realtime mode
- Build functionality for data analytics, search and aggregation
Expertise You'll Bring:
- Overall 4+ years of IT experience with 3+ years in Data related technologies
- Minimum 2.5 years of experience in Big Data technologies and working exposure in at GCP cloud platform
- Hands-on experience with the Hadoop stack HDFS, sqoop, kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, hive, oozie, airflow and other components required in building end to end data pipeline.
- Strong experience in at least of the programming language Java, Scala, Python.
- Hands-on working knowledge of NoSQL and MPP data platforms like Hbase, MongoDb, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc
- Well-versed and working knowledge with data platform related services on at least 1 cloud platform, IAM and data security
- Preferred Experience and Knowledge (Good to Have):
- Good knowledge of traditional ETL tools (Informatica, Talend, etc) and database technologies (Oracle, MySQL, SQL Server, Postgres) with hands on experience
- Knowledge on data governance processes (security, lineage, catalog) and tools like Collibra, Alation etc
- Knowledge on distributed messaging frameworks like ActiveMQ / RabbiMQ / Solace, search & indexing and Micro services architectures
- Performance tuning and optimization of data pipelines
- CI/CD Infra provisioning on cloud, auto build & deployment pipelines, code quality
- Cloud data specialty and other related Big data technology certifications
