This job listing has expired and the position may no longer be open for hire.

Software Engineer - Replication Manager at Cloudera, Inc. in Palo Alto, California

Posted in Other 30+ days ago.

Type: Full Time





Job Description:

Job Description:
The Replication Manager team is looking for passionate developers to join our growing engineering team. The team is responsible for building out the data, metadata, and permissions replication support for the Hadoop ecosystem. The goal of the team is to have a seamless experience for our customers for moving the data and all entities associated with that to migration as well disaster recovery use cases.

Replication Manager enables you to replicate data across data centers or to/from the cloud for disaster recovery and migration scenarios. Replications can include data stored in HDFS, data stored in Hive tables, Hive metastore data, and Impala metadata (catalog server metadata) associated with Impala tables registered in the Hive metastore. Replication Manager not only replicates data and metadata but also translates security and governance policies as part of the move. The datasets can range from terabytes to petabytes of data with some additional challenges like millions of directories, individual file sizes ranging in gigabytes, etc.

Key Responsibilities:
-
Build and maintain large-scale replication systems on top of Hadoop
-
Work with a team of engineers to design cloud-based, low RPO, RTO replication architectures
-
Support replication across multiple Hadoop components like HDFS, Hive, HBase, Kudu, etc
-
Mentor junior engineers
-
Work with product management to formulate a product roadmap

Requirements:
-
8+ years experience building complex systems that handle \\"big data\\".
-
Strong proficiency in one JVM language such as Java, Scala
-
Familiarity with cloud-based systems
-
Strong understanding of systems, databases, networks and the web
-
Systems experience

Preferred:
-
Experience with scalable systems (petabytes and beyond)
-
Prior replication experience
-
Experience with AWS, Azure, GCP
-
Current expertise with Java/Scala developer ecosystems

Why Cloudera?
-
Amazing people - We are a fun and smart team, including many of the top luminaries in Hadoop and related open source communities. We frequently interact with the research community, collaborate with engineers at other top companies and host cutting edge researchers for tech talks.
-
Innovative work - Cloudera pushes the frontier of big data and distributed computing, as our track record shows. We test and deploy our code on clusters with hundreds of nodes, terabytes of RAM, and petabytes of storage. We work on high-profile open source projects, interacting daily with engineers at other exciting companies, speaking at meet-ups, etc.

.

Cloudera is an Equal Opportunity / Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.


More jobs in Palo Alto, California

Other
about 2 hours ago

ATR International
Other
about 4 hours ago

Athleta
Other
about 4 hours ago

Athleta
More jobs in Other

Other
1 minute ago

Tyson Foods, Inc.
Other
1 minute ago

Tyson Foods, Inc.
Other
1 minute ago

Tyson Foods, Inc.