This job listing has expired and the position may no longer be open for hire.

Sr. Site Reliability Engineer at

Posted in Other 30+ days ago.

This job brought to you by America's Job Exchange

Type: Full Time
Location: Palo Alto, California

Job Description:

Post Date:

The Salesforce DMP team is looking to hire a Sr. Member for the platform engineering team.
So, you're not only well versed in infrastructure and operations, but also in developing software that can run it for you. You support application developers by building and exposing well designed tools, APIs and services. You hate waking up at 2am to production failures and that's why you have monitors for everything, and enough failover or self-healing setup so you can deal with it in the morning. You're fine with rolling out code all the time because if anything unexpected happens, you have it covered. You did this already. Twice. So you automated it. And when the caching system had a problem, you found the bug, fixed it and sent them a patch.
Great, we're dying to talk to you! Salesforce might be just the place for you; we're looking for people who can build and operate big data and ML infrastructure, fast changing webapps, real time APIs and a lot of traffic. People who are technical experts and get things done by using their smarts and whatever tools make sense to get the job done. People who love to solve new problems and thrive in a rapidly innovating space.

* Architect, develop, implement, and operate the scalable distributed systems platform that supports our big data processing, Data-Science and AI applications.

* Provide expertise and guidance to assist development teams in making the right architectural & operational decisions early

* Develop new methods and tools to automate packaging, deployment and configuration of internally developed applications, and others to self-heal production environments.

* Apply software development workflows to operational environments

* Test and tune newly developed systems to prepare them for production deployment and ensure maximum performance at minimum cost

* Lead, coach and grow our DevOps practice by bridging development, Ops and QA

* Measure everything, providing critical operational insight into our applications

* Become and stay an expert in current and emerging technologies and tools

* Contribute to Open Source solutions and communities we use wherever you can


* are a domain expert in scalable and distributed infrastructure

* ran production services in the cloud (AWS) and know where it's awesome and where it hurts

* are fluent in several programming language (preferably Scala, Java, and Python)

* make things work and get things done

* love to leave things better than how you found them and automate anything you've done more than twice

* really grok your CI (jenkins, etc), config management (puppet) and VCS tools (git)

* can write a one-liner for anything, but know when not to

* can't go to sleep knowing your tests or monitors are failing

* love to learn new things and can do so quickly

* you don't just use the code, you contribute


* love agile/lean development and delivering code

* know the pros and cons of all the technology trends in your field

* make excellent informed decisions and know when to ask for advice

* know your way around the entire stack

* are a great communicator

* have a great sense of humor

LI - Y is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Headhunters and recruitment agencies may not submit resumes/CVs through this Web site or directly to managers. does not accept unsolicited headhunter and agency resumes. will not pay fees to any third-party agency or company that does not have a signed agreement with