
DevOps Infrastructure Engineer

OpenReq
Full-time
On-site
Cupertino, United States


Designing and writing software for new ASICs is hard and requires a huge amount of supporting software and tooling. It is even more challenging for model-specific ASICs, which must hit the market at the right time, so moving fast is essential.



You will drive adoption of cutting-edge tooling to improve the speed and reliability of our toolchains. You will help us innovate to do better than the industry norm by running massively parallel CI jobs, specifying and building our own fully redundant SSD-only server infrastructure, and making sure these tools run automatically and reliably.



You will work with an IT contracting firm that handles day-to-day maintenance and installation. While you must be knowledgeable enough about IT to work with this firm, most of your time will be spent designing entirely new toolchains.



The scope and title of this role can be modified for exceptional candidates.



Representative projects
● Spec out a server using a 6 GHz desktop CPU to speed up single-threaded workloads
● Decide if moving our servers to the cloud/a colo facility makes sense to improve uptime
● Set up networking infrastructure to allow Jupyter notebook users to connect to our
servers, without waiting for them to be restarted.
● Parallelize our CI stack to run on dozens of different machines at once, designing a
policy to avoid unnecessary CI failures if a machine goes down.



You may be a good fit if you:
● Are highly technical
● Have strong knowledge of Linux, containerization, CI/CD, and programming languages such
as Python/C++. You will be asked coding questions during your interview.
● Have a proven ability to lead technical teams and mentor junior members
● Have 4+ years of experience with either infrastructure engineering or software
development
● Have experience debugging complex hardware and software issues with server infrastructure



Strong candidates may also have:
● In-depth understanding of workflows used in the semiconductor industry, especially those involving Synopsys and Cadence EDA tooling and Verilator
● Proficiency with cloud computing technology and experience working with a Big 3 cloud provider
● Experience monitoring and installing datacenter hardware



We encourage you to apply even if you do not believe you meet every single qualification.