Senior Machine Learning Platform Engineer
Shanghai
WHO WE ARE:
Optiver is a global market maker founded in Amsterdam, with offices in London, Chicago, Austin, New York, Sydney, Shanghai, Hong Kong, Singapore, Taipei and Mumbai. Established in 1986, today we are a leading liquidity provider, with close to 2,000 employees in offices around the world, united in our commitment to improve the market through competitive pricing, execution and risk management. By providing liquidity on multiple exchanges across the world in various financial instruments, we help safeguard healthy and efficient markets. We provide liquidity to financial markets using our own capital, at our own risk, trading a wide range of products: listed derivatives, cash equities, ETFs, bonds and foreign currencies.
Since its establishment in 2012, our Shanghai office has been a rapidly growing participant in the Chinese markets, trading exchange-listed futures, options and equities in mainland China. Our vision is to become the trusted partner in the development of Chinese financial markets. With the culture of a start-up but the backing of a 35+ year-old trading firm, the Optiver Shanghai office is truly unique. Everyone who joins us will help shape the future of our company and its global impact. Get ready: we are only just beginning.
Key Responsibilities:
- Build the infrastructure and compute platform for large-scale machine learning and simulation workloads
- Focus on compute platform stability and efficiency on both CPU and GPU clusters, making the platform observable and scalable
- Utilize cluster monitoring and profiling tools to identify bottlenecks and optimize both infrastructure and software
- Troubleshoot and resolve issues related to OS, storage, network, and GPUs
Requirements:
- Solid experience running production machine learning infrastructure at large scale
- Experience in designing, deploying, profiling and troubleshooting in Linux-based computing environments
- Proficiency with GPUs, containerization, parallel computing and cluster management tools
- Experience with storage solutions for large-scale, cluster-based, data-intensive workloads
- Experience with Infrastructure as Code tools (Ansible, Terraform, etc.)
Bonus qualifications:
- Experience supporting machine learning engineers or data scientists with production workloads
- Familiarity with RDMA networking for storage and GPU systems
- Experience working with AWS or other cloud service providers
WHAT YOU CAN EXPECT FROM US:
In return for joining our elite team, you will be offered a competitive salary package as well as access to a plethora of Optiver perks. To hear more about what it is like to work here and our great culture, apply now and take the first step towards the best career move you will ever make!
As an intentionally flat organisation, we believe that great ideas and impact can come from everyone. We are passionate about empowering individuals and creating diverse teams that thrive. Every person at Optiver should feel included, valued and respected, because we believe our best work is done together.
Our commitment to diversity and inclusion is hardwired through every stage of our hiring process. We encourage applications from candidates from any and all backgrounds, and we welcome requests for reasonable adjustments during the process to ensure that you can best demonstrate your abilities.