Job Specifications
Job Overview
We are looking for a Staff Performance Solution Engineer to help promote Arm’s success in the data center and cloud. In this role, you will analyze, measure, and optimize the performance of key data center workloads running on ARM64 platforms, and you will also optimize the associated system software. The ideal candidate will have a good background in low-level performance profiling, operating system internals, compiler interactions, CPU core microarchitecture, and SoC architecture.
Team Overview
You will join an engineering team within Arm’s infrastructure line of business, which focuses on growing Arm’s business in the data center and cloud industry. You will engage directly with Arm’s partners in this industry. The team aims to understand each partner’s technology stack deeply and accurately and to help it run better on Arm’s technology. The team focuses especially on partners whose proprietary technology stacks cannot be open sourced.
Job Responsibilities
Analyze and optimize the performance of workloads running on ARM64 platforms, spanning from C/C++ applications to dynamic language runtimes.
Leverage operating system traces and application-level instrumentation to identify performance bottlenecks in user space, collaborating closely with partners to implement effective optimizations.
Use hardware performance counters (both core and uncore PMUs) and hardware trace features to root-cause low-level performance issues across CPU pipelines, memory subsystems, I/O, and system interconnects. Present detailed analysis to inform software and hardware optimization strategies.
Design and conduct performance benchmarks, profiling experiments, and diagnostic evaluations to assess and improve system behavior.
Tune system configurations—including compiler flags, kernel parameters, scheduling policies, and runtime environments—to achieve optimal throughput, latency, or power efficiency.
Optimize system software components, including compiler back ends and C libraries.
Required Skills And Experience
Solid understanding of workload performance analysis using profiling tools such as Linux perf, or equivalent tools on other operating systems.
Familiarity with operating system internals, including context switching, interrupt handling, task scheduling, virtual memory, and NUMA architectures.
Strong foundational knowledge of SoC architectures, particularly CPU clusters, interconnects, and memory subsystems.
Experience with top-down performance analysis methodology, with the ability to drill down from application-level behavior to microarchitectural bottlenecks.
Proficient in C/C++, with the ability to navigate and understand complex codebases and interpret compiler-generated assembly.
“Nice To Have” Skills And Experience
Prior experience in performance analysis and optimization on ARM64 platforms.
Hands-on experience optimizing data center workloads, with a strong understanding of data center–specific performance challenges such as multi-core scalability, NUMA effects, and resource contention.
Track record of contributions to open-source system software, including the Linux kernel, LLVM, and the GNU toolchain.
At Arm, we’re in a pivotal moment. As AI reshapes every major industry, from the largest data centers to the smallest personal devices, our mission is clear: to help solve some of the world’s greatest challenges through technology that reaches everyone.
The future of AI is being built on Arm. And our 10x Mindset is helping shape that future. The 10x Mindset is how we frame our work for greater impact. It’s not about doing more or working harder. It’s about thinking differently, acting decisively, and building in a way that scales for our customers, our teams, and our global ecosystem.
It’s inspired by a founder’s mentality. We ask bold questions, spot opportunities others might miss and stay focused on delivering outcomes that matter. Whether we’re reimagining architectures or rethinking collaboration across time zones, this mindset helps us move forward with clarity and purpose.
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email accommodations@arm.com. Please note that by sending us the requested information, you consent to its use by Arm to arrange appropriate accommodations. All accommodation or adjustment requests will be treated confidentially, and information concerning these requests will be disclosed only as necessary to provide the accommodation. Examples of support include breaks between interviews, having documents read aloud, and office accessibility, although this is not an exhaustive list. Please email us about anything we can do to accommodate you during the recruitment process.
Hybrid Working at Arm
Arm’s approach to hybrid working is designed to create a working environment that supports both
About the Company
Arm’s foundational technology is defining the future of computing. A future built by the greatest technology ecosystem in the world. A future built on Arm.
Arm is everywhere technology matters. Technology matters everywhere.
Together, we’ll power every technology revolution moving forward, including cloud computing, automotive and autonomous systems, IoT, the metaverse, and beyond.
Changing the world. Again. On Arm.