HPC Solutions Architect and System Engineer

Company:  Advanced Micro Devices, Inc
Location: Austin
Closing Date: 18/10/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
Overview:
WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_

Responsibilities:

THE TEAM:

AMD's Data Center GPU organization is transforming the industry with our AI-based graphics processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise data centers, Artificial Intelligence (AI), HPC, and embedded systems. If this resonates with you, come and join our Data Center GPU organization, where we are building amazing AI-powered products with amazing people.

 

THE ROLE:

We are looking for a dynamic, energetic Lead / Principal Systems Design Engineer to join our growing team. As a key contributor to the success of AMD’s products, you will be part of a leading team that drives and improves AMD’s ability to deliver the highest-quality, industry-leading technologies to market. The Systems Design Engineering team fosters and encourages continuous technical innovation to showcase successes as well as facilitate continuous career development.



THE PERSON:

As a leader in Systems Design Engineering, you will drive balanced, scalable, and automated solutions. In this high-visibility position, your software systems engineering expertise will be essential to product development, definition, and root-cause resolution.



KEY RESPONSIBILITIES:

Driving technical innovation to improve AMD’s capabilities across validation, including tool and script development, technical and procedural methodology enhancement, and various internal and cross-functional technical initiatives 

Debugging issues found during the process, bring-up, validation, and production phases of SOC programs 

Working with multiple teams, and tracking test execution to make sure all features are validated and optimized on time 

Working closely with supporting technical teams 

Engaging in other software/hardware modeling frameworks 

Leading a collaborative approach across multiple teams

Work with multiple teams within AMD to gather and document requirements, create and derive design details, and create architecture and systems engineering artifacts for new AI- and HPC-focused clustered systems

Work with project management and internal procurement and IT teams to create actionable Bills of Materials

Engage AMD’s partner and OEM ecosystem to have detailed knowledge of current and future offerings in the clustered systems space

Work with internal platform engineering team and other stakeholders (internal and external) to capture cluster software requirements, including tenancy and consumption modalities (e.g., baremetal, virtualization, K8s/container-native, etc.)



PREFERRED EXPERIENCE:

Programming/scripting skills (e.g. C/C++, Perl, Ruby, Python). 

Debug techniques and methodologies

Extensive experience with common lab equipment, including protocol/logic analyzers, oscilloscopes, etc.

Extensive experience with board/platform-level debug, including delivery, sequencing, analysis, and optimization 

Extensive knowledge of system architecture, technical debug, and validation strategy 

Strong analytical/problem-solving skills and pronounced attention to details 

Must be a self-starter, and able to independently drive tasks to completion 

Extensive knowledge in HPC systems design, to include storage, compute, networking, and software

Expertise in heterogeneous (CPU/GPU) and GPU-focused systems for HPC and AI/ML workloads

Experience in HPC facility planning

Understanding of and familiarity with the current server, networking, and storage OEMs and their offerings pertinent to HPC and AI/ML workloads, including knowledge of their roadmaps and ongoing relationships with OEMs and networking vendors

Experience in creating and maintaining written systems engineering artifacts (security plans, requirements specifications, CONOPs) and drawings (architecture diagrams, logical systems diagrams, cabling diagrams)

Ability to derive strong technical requirements from diverse stakeholders

Ability to support occasional travel for team and design meetings, normally within CONUS, is preferred (anticipate <20%)

Attention to detail and strong communication skills are required.



ACADEMIC CREDENTIALS:

Bachelor's or Master's degree in electrical or computer engineering

 

LOCATION:

Austin, Texas, US

Markham, Canada

 


Qualifications:
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role, such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
