☑ Bangalore ☑ Full Time ☑ WFH


AI Research Engineer - Inference Optimization

💰 Salary: Competitive

Key Skills:
AI Research, Inference Optimization, Kernel Optimization, Metal Shading Language (MSL), Model Serving, Quantization, Flash Attention, Distributed Inference, Diffusion Models, Vision Transformers, Machine Learning Engineering

📍 Location: Bangalore

Industry: IT

Job Type: Full Time

🏠 Work From Home

Tether Operations Limited is looking for an expert AI Research Engineer specializing in kernel and inference optimization for a full-time remote position based out of Bengaluru. The ideal candidate will lead work on model serving architectures and low-level GPU execution pipelines to make model deployment more efficient.

Roles & Responsibilities:
  • Architect and deploy production-grade model serving systems that achieve high throughput, low per-token latency, and a small memory footprint.
  • Execute and benchmark rigorous inference experiments across simulated and production edge environments, monitoring hardware utilization and error rates.
  • Curate comprehensive evaluation datasets and production simulation profiles tailored specifically for resource-constrained architectures.
  • Profile deep learning computational pipelines to detect and eliminate bottlenecks arising from batch processing, memory allocations, or networking.
  • Collaborate across technical teams to integrate optimized inference pipelines directly into on-device production codebases.
Requirements:
  • Degree in Computer Science or related engineering discipline; a PhD focused on NLP or Machine Learning with reputable conference publications is highly desirable.
  • Proven capacity to develop custom GPU compute shaders from scratch using Metal Shading Language (MSL).
  • Demonstrated expertise in low-level kernel optimization and in tuning inference systems for edge or mobile hardware.
  • Solid understanding of advanced serving frameworks and optimization strategies such as pruning, quantization, Flash Attention, KV caching, and speculative decoding.
  • Familiarity with distributed inference principles, including tensor, pipeline, or expert parallelism across multi-node configurations.
  • Strong background in the underlying mathematics of modern Diffusion Models and Vision Transformers.
Perks & Benefits:
  • Join a globally distributed elite team leading pioneering breakthroughs in digital finance and decentralized tech.
  • Fully remote work setup backed by a stable, agile, and industry-leading financial technology organization.
  • Opportunity to influence cutting-edge AI developments and next-generation secure communication ecosystems.
Additional Information:
  • Candidates are urged to apply exclusively through official channels; all primary correspondence is handled via email addresses ending in @tether.to or @tether.io.
  • Tether never requests financial deposits, processing fees, or sensitive bank details at any stage of the recruitment process.

Company Name: Tether Operations Limited

Website: tether.recruitee.com

PIN Code: 560001

⚠️ JOB SAFETY ADVISORY
Real companies in India NEVER ask for money. If asked for Registration or Processing fees, it is a SCAM. Stop all contact and report them immediately.

ID: 1501777