AI Research
Inference Optimization
Kernel Optimization
Metal Shading Language
MSL
Model Serving
Quantization
Flash Attention
Distributed Inference
Diffusion Models
Vision Transformers
Machine Learning Engineering
📍 Location: Bangalore
Industry: IT
Job Type: Full Time
🏠 Work From Home
Tether Operations Limited is looking for an expert AI Research Engineer specializing in Kernel and Inference Optimization for a full-time remote position, with Bengaluru as the listed location. The ideal candidate will lead work on model serving architectures and low-level GPU execution pipelines to enable efficient deployment.
Roles & Responsibilities:
Architect and deploy production-grade model serving systems that deliver high throughput, low per-token latency, and a small memory footprint.
Run and benchmark inference experiments across simulated and production edge environments, monitoring hardware utilization and error rates.
Curate comprehensive evaluation datasets and production simulation profiles tailored specifically for resource-constrained architectures.
Profile deep learning computational pipelines to detect and eliminate bottlenecks arising from batch processing, memory allocations, or networking.
Collaborate with other technical teams to integrate optimized inference pipelines into on-device production codebases.
Requirements:
Degree in Computer Science or related engineering discipline; a PhD focused on NLP or Machine Learning with reputable conference publications is highly desirable.
Proven capacity to develop custom GPU compute shaders from scratch using Metal Shading Language (MSL).
Demonstrated expertise in low-level kernel optimization and in tuning inference systems for edge or mobile hardware.
Solid understanding of advanced serving frameworks and optimization strategies such as Pruning, Quantization, Flash Attention, KV Caching, and Speculative Decoding.
Familiarity with Distributed Inference Engineering principles involving Tensor, Pipeline, or Expert Parallelism across multi-node configurations.
Strong background in the underlying mathematics of modern Diffusion Models and Vision Transformers.
Perks & Benefits:
Join a globally distributed elite team leading pioneering breakthroughs in digital finance and decentralized tech.
Fully remote work setup backed by a stable, agile, and industry-leading financial technology organization.
Opportunity to influence cutting-edge AI developments and next-generation secure communication ecosystems.
Additional Information:
Candidates are urged to apply exclusively through official channels; all primary correspondence is handled via company domains ending in @tether.to or @tether.io.
Tether never requests financial deposits, processing fees, or sensitive bank details at any stage of the recruitment process.
Company Name: Tether Operations Limited
Website: tether.recruitee.com
PIN Code: 560001