Tushar Semwal

Tushar Semwal

Tushar Semwal

|
Lead Data Scientist @ Presight (G42), Abu Dhabi

Lead Data Scientist at Presight (G42), Abu Dhabi — an ADX-listed AI and decision intelligence company. Leading agentic systems, multi-agent orchestration, and LLM-based tool development across Legal, Energy, and enterprise domains.

My work sits at the intersection of research and production: LLM fine-tuning, RAG, knowledge graphs, hallucination control, and decision-boundary problems in low-resource agentic settings. I care about research that survives contact with real systems.

Previously at UNEY (Dubai), Microsoft Copilot (Principal Applied Scientist), Mobius Labs (Berlin), and The University of Edinburgh (Research Associate).

PhD in Distributed Machine Learning from IIT Guwahati. 20+ publications · 84+ citations on top paper · Google Scholar profile.

✦ HOVER TO SEE ME ✦
Tushar Semwal
20+
Publications
200+
Citations
8+
Years Research
3
Continents
Experience
  • Presight (G42)
    Lead Data Scientist
    Mar 2026 – Present
  • UNEY
    Senior Applied Scientist
    Jul 2025 – Mar 2026
  • Microsoft
    Microsoft
    Principal Applied Scientist
    May 2021 – Jul 2025
  • Mobius Labs GmbH
    Mobius Labs GmbH
    Machine Learning Scientist
    Jan 2021 – May 2021
  • The University of Edinburgh
    The University of Edinburgh
    Research Associate (Energy Sector)
    Jan 2019 – Jan 2021
  • OpenMined
    Research Scientist (Open Source)
    Mar 2020 – Jan 2021
  • Samsung Research Institute, Bengaluru
    Research Intern (Bixby Team)
    Jun 2017 – Sep 2017
Education
  • Indian Institute of Technology Guwahati
    Indian Institute of Technology Guwahati
    PhD — Distributed Machine Learning
    2015 – 2019
  • Indian Institute of Technology Guwahati
    Indian Institute of Technology Guwahati
    MTech — Computer Science and Engineering
    2013 – 2015
  • College of Engineering Roorkee
    BTech — Electronics and Telecommunication Engineering
    2009 – 2013
Honors & Awards
  • Microsoft Global Hackathon Winner (international & country regions)
    2022
  • Covid-19 Medal Award — University of Edinburgh
    2022
  • Coverage in a leading Scottish newspaper for applied ML work
    2022
  • Microsoft Travel Grant — SDM 2018, San Diego
    2018
  • DST SERB Travel Grant — SDM 2018
    2018
  • Travel Grant — AAMAS Summer School on Multi-Agent Systems
    2016
  • TCS Research Fellowship (4-year national industrial award)
    2015
News
2026
Paper accepted at SIGIR 2026 (Main Track, Rank A*): "Teaching Small Models When Not to Call Functions: Structured Reasoning for Tool Refusal in Low-Resource Languages." — on decision boundaries and hallucination control in agentic LLM systems.
Apr 20
Joined Presight (G42) as Lead Data Scientist — building SOTA agentic systems for Legal, Energy, and enterprise at one of the UAE's top AI companies.
Mar 01
2025
Left Microsoft and relocated to Dubai to join an early-stage AI startup — trading comfort for conviction.
Jan 01
2023
My daughter was born — the best moment of my life. ❤️
Dec 31
Invited tutorial session on Graph Learning organised by LBRCE, Vijaywada. Delivered a successfull lecture to 300+ students and faculty members.
Dec 01
I got promoted to senior position @ microsoft.
Jul 22
2022
Invited tutorial on Federated Graph Learning organised by IIIT Hyderbad.
Dec 02
Invited talk on Privacy-aware Machine Learning on Data you Cannot SEE, organised by SAB, IIT Guwahati (alma mater).
Feb 02
2021
I have joined Microsoft India as an Applied Scientist and relocated back to India. I would like to cherish my time with my family.
May 02
My book published and now available on amazon.
Mar 02
Selected Publications (view all )
Teaching Small Models When Not to Call Functions: Structured Reasoning for Tool Refusal in Low-Resource Languages
ACM SIGIR 2026 (Main Track) 2026 Rank A*
Teaching Small Models When Not to Call Functions: Structured Reasoning for Tool Refusal in Low-Resource Languages

Standard fine-tuning teaches models how to call tools — not when not to. In low-resource languages (Vietnamese, Thai) on 1B-parameter models, this causes catastrophic hallucination (−21pp on Gemma-Thai). We fix it with structured key-value reasoning forms that make the refusal decision explicit at training time, improving irrelevance detection and hallucination by +10–30pp and slashing hallucination probability 50–60× (from 67–76% down to 1.2–1.3%) — at 1.9× lower token cost than chain-of-thought. Published at SIGIR '26, Melbourne.

Dung Pham Tuan Vo*, Thai Trung Tran*, Tushar Semwal (* equal contribution)

[Paper] [ACM DL]

FedPerf: A Practitioners' Guide to Performance of Federated Learning Algorithms
Proceedings of Machine Learning Research (PMLR 148), NeurIPS 2020 Preregistration Workshop 2021 12 citations
FedPerf: A Practitioners' Guide to Performance of Federated Learning Algorithms

Which federated learning algorithm should you actually deploy? FedAvg, FedProx, FedMed, and qFedAvg all look similar on accuracy — but diverge dramatically when your data is skewed, nodes drop out, or fairness across clients matters. This large-scale empirical study benchmarks them across all three dimensions simultaneously using spider charts, giving practitioners a single diagnostic view to pick the right algorithm for their constraints. Published in PMLR 148, 2021.

Ajinkya Mulay, Baye Gaspard, Rakshit Naidu, Santiago Gonzalez-Toral, Vineeth S, Tushar Semwal, Ayush Manish Agrawal

[Paper] [Google Scholar]

Selective Federated Transfer Learning using Representation Similarity
NeurIPS 2020 Workshop on Scalable, Privacy-preserving and Federated Learning (SpicyFL) 2020
Selective Federated Transfer Learning using Representation Similarity

In federated learning you can't see client data — so how do you pick the right pretrained model to transfer? We use Centered Kernel Alignment (CKA) with sketching to compare model representations locally on-device, then run a federated voting algorithm to select the best source model without sharing any raw data. Cuts communication rounds up to 5× versus FedAvg baseline.

Tushar Semwal, Haofan Wang, Chinnakotla Krishna Teja Reddy

[Paper] [Google Scholar]

A Practitioners' Guide to Transfer Learning for Text Classification using Convolutional Neural Networks
SIAM International Conference on Data Mining (SDM) 2018 84 citations
A Practitioners' Guide to Transfer Learning for Text Classification using Convolutional Neural Networks

How much of a CNN trained on one text domain transfers to another — and which layers should you freeze? This practitioner's guide runs systematic experiments across layer depth, fine-tuning strategies, and hyperparameters, delivering concrete rules of thumb for real NLP pipelines. Originated from work at Samsung Research; 84 citations and still a go-to reference for applied transfer learning.

Tushar Semwal, Promod Yenigalla, Gaurav Mathur, Sumit Bose Nair

[Paper] [Google Scholar]

On Ordering Multi-Robot Task Executions within a Cyber Physical System
ACM Transactions on Autonomous and Adaptive Systems (TAAS) 2017 25 citations
On Ordering Multi-Robot Task Executions within a Cyber Physical System

When multiple robots share a workspace, wrong task ordering causes bottlenecks, collisions, and deadlocks. This paper formalises the scheduling problem in cyber-physical systems and proposes a distributed solution that balances workload without a central coordinator — validated on real robot hardware. Published in ACM TAAS, one of the top journals in autonomous adaptive systems.

Tushar Semwal, Shashi Shekhar Jha, Sumit Bose Nair

[ACM DL] [Google Scholar]