Home
World Journal of Advanced Research and Reviews
International Journal with High Impact Factor for fast publication of Research and Review articles

Main navigation

  • Home
    • Journal Information
    • Editorial Board Members
    • Reviewer Panel
    • Abstracting and Indexing
    • Journal Policies
    • Our CrossMark Policy
    • Publication Ethics
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Join Editorial Board
    • Join Reviewer Panel
  • Contact us
  • Downloads

eISSN: 2581-9615 || CODEN: WJARAI || Impact Factor 8.2 ||  CrossRef DOI

Research and review articles are invited for publication in March 2026 (Volume 29, Issue 3) Submit manuscript

Benchmarking cross‑platform AI: Web Assembly, ONNX Runtime and TVM for Real‑Time Web, Mobile, and IoT Deployment

Breadcrumb

  • Home
  • Benchmarking cross‑platform AI: Web Assembly, ONNX Runtime and TVM for Real‑Time Web, Mobile, and IoT Deployment

Aravind Chinnaraju *

Seattle, WA.

Review Article

World Journal of Advanced Research and Reviews, 2025, 26(02), 1937-1963

Article DOI: 10.30574/wjarr.2025.26.2.1832

DOI url: https://doi.org/10.30574/wjarr.2025.26.2.1832

Received on 02 April 2025; revised on 09 May 2025; accepted on 11 May 2025

Cross‑platform deployment of machine‑learning inference now spans browser tabs, handheld applications, and resource‑constrained sensors, yet the performance landscape remains fragmented by heterogeneous runtimes. This study conducts the first holistic benchmark that positions WebAssembly, ONNX Runtime, and Apache TVM side‑by‑side under a unified test harness across Web, mobile, and IoT devices. A theoretical foundation distinguishes compilation from interpretation, ahead‑of‑time from just‑in‑time pipelines, and outlines how hardware‑abstraction layers mediate latency, throughput, memory, and energy trade‑offs. Empirical evaluations draw on a curated model zoo and cold‑start vs. steady‑state runs to expose four‑dimensional performance frontiers. Results show that TVM’s auto‑tuned kernels deliver up to a 42 % latency reduction on ARM microcontrollers, whereas WebAssembly narrows browser‑native overheads to within 1.4× of device‑bound baselines when SIMD extensions are available. ONNX Runtime provides the broadest portability, though execution‑provider selection must be coupled with quantization to remain within sub‑100 ms response budgets on mid‑tier smartphones. Integrating telemetry pipelines through OpenTelemetry and Delta Lake permits real‑time drift detection, AIOps‑driven auto‑rollback, and carbon‑aware scheduling that lowers energy use by 18 % without SLA violations. Security analysis contrasts browser sandboxes with enclave‑based protection for mobile and IoT, while risk‑management blueprints extend chaos‑engineering to runtime drift and compatibility faults. Case studies spanning a browser‑side image classifier, a mobile augmented‑reality pose estimator, and an IoT anomaly detector validate the decision matrix that maps workload characteristics to optimal runtime choice. The findings synthesise technical insights into actionable deployment playbooks, offering researchers and practitioners a reproducible framework for balancing performance, sustainability, and resilience in real‑time edge AI. 

Cross‑Platform Inference; Real‑Time Edge AI; Latency Optimization; Energy‑Efficient Deployment; Telemetry‑Driven Observability.

https://wjarr.com/sites/default/files/fulltext_pdf/WJARR-2025-1832.pdf

Preview Article PDF

Aravind Chinnaraju. Benchmarking cross‑platform AI: Web Assembly, ONNX Runtime and TVM for Real‑Time Web, Mobile, and IoT Deployment. World Journal of Advanced Research and Reviews, 2025, 26(2), 1937-1963. Article DOI: https://doi.org/10.30574/wjarr.2025.26.2.1832

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content

Copyright © 2026 World Journal of Advanced Research and Reviews - All rights reserved

Developed & Designed by VS Infosolution