Home
World Journal of Advanced Research and Reviews
International Journal with High Impact Factor for fast publication of Research and Review articles

Main navigation

  • Home
    • Journal Information
    • Editorial Board Members
    • Reviewer Panel
    • Abstracting and Indexing
    • Journal Policies
    • Our CrossMark Policy
    • Publication Ethics
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Join Editorial Board
    • Join Reviewer Panel
  • Contact us
  • Downloads

eISSN: 2582-8185 || CODEN: WJARAI || Impact Factor 8.2 ||  CrossRef DOI

Research and review articles are invited for publication in March 2026 (Volume 29, Issue 3) Submit manuscript

Vision-guided automation: A generic approach to web form filling using GPT and computer vision

Breadcrumb

  • Home
  • Vision-guided automation: A generic approach to web form filling using GPT and computer vision

Leela Gowtham Yanamaddi 1, * and Balaji Kummari 2

1 CEO and VP of Engineering, scale.jobs 537 Payne Rd, Woodstock, GA, USA 30188.
2 CTO, scale.jobs 1-84, Beside Venugopala Swamy Temple, Rayanapadu, Vijayawada, AP 521241, India.
 
Research Article
World Journal of Advanced Research and Reviews, 2023, 20(03), 2096-2107
Article DOI: 10.30574/wjarr.2023.20.3.2524
DOI url: https://doi.org/10.30574/wjarr.2023.20.3.2524
 
Received on 03 November 2023; revised on 16 December 2023; accepted on 18 December 2023
 
Combining computer vision approaches with GPT (Generative Pre-trained Transformer) models, this research presents a novel approach to automating web-based form filling tasks. A general approach that can adapt to different forms without knowing their structure is made possible by the suggested system, which detects and labels interactive elements on web pages visually. This allows it to transcend the restrictions of hardcoded DOM element interactions. Notable advancements include utilising computer vision to recognise and label form elements and integrating GPT models to read form fields semantically and produce context-appropriate responses (for instance, using resume data). Plus, AI-guided judgements are made using a versatile action system that mimics human-like interactions like typing, clicking, and scrolling. An automated job application form filling case study demonstrates the system's efficacy and highlights its potential for wide-ranging online automation activities.
 
Vision-Guided Automation; Web Form Filling; GPT; Computer Vision
 
https://wjarr.com/sites/default/files/fulltext_pdf/WJARR-2023-2524.pdf

Preview Article PDF

Leela Gowtham Yanamaddi and Balaji Kummari. Vision-guided automation: A generic approach to web form filling using GPT and computer vision. World Journal of Advanced Research and Reviews, 2023, 20(3), 2096-2107. Article DOI: https://doi.org/10.30574/wjarr.2023.20.3.2524

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content

Copyright © 2026 International Journal of Science and Research Archive - All rights reserved

Developed & Designed by VS Infosolution