<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">67538</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2025.067538</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Towards Efficient Vehicle Recognition: A Unified System for VMMR, ANPR, and Color Classification</article-title>
<alt-title alt-title-type="left-running-head">Towards Efficient Vehicle Recognition: A Unified System for VMMR, ANPR, and Color Classification</alt-title>
<alt-title alt-title-type="right-running-head">Towards Efficient Vehicle Recognition: A Unified System for VMMR, ANPR, and Color Classification</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Sadiq</surname><given-names>Saad</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>Sultan</surname><given-names>Kashif</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Sheraz</surname><given-names>Muhammad</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Chee Chuah</surname><given-names>Teong</given-names></name><xref ref-type="aff" rid="aff-2">2</xref><email>tcchuah@mmu.edu.my</email></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Hashmi</surname><given-names>Muhammad Usman</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<aff id="aff-1"><label>1</label><institution>Department of Software Engineering, Bahria University H-11 Campus</institution>, <addr-line>Islamabad, 44000</addr-line>, <country>Pakistan</country></aff>
<aff id="aff-2"><label>2</label><institution>Centre for Smart Systems and Automation, CoE for Robotics and Sensing Technologies, 
Faculty of Artificial Intelligence and Engineering, Multimedia University, Persiaran Multimedia</institution>, <addr-line>Cyberjaya, 63100, Selangor</addr-line>, <country>Malaysia</country></aff>
<aff id="aff-3"><label>3</label><institution>Department of Computer Science, Bahria University E-8 Campus</institution>, <addr-line>Islamabad, 44220</addr-line>, <country>Pakistan</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Teong Chee Chuah. Email: <email>tcchuah@mmu.edu.my</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2025</year></pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>23</day><month>09</month><year>2025</year></pub-date>
<volume>85</volume>
<issue>2</issue>
<fpage>3945</fpage>
<lpage>3963</lpage>
<history>
<date date-type="received">
<day>06</day>
<month>5</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>03</day>
<month>7</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2025 The Authors.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_67538.pdf"></self-uri>
<abstract>
<p>Vehicle recognition plays a vital role in intelligent transportation systems, law enforcement, access control, and security operations&#x2014;domains that are becoming increasingly dynamic and complex. Despite advancements, most existing solutions remain siloed, addressing individual tasks such as vehicle make and model recognition (VMMR), automatic number plate recognition (ANPR), and color classification separately. This fragmented approach limits real-world efficiency, leading to slower processing, reduced accuracy, and increased operational costs, particularly in traffic monitoring and surveillance scenarios. To address these limitations, we present a unified framework that consolidates all three recognition tasks into a single, lightweight system. The framework utilizes MobileNetV2 for efficient VMMR, YOLO (You Only Look Once) for accurate license plate detection, and histogram-based clustering in the HSV color space for precise color identification. Rather than optimizing each module in isolation, our approach emphasizes tight integration, enabling improved performance and reliability. The system also features adaptive image calibration and robust algorithmic enhancements to ensure consistent results under varying environmental conditions. Experimental evaluations demonstrate that the proposed model achieves a combined accuracy of 93.3%, outperforming traditional methods and offering practical scalability for deployment in real-world transportation infrastructures.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>VMMR</kwd>
<kwd>ANPR</kwd>
<kwd>unified framework</kwd>
<kwd>deep learning</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>Multimedia University</funding-source>
<award-id>MMUI/250008</award-id>
</award-group>
<award-group id="awg2">
<funding-source>Telekom Research and Development Sdn Bhd</funding-source>
<award-id>RDTC/241149</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Vehicle recognition technologies are essential for effective traffic management and law enforcement, enabling regulated vehicle flow and controlled access to restricted zones. Modern systems typically rely on three core components: Vehicle Make and Model Recognition (VMMR), Automatic Number Plate Recognition (ANPR), and vehicle color detection. VMMR enhances security by assisting in vehicle identification, supporting enforcement and surveillance tasks [<xref ref-type="bibr" rid="ref-1">1</xref>]. ANPR enables real-time tracking and precise license plate recognition, while color detection systems improve traffic monitoring and public safety through automated color extraction.</p>
<p>Recent advances in deep learning have demonstrated promising results by integrating VMMR and ANPR [<xref ref-type="bibr" rid="ref-1">1</xref>]. Building on these developments, our framework employs MobileNetV2 and YOLO for initial vehicle detection, while YOLOv4-tiny, PaddleOCR, and SVTR-tiny enhance ANPR accuracy under varying environmental conditions. Our approach refines existing methods, prioritizing lightweight architecture and operational efficiency for real-world deployment.</p>
<p>Despite progress in each domain, standalone systems face significant limitations in practical scenarios. VMMR systems often struggle with environmental noise, such as poor lighting, adverse weather, and congested traffic [<xref ref-type="bibr" rid="ref-2">2</xref>]. Similarly, ANPR performance degrades under low-light conditions, non-standard camera angles, and regional regulatory variations. Vehicle color detection is also susceptible to lighting fluctuations and weather-related distortions, leading to misclassifications. Without addressing these challenges, systems fail to provide comprehensive, actionable vehicle data.</p>
<p>Advanced deep learning models like EfficientNet [<xref ref-type="bibr" rid="ref-3">3</xref>] and Vision Transformers [<xref ref-type="bibr" rid="ref-4">4</xref>] have improved feature extraction and pattern recognition. EfficientNet achieves high accuracy through compound scaling, while Vision Transformers use attention mechanisms to capture fine-grained details. However, both demand substantial computational resources, limiting their use in resource-constrained environments.</p>
<p>While individual subsystems have matured, a critical gap remains in developing unified, lightweight frameworks that integrate VMMR, ANPR, and color detection. Our work bridges this gap by combining the strengths of each subsystem to compensate for their weaknesses. We anchor our framework in MobileNetV2, chosen for its efficiency and low computational overhead, making it ideal for real-time, edge-level intelligent transportation applications.</p>
<p>Despite extensive research, few holistic systems reliably integrate these capabilities under diverse conditions. This paper contributes the following:
<list list-type="order">
<list-item>
<p>We propose an integrated system combining VMMR, ANPR, and color detection in a modular, extensible architecture. By merging adaptive image processing with deep learning, our solution delivers efficient, multifunctional recognition.</p></list-item>
<list-item>
<p>Our framework incorporates an augmentation strategy that enhances performance across varying conditions (e.g., illumination changes, weather effects). Combined with optimized inference, this ensures accurate real-time recognition in complex environments.</p></list-item>
<list-item>
<p>To maximize applicability, we combine MobileNetV2, YOLO-based detection, and HSV color clustering. This hybrid approach improves modularity, scalability, and robustness, making the system viable for traffic monitoring, law enforcement, and security applications.</p></list-item>
</list></p>
<p>By consolidating these functionalities into a single framework, our solution eliminates the need for separate systems, offering a more efficient and reliable approach to real-world vehicle recognition.</p>
<sec id="s1_1">
<label>1.1</label>
<title>Motivation</title>
<p>The innovative aspect of our work is that it combines Vehicle Make and Model Recognition (VMMR), Automatic Number Plate Recognition (ANPR) and color detection into a single, lightweight model, instead of suggesting new solutions to each of those tasks. Although tremendous progress has been achieved in VMMR (e.g., CNN-based methodologies [<xref ref-type="bibr" rid="ref-5">5</xref>]), ANPR (e.g., YOLO-based detection [<xref ref-type="bibr" rid="ref-6">6</xref>]) and color identification (e.g., HSV clustering [<xref ref-type="bibr" rid="ref-7">7</xref>]), the techniques are often used alone, resulting in inefficient multi-task vehicle recognition systems. The framework is the integration of these known methods, where MobileNetV2 is used in VMMR, YOLO and EasyOCR in ANPR, and histogram-based HSV clustering in color detection, which results in the concurrent processing of the methods and therefore, leads to a better performance in applications such as traffic surveillance, law enforcement, and security management in Pakistan. The reason behind this combined effort is that a practical and context related vehicle recognition system is needed that best fits the Pakistani traffic conditions, which require more robust and efficient solutions to environmental issues (e.g., poor lighting, occlusions) and specific license plate format in the region (e.g., Pakistani plates). Current approaches, including conventional ANPR schemes [<xref ref-type="bibr" rid="ref-8">8</xref>], or deep learning systems pretrained over cross-geographical datasets, are either prone to differences between local plates or have significant computational costs that are not acceptable in edge devices. We build our framework on a dataset of 4000 Pakistani license plates to train ANPR, and a dataset of approximately 364 additional Pakistani images (Stanford Cars) to train VMMR to maximize performance in the target scenario. Our system is able to achieve a 93.3% overall accuracy (<xref ref-type="sec" rid="s4">Section 4</xref>) and computational efficiency, which provides a scalable solution to intelligent transportation systems, by focusing on integration and not reinvention. The cohesive nature of such a system not only eliminates the redundancy of sequential or separate processing, but also improves reliability because of cross-verification of VMMR, ANPR and color data. Again, as an example, the text of license plates and vehicle make and color can be combined to enhance identification of vehicles in security systems.</p>
<p>This integrated solution does not only solve the inefficient nature of sequential or divisive processing, but also improves reliability due to cross-checking of VMMR, ANPR and color data. To illustrate, integration of license plate text and vehicle make and color enhances the accuracy of identification in the security fields. Our work therefore introduces a novel method of vehicle recognition by coordinating the current approaches into one practical framework that can be applied in real world.</p>
</sec>
<sec id="s1_2">
<label>1.2</label>
<title>Problem Formulation</title>
<p>The vehicle recognition systems are important in areas that include traffic monitoring, police work, and security. Nevertheless, it can be said that their efficiency is frequently hindered by the disjointed nature of the most important elements, i.e., Vehicle Make and Model Recognition (VMMR), Automatic Number Plate Recognition (ANPR), and color detection. Such components are usually established and function as independent or sequential processes, resulting in inefficiencies that impedes real time performance. It is especially challenging in region-specific cases, such as Pakistan, where environmental conditions, such as bad lighting, occlusions, or bad weather, and license plate formats that are unique to the country are an added problem. In addition, most of the available recognition algorithms are computationally intensive, and thus they cannot be implemented on resource-limited devices that are often used in practice. The research question that is answered in this work is to come up with a unified theoretical framework to combine the existing methods of VMMR, ANPR and color detection to one integrated system. The idea is to allow parallel computation of these parts to make vehicle detection quick, stable, and resilient. In particular, the framework is meant to integrate the inference of the make and model of a vehicle as well as the reading of its license plate text along with the most dominant color of the vehicle into a simplified procedure. Integrated system aims at avoiding the inefficiencies of isolated systems, providing that all elements operate collaboratively to improve the general performance in real life scenarios.</p>
<p>The goal is to develop a system that would achieve single result from integrated recognition tasks but at the same time be computationally efficient and robust to large variations, especially those that may occur in Pakistani traffic conditions. Instead of devoting attention to the creation of new approaches to each particular task, the given framework is aimed at the combination of already existing methods and techniques in order to develop a new way of vehicle recognition. The proposed system would address the issues of fragmentation and context-specific needs, and thus it would offer a feasible solution towards real-time vehicle identification within intelligent transportation systems.</p>
</sec>
</sec>
<sec id="s2">
<label>2</label>
<title>Literature Review</title>
<p>Vehicle Make and Model Recognition (VMMR), Automatic Number Plate Recognition (ANPR), and car color detection have been the focus of extensive research aimed at enhancing recognition performance across diverse real-world scenarios. While notable advancements have been made in each of these areas, most existing systems have been developed independently, resulting in fragmented solutions that often fail to deliver comprehensive performance in dynamic and varied environments.</p>
<p>VMMR, in particular, poses significant challenges due to high intra-class variance arising from vehicle modifications, inconsistent lighting conditions, and varying viewing angles. For instance, the study in [<xref ref-type="bibr" rid="ref-5">5</xref>] presents a deep learning-based VMMR model that demonstrates high accuracy under controlled settings. By leveraging convolutional neural networks (CNNs) for feature extraction, the system achieved impressive recognition rates. However, its performance declined significantly in real-world scenarios involving inconsistent backgrounds and fluctuating lighting conditions. Another approach in [<xref ref-type="bibr" rid="ref-5">5</xref>] employed a hybrid method combining CNNs and classical image processing techniques. Although effective in structured environments, the model&#x2019;s reliance on high computational power and large volumes of labeled data limits its scalability and real-time application.</p>
<p>Recent developments in ANPR systems have also garnered attention. Research cited in [<xref ref-type="bibr" rid="ref-8">8</xref>] highlighted the success of traditional image processing techniques, particularly during daytime conditions. However, our analysis reveals a sharp performance drop during nighttime or under adverse weather conditions. Enhanced systems, as described in [<xref ref-type="bibr" rid="ref-8">8</xref>], integrated image preprocessing with deep learning to improve visibility robustness. Despite these improvements, they struggled with unusual license plate fonts and partially obscured characters, indicating room for further refinement.</p>
<p>Color recognition, too, remains sensitive to environmental conditions. The work in [<xref ref-type="bibr" rid="ref-9">9</xref>] introduced a technique that combined histogram equalization with image analysis to compensate for lighting inconsistencies. While this method achieved satisfactory results in stable lighting, it was less effective during rapid illumination changes, such as those caused by natural weather transitions. Further developments, including the machine learning algorithm proposed in [<xref ref-type="bibr" rid="ref-10">10</xref>], improved performance under such dynamic conditions. However, the model faced limitations when differentiating between similar dark shades like navy blue and black, often confusing them.</p>
<p>Integrated vehicle recognition systems combining VMMR, ANPR, and color detection are still relatively rare. &#x00C1;lvarez-Bazo et al. [<xref ref-type="bibr" rid="ref-11">11</xref>] proposed a joint VMMR and ANPR system optimized for well-lit conditions, but it lacked support for color detection. Another notable work by Guerrero-Ibez et al. [<xref ref-type="bibr" rid="ref-12">12</xref>] combined VMMR, ANPR, and facial recognition using SIFT and OCR techniques, achieving 75% accuracy on a toy car dataset. However, the system&#x2019;s limited scalability and modest accuracy prevent its practical deployment. While several integrated models have demonstrated improved accuracy by using CNNs for VMMR and ANPR, their computational demands render them unsuitable for edge devices or real-time use. Moreover, most existing solutions overlook challenges related to environmental robustness, computational efficiency, and region-specific needs, such as accommodating the variations in Pakistani license plates.</p>
<p>Addressing these gaps, our framework integrates proven methods into a unified architecture that is both efficient and robust. It achieves a significantly improved overall accuracy of 93.3% (as discussed in <xref ref-type="sec" rid="s4">Section 4</xref>), making it a strong candidate for real-world deployment.</p>
<p>Additionally, scalability to large datasets posed a challenge for the system. The findings of existing work are summarized in <xref ref-type="table" rid="table-1">Table 1</xref>.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Summary of existing work</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Reference</th>
<th>Year of publication</th>
<th align="center">Limitations and gaps</th>
</tr>
</thead>
<tbody>
<tr>
<td>[<xref ref-type="bibr" rid="ref-1">1</xref>]</td>
<td>2024</td>
<td>Limited discussion on real-time performance under extreme weather conditions; lacks scalability analysis for diverse regions.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-2">2</xref>]</td>
<td>2019</td>
<td>Unsupervised feature learning may struggle with diverse vehicle makes and models; lacks robustness in low-light conditions.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-4">4</xref>]</td>
<td>2020</td>
<td>Transformers for image recognition are computationally intensive, limiting real-time deployment on resource-constrained devices.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-5">5</xref>]</td>
<td>2021</td>
<td>Pipeline tailored for specific access control scenarios; may not generalize well across varying plate formats globally.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-6">6</xref>]</td>
<td>2022</td>
<td>Focuses on computing technology but lacks detailed integration with real-world autonomous vehicle challenges like occlusion.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-7">7</xref>]</td>
<td>2019</td>
<td>Impact on law enforcement is theoretical; lacks empirical data on autonomous police vehicle deployment challenges.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-10">10</xref>]</td>
<td>2020</td>
<td>Low-cost sensor design may compromise accuracy in high-traffic or adverse weather conditions.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-12">12</xref>]</td>
<td>2018</td>
<td>Sensor technologies overview lacks specific performance metrics for vehicle recognition in dynamic environments.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-13">13</xref>]</td>
<td>2020</td>
<td>Fusion system depends on controlled gate scenarios; real-world variability in lighting and angles is a gap.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-14">14</xref>]</td>
<td>2020</td>
<td>Robustness for Iranian plates is strong but may not extend effectively to multilingual or non-standard formats.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td>2014</td>
<td>Vehicle color classification relies on controlled urban video data; performance in varied lighting is untested.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>2022</td>
<td>Fine-grained classification in urban scenes lacks robustness against occlusion and diverse weather conditions.</td>
</tr>
<tr>
<td>[<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>2020</td>
<td>CNN-based system tested on limited datasets; scalability and adaptability to diverse plate styles are unaddressed.</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3">
<label>3</label>
<title>Proposed Framework</title>
<p>The proposed vehicle recognition and color detection framework employs advanced deep-learning models and image processing techniques to identify cars, license plates, manufacturers, and colors, as shown in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. The process begins with image import from CCTV, dashcams, or other sources, followed by checks to ensure images meet acceptable formats, such as .jpg, .jpeg, and .png. Resizing, normalization, noise reduction, and contrast enhancement improve image quality and uniformity.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>System architecture modular diagram showing ANPR, color, and VMMR detection modules</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-1.tif"/>
</fig>
<p>Using grid-scanning, YOLO (You Only Look Once) predicts bounding boxes and class probabilities to identify vehicles and output their coordinates [<xref ref-type="bibr" rid="ref-18">18</xref>]. Vehicle parts are trimmed for processing. We have trained the MobileNetV2 algorithm to accurately classify automobile manufacturers and models. The choice of MobileNetV2 for the VMMR task was driven by its ability to balance accuracy and computational efficiency. Unlike heavier architectures, MobileNetV2 is designed to perform well on resource-constrained edge devices, ensuring real-time performance critical to intelligent transportation systems [<xref ref-type="bibr" rid="ref-19">19</xref>]. Additionally, its depth-wise separable convolutions reduce computational cost without compromising feature extraction quality for ANPR, another YOLO model detects license plates, and EasyOCR extracts text. License plate OCR begins with grayscale conversion, scaling, and enhancement.</p>
<p>To detect color, BGR automobile images are converted to HSV color space, where color is clustered by HSV ranges to identify the dominant color. Vehicle make, model, license plate text, and color data are used for applications such as traffic monitoring, automatic toll collection, and smart parking. To handle data volume and scale, containerization and microservices are deployed on edge devices and cloud infrastructure. Integrating these modules with a registered vehicle database allows cross- checking, improving system efficacy. A CSV file backup serves as the test database for real-time verified vehicle information. The proposed framework comprises the following modules: VMMR, ANPR, color detection, as elaborated next.</p>
<sec id="s3_1">
<label>3.1</label>
<title>Vehicle Make and Model Recognition (VMMR)</title>
<p>The VMMR module serves as the foundation of our vehicle recognition framework. It leverages Convolutional Neural Networks (CNNs) to accurately identify a vehicle&#x2019;s make and model from captured images. CNNs are particularly well-suited for this task due to their ability to detect and interpret spatial patterns within visual data.</p>
<p>In this module, the CNN analyzes distinctive design features of vehicles to determine both the manufacturer and model. The process begins with the collection of wide-angle images of various vehicle types, captured from multiple perspectives. This dataset includes images taken under diverse environmental conditions to reflect real-world variability, enabling the CNN to generalize effectively during training.</p>
<p>To enhance robustness, we employ data augmentation techniques such as image rotation and flipping. These transformations expose the model to different visual orientations, allowing it to recognize vehicles even when they appear at unusual angles or are partially obscured. As a result, the model performs more reliably in practical, real-world scenarios. Throughout its layered architecture, the CNN progressively extracts and filters image features such as edges, textures, and shapes, crucial for distinguishing vehicle characteristics. The model is trained using labeled datasets, with optimization algorithms like Adam and SGD applied to fine-tune its parameters for better performance.</p>
<p>We also integrate regularization methods such as dropout and batch normalization to reduce overfitting and improve the model&#x2019;s adaptability across varying inputs. The final trained model is evaluated using standard performance metrics including accuracy, precision, recall, and F1 score, to ensure it generalizes well to unseen data. <xref ref-type="fig" rid="fig-2">Fig. 2</xref> presents a visual overview of the complete workflow, from image input (supporting .JPEG, .JPG, and .PNG formats) to final vehicle identification output.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Conceptual framework of the VMMR module</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-2.tif"/>
</fig>
<p>Popular deep learning frameworks, such as TensorFlow and PyTorch, facilitate the implementation and training the CNN model, streamlining development for researchers. In addition, transfer learning is applied, by utilizing MobileNetV2 pre-trained on the ImageNet dataset. This approach ensures faster convergence during training and enhances generalization to diverse vehicle types and conditions. To adapt the model to real-world scenarios, it was fine-tuned on a curated, domain-specific dataset that reflects variations in lighting, weather, and occlusions. This fine-tuning process enables the model to effectively handle challenging environmental conditions. The integration of transfer learning significantly reduces computational overhead and improves real-time performance, making it suitable for intelligent transportation systems.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Automatic Number Plate Recognition (ANPR)</title>
<p>The ANPR module detects and reads license plates from vehicle images, a critical component in vehicle identification systems. This process involves several sub-steps: image preprocessing, license plate detection, and Optical Character Recognition (OCR), each vital to accurately extracting textual information from license plates, as shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Automatic number plate recognition coordinates extraction framework</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-3.tif"/>
</fig>
<p><xref ref-type="fig" rid="fig-4">Fig. 4</xref> shows the role of EasyOCR, which receives the grayscale, scaled, and enhanced license plate images. EasyOCR uses CNNs and RNNs to accurately extract text from images by analyzing character patterns and structures, converting images to text. Supporting a variety of license plate fonts and styles, EasyOCR enables applications like toll collection, traffic monitoring, and access control through the automated conversion of license plate images to text.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>OCR image processing framework</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-4.tif"/>
</fig>
<p>By registering vehicle number plates, we compare the ANPR results to verify if the plates detected by OCR match the registered plates. When a match is found, we calculate the probability of license plate recognition accuracy. For example, given two plates: ABC123 and ABD123, the algorithm first verifies if the detected text contains the original characters. If true, we use the Sequence-Matcher (<xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>) to find the Longest Common Matching Subsequence (LCMS), in this case is &#x201C;AB&#x201D; and &#x201C;123&#x201D;.
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn>5</mml:mn><mml:mspace width="thinmathspace" /><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>B</mml:mi><mml:mo>+</mml:mo><mml:mn>123</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn>12</mml:mn></mml:math></inline-formula> (total number of characters in both sequences: 6 &#x002B; 6)</p>
<p>The similarity ratio <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mi>r</mml:mi></mml:math></inline-formula> is calculated as follows:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>2.0</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mi>M</mml:mi><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:math></disp-formula>
With <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn>5</mml:mn></mml:math></inline-formula> and <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn>12</mml:mn></mml:math></inline-formula>, the ratio is:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mi>r</mml:mi><mml:mo>&#x2248;</mml:mo><mml:mn>0.833</mml:mn></mml:math></disp-formula></p>
<p>In <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref>, <italic>M</italic> represents the number of matching characters, <italic>T</italic> is the total character count in both sequences, and r is the similarity ratio. The system marks a positive match whenever the result value <italic>r</italic> hits or beats the required level (for example 0.75).</p>
<p>After grayscale conversion the system prepares license plate images by increasing image contrast and reducing background noise for better detection. Modern object detection systems (YOLO and Faster R-CNN) find license plates by locating possible plate areas through their specific recognition algorithms. These algorithms serve their purpose by finding license plates correctly even in challenging scenes.</p>
<p>OCR automatically gets textual data from detected license plate images. EasyOCR scans characters because it was trained to spot multiple font styles across various plates. Text accuracy remains reliable after OCR processing because post processing tools verify spelling and check character sequences to fix any detection mistakes. EasyOCR achieves good OCR results because it contains machine learning models trained to identify license plate characters from multiple types. For accurate license plate recognition YOLO and Faster R-CNN detection methods assist this process. Texture mapping and visual contrast adjustments happen through the OpenCV package [<xref ref-type="bibr" rid="ref-20">20</xref>] on it. EasyOCR reads the characters, being trained to recognize different fonts and designs for robustness. Common OCR errors are corrected through post processing, such as spell checking and character validation, to maintain text accuracy. EasyOCR&#x2019;s effectiveness in OCR tasks is supported by its pre-trained models, which recognize characters in different license plate formats. Additionally, object detection algorithms such as YOLO or Faster R-CNN are used for efficient and accurate license plate detection. Image preprocessing and enhancement are performed with OpenCV [<xref ref-type="bibr" rid="ref-21">21</xref>].</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Color Detection</title>
<p>Vehicles need their specific colors to help people recognize them. Our Color Detection tool uses the HSV color model to precisely identify and group vehicle color patterns.</p>
<p>Our system transforms BGR color space images into HSV space for processing. HSV processing gives better results because it splits color data from brightness data making the system work well in different lighting conditions. Our system divides the transformed image into color segments using precise HSV parameters. Our system creates separate color-based masks that locate all image sections matching defined color ranges.</p>
<p>The system detects the vehicle color by studying where pixels of each hue cluster in the separated image areas. Our system improves color recognition by using real-world measurements to update threshold values so it detects colors that match real-life scenarios. The repeated improvement steps keep the results accurate in different situations. OpenCV processes color space conversion and segmentation to make image processing run faster. The system identifies vehicle colors by using predefined HSV color ranges for red, blue and gray plus making additional changes from observed results.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Unified Approach</title>
<p>The combination of VMMR, ANPR, and Color Detection enables us to develop a strong system that extracts vehicle details from images. Combining these features lets the system see and study vehicles better in different environments. The system takes input images as it moves them through one module after another with the output from each step connecting to the next. The connected workflow helps the system run better while each part helps the other parts work. Through the VMMR module the system gains essential vehicle information which makes both license plate recognition and color detection operate better.</p>
<p>The proposed unified approach is summarized in Algorithm 1. The Algorithm 1 is a theoretical framework that transforms an input image into a decision outcome, enabling vehicle identification for applications such as traffic monitoring and law enforcement in Pakistani contexts. The process begins with an input image I, representing raw data acquired from sources such as CCTV cameras, dashcams, or manual uploads. This image undergoes a series of transformations, including pre-processing, vehicle detection, VMMR, ANPR with a Sequence Matcher, and color detection, culminating in a feature vector F that is compared against a registered vehicle database DB to produce an output decision (matched or unmatched).</p>
<fig id="fig-9">
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-9.tif"/>
</fig>
<p><bold>Algorithm Formulation</bold></p>
<p>The algorithm is defined by a function <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that maps the input image <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mi>I</mml:mi></mml:math></inline-formula> to the feature vector <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>F</mml:mi></mml:math></inline-formula>, where:
<list list-type="bullet">
<list-item>
<p><inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi>F</mml:mi></mml:math></inline-formula> is the complete set of recognized features, defined as</p></list-item>
</list></p>
<p><disp-formula id="ueqn-4"><mml:math id="mml-ueqn-4" display="block"><mml:mi>F</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>.</mml:mo></mml:math></disp-formula>
<list list-type="bullet">
<list-item>
<p><inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denotes the vehicle make and model.</p></list-item>
<list-item>
<p><inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denotes the license plate text, refined by a Sequence Matcher.</p></list-item>
<list-item>
<p><inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denotes the dominant color of the vehicle.</p></list-item>
</list></p>
<p>The transformation <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> comprises sequential and parallel sub-functions, reflecting the algorithm&#x2019;s stages:</p>
<p><list list-type="roman-lower">
<list-item>
<p><bold>Pre-processing:</bold> The input <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>I</mml:mi></mml:math></inline-formula> is transformed by a pre-processing function <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which includes resizing, normalization, noise reduction, and contrast enhancement, yielding a processed image <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></inline-formula></p></list-item>
<list-item>
<p><bold>Vehicle Detection:</bold> A detection function <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:mi>D</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> identifies the vehicle region, producing coordinates or a boundary box, expressed as <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mi>D</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p></list-item>
<list-item>
<p><bold>VMMR:</bold> A recognition function <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> processes <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> to determine <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, the vehicle make and model.</p></list-item>
<list-item>
<p><bold>ANPR:</bold> The ANPR process involves multiple steps:</p></list-item>
</list></p>
<p>A plate detection function <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>C</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> identifies the license plate region within <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:mi>C</mml:mi></mml:math></inline-formula>, yielding a plate image <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.
<list list-type="bullet">
<list-item>
<p>A text extraction function <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:mi>T</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> generates an initial text output <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.</p></list-item>
<list-item>
<p>A Sequence Matcher function:-</p></list-item>
</list></p>
<p><inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:mi>S</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> refines <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></p>
<p>using a reference dataset <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> (e.g., common plate formats in Pakistan), producing</p>
<p><inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>S</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.
<list list-type="simple">
<list-item><label>v.</label><p><bold>Color Detection:</bold> A color analysis function <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>C</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> determines <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, the dominant color, based on the vehicle region.</p></list-item>
</list></p>
<p>The outputs of these sub-functions are combined into <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:mi>F</mml:mi></mml:math></inline-formula>. The overall mapping is expressed as:</p>
<p><inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:mi>F</mml:mi><mml:mo>=</mml:mo><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> &#x003D; {<inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>(<inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>), <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:mi>S</mml:mi></mml:math></inline-formula>(<inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:mi>T</mml:mi></mml:math></inline-formula>(<inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>(<inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi>D</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>)), <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>), <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>(<inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:mi>D</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>))}.</p>
<p>The algorithm concludes with a matching function <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>F</mml:mi><mml:mo>,</mml:mo><mml:mi>D</mml:mi><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that compares <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:mi>F</mml:mi></mml:math></inline-formula> against the registered vehicle database <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:mi>D</mml:mi><mml:mi>B</mml:mi></mml:math></inline-formula>, producing the output <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:mi>O</mml:mi></mml:math></inline-formula>:
<list list-type="bullet">
<list-item>
<p><inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:mi>O</mml:mi><mml:mo>=</mml:mo><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>F</mml:mi><mml:mo>,</mml:mo><mml:mi>D</mml:mi><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> if the vehicle is matched.</p></list-item>
<list-item>
<p><inline-formula id="ieqn-55"><mml:math id="mml-ieqn-55"><mml:mi>O</mml:mi><mml:mo>=</mml:mo><mml:mi>M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>F</mml:mi><mml:mo>,</mml:mo><mml:mi>D</mml:mi><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> if the vehicle is unmatched.</p></list-item>
</list></p>
<p>The objective is to maximize the accuracy of <inline-formula id="ieqn-56"><mml:math id="mml-ieqn-56"><mml:mi>F</mml:mi></mml:math></inline-formula> while ensuring computational efficiency and robustness to environmental variations (e.g., poor lighting, occlusions in Pakistan). This is formulated as: Maximize the accuracy of <inline-formula id="ieqn-57"><mml:math id="mml-ieqn-57"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>V</mml:mi><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-58"><mml:math id="mml-ieqn-58"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and <inline-formula id="ieqn-59"><mml:math id="mml-ieqn-59"><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:math></inline-formula> subject to the constraint that the computational cost is below a threshold <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:mi>C</mml:mi></mml:math></inline-formula>, and the robustness is above a threshold <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:mi>R</mml:mi></mml:math></inline-formula>.</p>
<p>The system uses parallel processing to handle big data efficiently while doing image work at the same time. The system design improves performance for real-time vehicle recognition tasks. The system displays results through an easy-to-understand display that shows users vehicle information including make, model, license plate number, and color. The system uses Python as its main programming language while OpenCV, TensorFlow, and PyTorch support modules to process images and learn with machine and deep learning techniques.</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experimentation and Results</title>
<p>Our system combines Vehicle VMMR, Automatic Number Plate Recognition technology, and color recognition to identify vehicles fully.</p>
<p>To ensure robustness and generalizability, our system was validated on public and curated datasets, reflecting the integration of VMMR, ANPR, and color detection for Pakistani traffic scenarios. Pakistani Vehicles Cars Dataset, with approximately 364 images, was used alongside a curated dataset of 4000 Stanford Car images having 48 classes for training and validation. The Pakistani dataset achieved 90.63% accuracy, reflecting optimization for local vehicle types.</p>
<p>ANPR module was trained and validated on 364 Pakistani license plates along with YOLO, achieving 93.0% detection accuracy and 88.5% OCR accuracy, aligning with the overall 93.75% (<xref ref-type="fig" rid="fig-5">Fig. 5</xref>). The combined system shows reliable performance in these test results.</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Accuracy across modules: VMMR, ANPR and color detection</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-5.tif"/>
</fig>
<p><xref ref-type="fig" rid="fig-5">Fig. 5</xref> shows the accuracy rate for each module. The ANPR module achieved an accuracy of 93.7% along with varying lightening conditions. The VMMR module achieved an accuracy of 90.63% in identifying vehicle makes and models. This high accuracy indicates the module&#x2019;s ability to process complex visual data, effectively distinguishing different vehicle brands and models.</p>
<p><xref ref-type="fig" rid="fig-6">Fig. 6</xref> illustrates the training process of the VMMR system using the CNN architecture, MobileNetV2. The left graph displays the training and validation loss over 40 epochs, while the right graph shows the training and validation accuracy over the same number of epochs. These metrics are crucial for understanding the model&#x2019;s performance and generalization capabilities.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Training and validation accuracy graph for VMMR model training</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-6.tif"/>
</fig>
<p>The training accuracy plateaus near 0.99, indicating that the model has effectively learned from the training data. However, the validation accuracy levels out around 0.85, with slight fluctuations, suggesting the model&#x2019;s generalization on new unseen data has reached a peak with the current configuration.</p>
<p>In <xref ref-type="table" rid="table-2">Table 2</xref>, the VMMR outcomes are summarized as follows:</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>VMMR results, testing against the correct model and make label</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Instances</th>
<th>False positive</th>
<th>False negative</th>
<th>True positive</th>
<th>True negative</th>
<th>Results</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>2</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>3</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>4</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>5</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>6</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>7</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>8</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>9</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>10</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>11</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>12</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>13</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>14</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>15</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>16</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>17</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>18</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>19</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>20</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>Correct</td>
</tr>
<tr>
<td>21</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>22</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>23</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>24</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>25</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>26</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>27</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>28</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>29</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>Correct</td>
</tr>
<tr>
<td>30</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>Incorrect</td>
</tr>
<tr>
<td>31</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Incorrect</td>
</tr>
<tr>
<td>32</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>Incorrect</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><list list-type="bullet">
<list-item>
<p>True Positives (TP): 20 (cases with &#x201C;TRUE&#x201D; for Is True Vehicle)</p></list-item>
<list-item>
<p>True Negatives (TN): 9 (cases with &#x201C;FALSE&#x201D; for Is True Vehicle)</p></list-item>
<list-item>
<p>False Positives (FP): 2 (cases where &#x201C;Is True Vehicle&#x201D; is &#x201C;FALSE&#x201D; but the model predicted &#x201C;TRUE&#x201D;)</p></list-item>
<list-item>
<p>False Negatives (FN): 1 (cases where &#x201C;Is True Vehicle&#x201D; is &#x201C;TRUE&#x201D; but the model predicted &#x201C;FALSE&#x201D;)</p></list-item>
</list></p>
<p>From the results of VMMR shown in <xref ref-type="table" rid="table-2">Table 2</xref>, the accuracy, precision and recall has been calculated and is presented in <xref ref-type="table" rid="table-3">Table 3</xref>.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>VMMR results: accuracy, precision, and recall</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Metric</th>
<th>Score</th>
</tr>
</thead>
<tbody>
<tr>
<td>True positives (TP)</td>
<td>20</td>
</tr>
<tr>
<td>True negatives (TN)</td>
<td>9</td>
</tr>
<tr>
<td>False positives (FP)</td>
<td>2</td>
</tr>
<tr>
<td>False negatives (FN)</td>
<td>1</td>
</tr>
<tr>
<td>Accuracy</td>
<td>0.90625</td>
</tr>
<tr>
<td>Precision</td>
<td>0.9090909091</td>
</tr>
<tr>
<td>Recall</td>
<td>0.9523809524</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The ANPR module demonstrated a notable accuracy of 93.75% in reading license plates, as shown in <xref ref-type="fig" rid="fig-5">Fig. 5</xref>. This result indicates the module&#x2019;s effectiveness in preprocessing and accurately extracting license plate information. Techniques such as grayscale conversion and contrast enhancement contribute significantly to the module&#x2019;s ability to accurately recognize and read license plates.</p>
<p>The color detection module based on the HSV color space reached the accuracy of 71.87% in color classification (<xref ref-type="fig" rid="fig-5">Fig. 5</xref>). Although this accuracy is smaller than that of VMMR (90.63%) and ANPR (93.75%) it is good enough to be useful in practice, since it adds to the rest of the modules in our single framework. The HSV-based clustering algorithm was selected because of its computational simplicity, and therefore, could be used in resource-limited edge devices, and its resistance to illumination changes due to decoupling of hue and brightness (<xref ref-type="sec" rid="s3_3">Section 3.3</xref>). The issues such as shadows or color fading are overcome thanks to adaptive thresholding and preprocessing methods (e.g., normalization, contrast enhancement), confirmed on 32 examples (<xref ref-type="table" rid="table-4">Table 4</xref>) while color detection showing 15 True Positives out of 32 examples (<xref ref-type="table" rid="table-5">Table 5</xref>) and Suzuki Alto case study (92% accuracy, <xref ref-type="table" rid="table-6">Table 6</xref>). In comparison to deep learning-based classifiers (e.g., CNNs) that are resource-demanding and needs a substantial amount of labelled data, HSV clustering is more efficient and performance-wise, which is why it is part of the overall 93.3% vehicle matching accuracy <xref ref-type="table" rid="table-7">Table 7</xref>.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Color classification results (basic colors)</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Color name</th>
<th>False positive</th>
<th>False negative</th>
<th>True positive</th>
<th>True negative</th>
</tr>
</thead>
<tbody>
<tr>
<td>Blue/Cyan</td>
<td>0</td>
<td>0</td>
<td>3</td>
<td>0</td>
</tr>
<tr>
<td>Red/Pink/Maroon</td>
<td>1</td>
<td>0</td>
<td>3</td>
<td>2</td>
</tr>
<tr>
<td>Green</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>1</td>
</tr>
<tr>
<td>Yellow</td>
<td>0</td>
<td>2</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>Black/Dark Gray</td>
<td>2</td>
<td>0</td>
<td>3</td>
<td>0</td>
</tr>
<tr>
<td>White/Gray/Silver</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td>4</td>
</tr>
<tr>
<td>Blue/Cyan</td>
<td>0</td>
<td>0</td>
<td>3</td>
<td>0</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Color detection results: accuracy, precision, and recall</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Metric</th>
<th>Score</th>
</tr>
</thead>
<tbody>
<tr>
<td>True positives (TP)</td>
<td>15</td>
</tr>
<tr>
<td>True negatives (TN)</td>
<td>8</td>
</tr>
<tr>
<td>False positives (FP)</td>
<td>5</td>
</tr>
<tr>
<td>False negatives (FN)</td>
<td>4</td>
</tr>
<tr>
<td>Accuracy</td>
<td>0.71875</td>
</tr>
<tr>
<td>Precision</td>
<td>0.75</td>
</tr>
<tr>
<td>Recall</td>
<td>0.7894736842</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Evaluation of a single car result example</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col/>
<col/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th align="center">Module</th>
<th>Output</th>
<th>Accuracy rate</th>
<th align="center">Comments</th>
</tr>
</thead>
<tbody>
<tr>
<td>Vehicle Make and Model Recognition (VMMR)</td>
<td>Suzuki Alto 2007</td>
<td>95%</td>
<td>Successfully identified the make and model.</td>
</tr>
<tr>
<td>Automatic Number Plate Recognition (ANPR)</td>
<td>MNACB 3685</td>
<td>90%</td>
<td>Correctly detected and read the license plate.</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-7">
<label>Table 7</label>
<caption>
<title>Summary of results</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Module</th>
<th>Accuracy rate</th>
<th>Analysis</th>
</tr>
</thead>
<tbody>
<tr>
<td>Overall system performance</td>
<td>93.3%</td>
<td>High overall accuracy in matching vehicles.</td>
</tr>
<tr>
<td>Matched vehicles</td>
<td>93.3%</td>
<td>Percentage of vehicles correctly matched in the database.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The integrated performance of the VMMR, ANPR, and Color Detection modules resulted in a vehicle matching accuracy of 93.33% when cross-referenced with the registered database. This signifies that 93.3% of the vehicles were successfully matched with the registration database, with only 6.7% unmatched. The high success rate underscores the effectiveness of combining results from multiple modules for comprehensive vehicle identification. High accuracy across individual modules, coupled with overall vehicle matching success, indicates the system&#x2019;s strong potential for practical applications in vehicle recognition and management.</p>
<p><xref ref-type="table" rid="table-6">Table 6</xref> presents result for a specific vehicle, the Suzuki Alto 2007, tested for system consistency and robustness. 35 variations of this vehicle&#x2019;s image were generated with adjustments in factors like angles and lighting conditions. Each image was tested across all three modules&#x2014;VMMR, ANPR, and Color Detection, with the following consolidated results:
<list list-type="bullet">
<list-item>
<p>VMMR: The system correctly identified the Suzuki Alto with overall VMMR accuracy of 95% across the 35 variations.</p>
</list-item>
<list-item>
<p>ANPR: The license plate was successfully detected in 32 out of 35 cases. A plate recognition accuracy of 90% was achieved with a 0.75 matching threshold, requiring at least a 75% match to determine success.</p></list-item>
<list-item>
<p>Color Detection: In 33 out of 35 cases, the system accurately classified the color as &#x2018;Gray/ Silver&#x2019;, achieving a color detection accuracy of 92%.</p></list-item>
</list></p>
<p>Next, we tested the system with 35 different variations of the same vehicle image to evaluate its robustness under varying conditions. These unified results for the Suzuki Alto show that the system maintains high accuracy in vehicle identification, recognition, and classification even with variations in the vehicle image data.</p>
<p><xref ref-type="table" rid="table-7">Table 7</xref> shows the results, confirming that the integrated system effectively performs vehicle make and model recognition, license plate recognition, and color detection, resulting in a comprehensive vehicle identification solution. The system&#x2019;s high accuracy and reliability across modules make it suitable for implementation in applications like traffic monitoring and law enforcement.</p>

<p>Since the system&#x2019;s overall system performance relies on the accuracy of matched vehicles, the overall system performance aligns with matched vehicle accuracy. Visual results and statistical data further validate the system, showing its ability to process and analyze vehicle images under diverse conditions. From a holistic perspective, the system combines several vehicles identification features, with each module contributing to a better understanding of each vehicle. This integrated solution leverages advanced technologies to enhance recognition accuracy and offers the practical framework for real-world applications.</p>
</sec>
<sec id="s5">
<label>5</label>
<title>Discussions and Comparison</title>
<p>The combined results from the VMMR, ANPR, and Color Detection modules offer a clear view of the system&#x2019;s capabilities. By merging these components, the vehicle identification system proves to be both robust and effective. As shown in <xref ref-type="fig" rid="fig-7">Fig. 7</xref>, an example vehicle is detected with key details such as make, model, license plate, and color. This integrated approach highlights the system&#x2019;s ability to compile data from different modules, forming a complete profile of the vehicle.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Example of detected vehicle showing detected number plate, color, and model</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-7.tif"/>
</fig>
<p>The system&#x2019;s overall performance in matching vehicles with a registered database is illustrated in <xref ref-type="fig" rid="fig-8">Fig. 8</xref>. The system successfully matched 93.3% of vehicles, with only 6.7% remaining unmatched. This high matching rate demonstrates the system&#x2019;s effectiveness in cross-referencing data from each module for accuracy. The success rate emphasizes the system&#x2019;s reliability and strength in identifying vehicles.</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>Registered vs. matched vehicles from the database</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_67538-fig-8.tif"/>
</fig>
<p>This study presents a unified approach that integrates Vehicle Make and Model Recognition (VMMR), Automatic Number Plate Recognition (ANPR), and color detection into a cohesive system for advanced vehicle tracking. By leveraging complementary modules, the system enhances accuracy and reliability through cross-verification, minimizing recognition errors. Pixel-level integration further refines the process, enabling more precise and efficient identification&#x2014;an essential attribute for real-world vehicle recognition and verification systems.</p>
<p>Ghaida Saadouli et al. [<xref ref-type="bibr" rid="ref-13">13</xref>] proposed a security solution combining license plate recognition, car model identification, and facial recognition for access control. Their method utilized SIFT and DoG for vehicle detection, while OCR and the Viola&#x2013;Jones algorithm were used for number plate and facial recognition. Although their system achieved a 75% success rate in identifying vehicles, the results were based on tests involving toy cars under controlled conditions, limiting the generalizability of their findings.</p>
<p>In contrast, our research introduces a more scalable and contemporary system employing MobileNetV2 for VMMR, YOLO for ANPR, and HSV-based clustering for color detection. Tested in diverse real-time environments, our system consistently achieved a 93.3% accuracy rate&#x2014;demonstrating strong performance regardless of lighting or environmental variations. Moreover, the use of deep learning techniques, along with containerized deployment, makes our solution suitable for large-scale smart city implementations.</p>
<p>While the HSV clustering method for color detection yielded lower accuracy (71.87%) compared to deep learning-based classifiers, it offered a lightweight and computationally efficient alternative&#x2014;particularly advantageous for deployment on edge devices. In future work, we aim to explore hybrid models that combine HSV clustering with lightweight convolutional neural networks (CNNs) to improve accuracy under extreme lighting conditions while maintaining operational efficiency.</p>
<p>Overall, the integrated system proved to be more robust than standalone solutions, achieving a 93.3% matching accuracy (<xref ref-type="fig" rid="fig-8">Fig. 8</xref>). This demonstrates the effectiveness of the combined modules in delivering reliable performance suitable for real-world intelligent transportation applications.</p>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion</title>
<p>The integration of VMMR, ANPR, and color detection within a unified framework yields a robust and reliable vehicle identification system, well-suited for traffic monitoring, law enforcement, and vehicle management&#x2014;particularly in the context of Pakistan. By leveraging proven techniques&#x2014;MobileNetV2 for vehicle make and model recognition, YOLO combined with EasyOCR for license plate recognition, and HSV clustering for color detection&#x2014;the proposed system achieves a commendable accuracy of 93.3% (as detailed in <xref ref-type="sec" rid="s4">Section 4</xref>), consistently outperforming modular or standalone solutions. Comprehensive testing on diverse datasets&#x2014;including the Stanford Cars dataset for VMMR and a custom set of 364 Pakistani license plates for ANPR&#x2014;demonstrates the system&#x2019;s effectiveness, adaptability, and resilience under various environmental conditions.</p>
<p>In the future, we intend to extend the ANPR module to support multilingual and international license plates, such as those found in Chinese and European regions, using datasets like CCPD and UFPR-ALPR. Enhancing color recognition under challenging lighting scenarios is another focus, with plans to incorporate lightweight CNN-based classifiers. To improve real-time deployment, we will investigate edge computing solutions, including Docker-based containerization, to minimize latency within smart city infrastructure. In parallel, advanced data augmentation strategies will be developed to better handle occlusion and adverse weather conditions such as fog or rain. Finally, adopting a cloud-native architecture with microservices will enable horizontal scaling, making the system capable of managing large-scale urban traffic networks efficiently.These enhancements aim to reinforce the framework&#x2019;s real-world applicability, enabling high-performance, real-time vehicle recognition while preserving its modularity and computational efficiency.</p>
</sec>
</body>
<back>
<ack>
<p>The authors thank the Multimedia University AI Research Lab and Telekom Research and Development Sdn Bhd for support. We are grateful to the contributors of the Pakistani Cars Image Dataset on Kaggle and to our colleagues for valuable discussions.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>This work was supported in part by Multimedia University Research Fellow under Grant MMUI/250008 and in part by Telekom Research and Development Sdn Bhd under Grant RDTC/241149.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>Conceptualization, Kashif Sultan and Saad Sadiq; Methodology, Saad Sadiq; Software, Saad Sadiq; Validation, Muhammad Usman Hashmi and Kashif sultan; Formal analysis, Muhammad Sheraz; Investigation, Muhammad Sheraz and Teong Chee Chuah; Data curation, Muhammad Sheraz and Kashif Sultan; Writing&#x2014;original draft, Saad Sadiq and Kashif Sultan; Writing&#x2014;review &#x0026; editing, Muhammad Usman Hashmi and Teong Chee Chuah; Supervision, Kashif Sultan. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The data that support the findings of this study are openly available in Kaggle at <ext-link ext-link-type="uri" xlink:href="https://www.kaggle.com/datasets/salmanadam1052/pakistani-cars-image-dataset">https://www.kaggle.com/datasets/salmanadam1052/pakistani-cars-image-dataset</ext-link> (accessed on 2 July 2025).</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>There is no ethical approval required.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mustafa</surname> <given-names>TT</given-names></string-name>, <string-name><surname>Karabatak</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Real time car model and plate detection system by using deep learning architectures</article-title>. <source>IEEE Access</source>. <year>2024</year>;<volume>12</volume>:<fpage>107616</fpage>&#x2013;<lpage>30</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ACCESS.2024.3413554</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nazemi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Azimifar</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Shafiee</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Real-time vehicle make and model recognition using unsupervised feature learning</article-title>. <source>IEEE Trans Intell Transp Syst</source>. <year>2020</year>;<volume>21</volume>(<issue>7</issue>):<fpage>3080</fpage>&#x2013;<lpage>90</lpage>. doi:<pub-id pub-id-type="doi">10.1109/tits.2019.2924830</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Tan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Le</surname> <given-names>Q</given-names></string-name></person-group>. <article-title>EfficientNet: rethinking model scaling for convolutional neural networks</article-title>. In: <conf-name>Proceedings of the International Conference on Machine Learning (ICML)</conf-name>; <year>2019</year>; <publisher-loc>Long Beach, CA, USA</publisher-loc>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.1905.11946</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Dosovitskiy</surname> <given-names>A</given-names></string-name>, <string-name><surname>Beyer</surname> <given-names>L</given-names></string-name>, <string-name><surname>Kolesnikov</surname> <given-names>A</given-names></string-name>, <string-name><surname>Weissenborn</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zhai</surname> <given-names>X</given-names></string-name>, <string-name><surname>Unterthiner</surname> <given-names>T</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>An image is worth 16 x 16 words: transformers for image recognition at scale</article-title>. <comment>arXiv:2010.11929. 2020</comment>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.2010.11929</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Shashidhar</surname> <given-names>R</given-names></string-name>, <string-name><surname>Manjunath</surname> <given-names>AS</given-names></string-name>, <string-name><surname>Kumar</surname> <given-names>RS</given-names></string-name>, <string-name><surname>Roopa</surname> <given-names>M</given-names></string-name>, <string-name><surname>Puneeth</surname> <given-names>SB</given-names></string-name></person-group>. <article-title>Vehicle number plate detection and recognition using YOLO-v3 and OCR method</article-title>. In: <conf-name>2021 IEEE International Conference on Mobile Networks and Wireless Communications (ICMNWC)</conf-name>. <publisher-loc>Tumkur, Karnataka, India</publisher-loc>: <publisher-name>IEEE</publisher-name>; <year>2021</year>. p. <fpage>1</fpage>&#x2013;<lpage>5</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICMNWC52512.2021.9688266</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Khan</surname> <given-names>MG</given-names></string-name>, <string-name><surname>Saeed</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zulfiqar</surname> <given-names>A</given-names></string-name>, <string-name><surname>Ghadi</surname> <given-names>YY</given-names></string-name>, <string-name><surname>Adnan</surname> <given-names>M</given-names></string-name></person-group>. <article-title>A novel deep learning based anpr pipeline for vehicle access control</article-title>. <source>IEEE Access</source>. <year>2022</year>;<volume>10</volume>:<fpage>64172</fpage>&#x2013;<lpage>84</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ACCESS.2022.3183017</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Gregg</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Autonomous police vehicles: the impact on law enforcement. Homeland Security Affairs</article-title>; <year>2019</year>. <comment>[cited 2025 Aug 2]</comment>. Available from: <ext-link ext-link-type="uri" xlink:href="https://www.hsdl.org/c/view?docid=825205">https://www.hsdl.org/c/view?docid=825205</ext-link>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Shrivastava</surname> <given-names>S</given-names></string-name>, <string-name><surname>Singh</surname> <given-names>SK</given-names></string-name>, <string-name><surname>Shrivastava</surname> <given-names>K</given-names></string-name>, <string-name><surname>Sharma</surname> <given-names>V</given-names></string-name></person-group>. <article-title>CNN-based automated vehicle registration number plate recognition system</article-title>. In: <conf-name>2020 2nd International Conference on Advances in Computing, Communication Control and Networking (ICACCCN)</conf-name>. <publisher-loc>Greater Noida, India</publisher-loc>: <publisher-name>IEEE</publisher-name>. p. <fpage>795</fpage>&#x2013;<lpage>802</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICACCCN51052.2020.9362705</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Siegel</surname> <given-names>JE</given-names></string-name>, <string-name><surname>Erb</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Sarma</surname> <given-names>SE</given-names></string-name></person-group>. <article-title>A survey of the connected vehicle landscape&#x2014;architectures, enabling technologies, applications, and development areas</article-title>. <source>IEEE Trans Intell Transp Syst</source>. <year>2018</year>;<volume>19</volume>(<issue>8</issue>):<fpage>2391</fpage>&#x2013;<lpage>406</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TITS.2017.2750624</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>F</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>D</given-names></string-name></person-group>. <chapter-title>Computing technology in autonomous vehicle</chapter-title>. In: <person-group person-group-type="editor"><string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>H</given-names></string-name></person-group>, editors. <source>Advanced driver assistance systems and autonomous vehicles</source>. <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer</publisher-name>; <year>2022</year>. doi:<pub-id pub-id-type="doi">10.1007/978-981-19-5053-7_3</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>&#x00C5;lvarez-Bazo</surname> <given-names>F</given-names></string-name>, <string-name><surname>Snchez-Cambronero</surname> <given-names>S</given-names></string-name>, <string-name><surname>Vallejo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Glez-Morcillo</surname> <given-names>C</given-names></string-name>, <string-name><surname>Rivas</surname> <given-names>A</given-names></string-name>, <string-name><surname>Gallego</surname> <given-names>I</given-names></string-name></person-group>. <article-title>A low-cost automatic vehicle identification sensor for traffic networks analysis</article-title>. <source>Sensors</source>. <year>2020</year>;<volume>20</volume>(<issue>19</issue>):<fpage>5589</fpage>. doi:<pub-id pub-id-type="doi">10.3390/s20195589</pub-id>; <pub-id pub-id-type="pmid">33003528</pub-id></mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Guerrero-Ibez</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zeadally</surname> <given-names>S</given-names></string-name>, <string-name><surname>Contreras-Castillo</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Sensor technologies for intelligent. transportation systems</article-title>. <source>Sensors</source>. <year>2018</year>;<volume>18</volume>(<issue>4</issue>):<fpage>1212</fpage>. doi:<pub-id pub-id-type="doi">10.3390/s18041212</pub-id>; <pub-id pub-id-type="pmid">29659524</pub-id></mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Saadouli</surname> <given-names>G</given-names></string-name>, <string-name><surname>Elburdani</surname> <given-names>MI</given-names></string-name>, <string-name><surname>Al-Qatouni</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Kunhoth</surname> <given-names>S</given-names></string-name>, <string-name><surname>Al-Maadeed</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Automatic and secure electronic gate system using fusion of license plate, car make recognition and face detection</article-title>. In: <conf-name>2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT)</conf-name>. <publisher-loc>Doha, Qatar</publisher-loc>; <year>2020</year>. p. <fpage>79</fpage>&#x2013;<lpage>84</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICIoT48696.2020.9089615</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tourani</surname> <given-names>A</given-names></string-name>, <string-name><surname>Shahbahrami</surname> <given-names>A</given-names></string-name>, <string-name><surname>Soroori</surname> <given-names>S</given-names></string-name>, <string-name><surname>Khazaee</surname> <given-names>S</given-names></string-name>, <string-name><surname>Suen</surname> <given-names>CY</given-names></string-name></person-group>. <article-title>A robust deep learning approach for automatic iranian vehicle license plate detection and recognition for surveillance systems</article-title>. <source>IEEE Access</source>. <year>2020</year>;<volume>8</volume>:<fpage>201317</fpage>&#x2013;<lpage>30</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ACCESS.2020.3035613</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>YC</given-names></string-name>, <string-name><surname>Han</surname> <given-names>CC</given-names></string-name>, <string-name><surname>Hsieh</surname> <given-names>CT</given-names></string-name>, <string-name><surname>Fan</surname> <given-names>KC</given-names></string-name></person-group>. <article-title>Vehicle color classification using manifold learning methods from urban surveillance videos</article-title>. <source>EURASIP J Image Video Process</source>. <year>2014</year>;<volume>2014</volume>(<issue>1</issue>) :<fpage>48</fpage>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Najeeb</surname> <given-names>SA</given-names></string-name>, <string-name><surname>Raza</surname> <given-names>RH</given-names></string-name>, <string-name><surname>Yusuf</surname> <given-names>A</given-names></string-name>, <string-name><surname>Sultan</surname> <given-names>Z</given-names></string-name></person-group>. <article-title>Fine-grained vehicle classification in urban traffic scenes using deep learning</article-title>. In: <conf-name>Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications: Enhancing Research and Innovation through the Fourth Industrial Revolution</conf-name>. <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer Singapore</publisher-name>; <year>2022</year>. p. <fpage>902</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-981-19-1589-2_86</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Kamoji</surname> <given-names>S</given-names></string-name>, <string-name><surname>Koshti</surname> <given-names>D</given-names></string-name>, <string-name><surname>Dmonte</surname> <given-names>A</given-names></string-name>, <string-name><surname>George</surname> <given-names>SJ</given-names></string-name>, <string-name><surname>Pereira</surname> <given-names>CS</given-names></string-name></person-group>. <article-title>Image processing based vehicle identification and speed measurement</article-title>. In: <conf-name>2020 International Conference on Inventive Computation Technologies (ICICT)</conf-name>. <publisher-loc>Coimbatore, India</publisher-loc>: <publisher-name>IEEE</publisher-name>; <year>2020</year>. p. <fpage>523</fpage>&#x2013;<lpage>7</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICICT48043.2020.9112482</pub-id>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Sinha</surname> <given-names>H</given-names></string-name>, <string-name><surname>Soumya</surname> <given-names>GV</given-names></string-name>, <string-name><surname>Undavalli</surname> <given-names>S</given-names></string-name>, <string-name><surname>Jeyanthi</surname> <given-names>R</given-names></string-name></person-group>. <article-title>An effective real-time approach to automatic number plate recognition (ANPR) using YOLOv3 and OCR</article-title>. In: <conf-name> Intelligent Systems, Technologies and Applications: Proceedings of Sixth ISTA 2020, India</conf-name>. <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer</publisher-name>; <year>2021</year>. p. <fpage>299</fpage>&#x2013;<lpage>314</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-981-33-4265-9_21</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tian</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Han</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Adaptive binarization for vehicle state images based on contrast preserving decolorization and major cluster estimation</article-title>. <source>IEICE Trans Inf Syst</source>. <year>2022</year>;<volume>105</volume>(<issue>3</issue>):<fpage>679</fpage>&#x2013;<lpage>88</lpage>. doi:<pub-id pub-id-type="doi">10.1587/transinf.2021EDP7207</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Alamgir</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Shuvro</surname> <given-names>AA</given-names></string-name>, <string-name><surname>Al Mushabbir</surname> <given-names>M</given-names></string-name>, <string-name><surname>Rani</surname> <given-names>NJ</given-names></string-name>, <string-name><surname>Rahman</surname> <given-names>MM</given-names></string-name>, <string-name><surname>Ahmed</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Performance analysis of yolo-based architectures for vehicle detection from traffic images in Bangladesh</article-title>. In: <conf-name>2022 25th International Conference on Computer and Information Technology (ICCIT)</conf-name>. <publisher-loc>Cox&#x0027;s Bazar, Bangladesh</publisher-loc>: <publisher-name>IEEE</publisher-name>; <year>2022</year>. p. <fpage>982</fpage>&#x2013;<lpage>7</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICCIT57467.2022.10052737</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lubna</surname> <given-names>N</given-names></string-name>, <string-name><surname>Mufti</surname> <given-names>N</given-names></string-name>, <string-name><surname>Shah</surname> <given-names>SAA</given-names></string-name></person-group>. <article-title>Automatic number plate recognition: a detailed survey of relevant algorithms</article-title>. <source>Sensors</source>. <year>2021</year>;<volume>21</volume>(<issue>9</issue>):<fpage>3028</fpage>. doi:<pub-id pub-id-type="doi">10.3390/s21093028</pub-id>; <pub-id pub-id-type="pmid">33925845</pub-id></mixed-citation></ref>
</ref-list>
</back></article>