Validation of Citizen-Centered Spatial Information on Urban Visual Pollution Using Deep Learning Algorithms

Article type: Applied research

Authors

1 Department of Remote Sensing and Geographic Information Systems, Faculty of Geography, University of Tehran, Tehran, Iran

2 Department of Environmental Design Engineering, Faculty of Environment, University of Tehran, Tehran, Iran

10.22059/jurbangeo.2026.405038.2121

Abstract

Visual pollution is a major challenge in urban landscape management. Traditional monitoring methods are costly and limited in coverage, whereas citizen-centered data, despite their broad coverage and lower cost, require a validation mechanism because of their heterogeneous quality and potential for error. This study aimed to validate citizen-centered spatial data for assessing visual pollution on urban walls and to present a hybrid framework based on deep learning and citizen participation. The study used a mixed design in four stages: collecting spatial and image data through a web-based platform in Tehran; cleaning the images and labeling them into four classes; training three deep learning models (ResNet50, EfficientNetB0, and EfficientNetV2-L); and finally validating the citizen-centered data by comparing user labels with model outputs. The EfficientNetV2-L model performed best, with an accuracy of 87.78%, and was more stable in recognizing difficult classes. The learning curves confirmed adequate convergence and control of overfitting. Integrating these two sources produced an efficient and reliable framework for monitoring visual pollution and supporting participatory decision-making. The study offers a practical framework for using deep learning models as a dynamic reference for validating citizen-centered data. Integrating these results into management systems enables continuous monitoring, real-time feedback, and simultaneous improvement of data quality and model accuracy, making the framework an effective tool for urban management decision-making.

Article Title [English]

Validation of Volunteered Geographic Information on Urban Visual Pollution Using Deep Learning Algorithms

Authors [English]

  • Mohammadreza Jelokhani 1
  • Sahar Danyali 1
  • Erfan Motaghiyan 1
  • Azadeh Mohajer Milani 2
1 Department of GIS and Remote Sensing, Faculty of Geography, University of Tehran, Tehran, Iran
2 Department of Environmental Design Engineering, Faculty of Environment, University of Tehran, Tehran, Iran
Abstract [English]

ABSTRACT
Visual pollution is one of the most significant challenges in urban landscape management. Traditional monitoring methods, despite their high accuracy, are time-consuming, costly, and limited in spatial coverage. In contrast, citizen-centered spatial data enable broader and more cost-effective monitoring; however, the heterogeneity and potential errors in crowdsourced data highlight the need for their scientific validation. This study aims to validate citizen-centered spatial data in assessing visual pollution on urban walls and to develop a hybrid framework combining deep learning and citizen participation, thereby enhancing the reliability of these data for urban decision-making. The study employed a mixed-methods approach in four stages. First, spatial and visual data on visual pollution were collected from various areas of Tehran via a web-based platform. Next, the images were labeled into four classes. In the third stage, three convolutional neural network (CNN) models (ResNet50, EfficientNetB0, and EfficientNetV2-L) were trained. Finally, by comparing model outputs with citizen labels, a validation mechanism for the data was developed and model performance was evaluated. The EfficientNetV2-L model achieved the highest accuracy at 87.78% and showed greater stability in classifying difficult data. Learning curves confirmed stable convergence and effective control of overfitting. The results demonstrate that integrating citizen-centered data with deep learning models provides an efficient and reliable framework for monitoring visual pollution. This framework can serve as a dynamic reference for validating spatial data and as an effective tool for intelligent urban landscape management.
 
Extended Abstract
Introduction
Visual pollution, as one of the growing challenges in contemporary cities, exerts a multilayered and profound impact on the quality of the urban environment, the mental well-being of citizens, and the way public spaces are perceived. The urban landscape, as the setting that citizens encounter on a daily basis, plays a significant role in shaping feelings of belonging, safety, satisfaction, and place identity. When this landscape becomes visually disordered, its consequences extend beyond a mere decline in aesthetic quality and may lead to heightened feelings of abandonment, disorder, reduced public trust, and the weakening of social capital. In this context, urban walls, due to their spatial extent, continuous visibility, and capacity to function as carriers of both formal and informal messages, play a substantial role in shaping the visual quality of the city.
Urban walls can perform a dual function. On the one hand, they may serve as platforms for urban art, convey local identity, and strengthen a sense of belonging. On the other hand, in the absence of effective management and supervision, they can become sites for the accumulation of illegal advertisements, deteriorated posters, wall writings, unauthorized graffiti, and signs of physical decay. This condition is particularly intensified in high-density areas and disadvantaged neighborhoods, contributing to the reproduction of spatial inequality across the city. Accordingly, understanding the extent of visual pollution on urban walls and continuously monitoring it are fundamental prerequisites for urban landscape management.
Despite the importance of this issue, conventional approaches to monitoring visual pollution have largely relied on field observations, expert assessments, and qualitative judgments. While these methods can yield acceptable results in limited projects and small-scale contexts, they face serious limitations at the metropolitan scale. The high cost of field studies, the time-consuming nature of data collection, dependence on specialized human resources, and the lack of rapid repeatability are among the main challenges associated with these approaches. Moreover, the outcomes of such studies are often static and cross-sectional, limiting their capacity to capture temporal and spatial changes in a timely manner.
In recent years, citizen-generated data have attracted increasing attention as a novel source of urban information. Citizen participation in reporting and documenting environmental conditions enables the production of large volumes of data with extensive spatial and temporal coverage. In addition to reducing monitoring costs, this approach enhances social participation, increases public awareness, and strengthens civic responsibility toward the living environment. Nevertheless, participatory data are inherently heterogeneous and uncertain. Variations in users’ levels of knowledge, differences in perceptual interpretations of concepts, poor image quality, and spatial inaccuracies are among the factors that may challenge the reliability of such data.
At the same time, recent advances in deep learning and urban image analysis have created new opportunities for automated and scalable monitoring of visual phenomena. Convolutional neural networks demonstrate a strong capacity to extract visual features and classify images, enabling them to identify complex patterns that are difficult or time-consuming for humans to detect. However, these models are also highly dependent on accurate and reliable training data, and the use of noisy or biased data may lead to errors or systematic distortions. Consequently, the central issue addressed by this research lies at the intersection of these two domains: how to combine the potential of citizen-generated data with the analytical power of deep learning within a single complementary, self-correcting framework.
The primary objective of this study is to design and test a human–machine feedback mechanism to validate citizen-generated data on visual pollution on urban walls. Within this framework, humans and machines are not treated as independent sources of information, but rather as components of a learning system that mutually enhance one another’s performance. By focusing on urban walls as one of the most prominent elements of the urban landscape, the study enables a more precise definition of indicators and reduces conceptual ambiguity surrounding visual pollution.
From a conceptual perspective, visual pollution in this research is defined through a multidimensional framework encompassing physical, perceptual, and behavioral dimensions. The physical dimension includes manifestations such as illegal advertisements, deteriorated posters, and dilapidated or abandoned walls. The perceptual dimension addresses citizens’ interpretations of order, aesthetics, and environmental quality, while the behavioral dimension examines how these conditions influence social actions and reactions. This framework allows for a clearer distinction between visually disruptive walls and those possessing artistic or cultural value.
 
Methodology
The research adopts a mixed-methods approach that integrates field data, citizen participation, and deep learning algorithms. Tehran was selected as the study area because its large scale, high degree of morphological diversity, and significant levels of visual pollution provide an appropriate context for testing the proposed framework. The selection of this city enabled the examination of a wide range of urban wall conditions and enhanced the generalizability of the findings.
The data collection process followed two complementary paths. In the first path, researchers conducted systematic field surveys to capture images of urban walls. These images were collected to establish a controlled reference dataset and played a crucial role in the initial training of deep learning models. Attention was given to controlling image quality, viewing angles, and spatial accuracy to minimize data errors. In the second path, a web-based platform was designed and implemented to facilitate citizen participation. This platform allowed citizens to upload images of urban walls along with their geographic locations, enabling voluntary reporting of observed conditions. This stage significantly increased the volume of data and improved the spatial coverage of the study, while also promoting active citizen engagement in the monitoring process.
Following data collection, a data cleaning process was conducted to enhance dataset quality. Duplicate, low-quality, irrelevant, or spatially inaccurate images were removed. This initial filtering was essential to reduce noise and prevent the negative influence of unreliable data on model training. Subsequently, the remaining images were labeled according to predefined visual pollution categories. The four categories were walls with illegal advertisements, walls with writings and unauthorized graffiti, deteriorated and abandoned walls, and clean or artistically painted walls as a control class.
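The cleaning step described above can be sketched as a simple filter. This is a minimal illustration, not the authors' pipeline: the `Submission` record, the `STUDY_AREA` bounding box, and the `MIN_SIDE_PX` threshold are all hypothetical values chosen for the example, and the duplicate check only catches byte-identical files. Judging relevance or near-duplicates would need additional tools or manual review.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Submission:
    """Hypothetical record for one uploaded image (not the paper's schema)."""
    image_bytes: bytes
    width: int
    height: int
    lat: float
    lon: float


# Assumed values for illustration only; the paper does not report them.
# Rough bounding box around Tehran: (min_lat, max_lat, min_lon, max_lon).
STUDY_AREA = (35.5, 35.9, 51.0, 51.7)
MIN_SIDE_PX = 224  # assumed minimum usable image side length


def clean(submissions):
    """Drop exact duplicates, low-resolution images, and out-of-area points."""
    min_lat, max_lat, min_lon, max_lon = STUDY_AREA
    seen_digests = set()
    kept = []
    for s in submissions:
        # Exact-duplicate check on the raw file content.
        digest = hashlib.sha256(s.image_bytes).hexdigest()
        if digest in seen_digests:
            continue
        seen_digests.add(digest)
        # Low-quality check: shortest side below the assumed threshold.
        if min(s.width, s.height) < MIN_SIDE_PX:
            continue
        # Spatial check: location must fall inside the assumed study area.
        if not (min_lat <= s.lat <= max_lat and min_lon <= s.lon <= max_lon):
            continue
        kept.append(s)
    return kept
```

Calling `clean` on a list of `Submission` objects returns only the records that pass all three checks, in their original order.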
Label quality was ensured through random reviews and corrections of ambiguous cases, which reduced human labeling errors and improved the reliability of the training data. After preparing the dataset, three convolutional neural network architectures (ResNet50, EfficientNetB0, and EfficientNetV2-L) were employed for image classification. Images were preprocessed and augmented before being divided into training, validation, and test sets. Models were trained with consistent settings to enable fair performance comparisons. A range of evaluation metrics was applied to enable a detailed assessment of each model’s strengths and weaknesses.
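The text does not list which evaluation metrics were used. As a hedged illustration, the sketch below computes overall accuracy and per-class precision, recall, and F1, which are common choices for multi-class image classification. The short class names in `CLASSES` are hypothetical identifiers for the four categories described above.

```python
from collections import Counter

# Hypothetical short names for the four categories in the text.
CLASSES = ["advertisement", "graffiti", "deteriorated", "clean"]


def accuracy(y_true, y_pred):
    """Share of samples whose predicted label equals the reference label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_metrics(y_true, y_pred, classes=CLASSES):
    """Return precision, recall, and F1 for each class as a nested dict."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but the reference was another class
            fn[t] += 1  # reference class t was missed
    report = {}
    for c in classes:
        predicted = tp[c] + fp[c]
        actual = tp[c] + fn[c]
        precision = tp[c] / predicted if predicted else 0.0
        recall = tp[c] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[c] = {"precision": precision, "recall": recall, "f1": f1}
    return report
```

Per-class figures matter here because a single accuracy value can hide weak performance on the harder categories, such as separating unauthorized graffiti from artistic wall paintings.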
 
Results and discussion
The results of model training and evaluation indicated that deeper, better-optimized models achieved superior performance in identifying different manifestations of visual pollution, with EfficientNetV2-L reaching the highest accuracy (87.78%). These models were particularly effective in distinguishing subtle differences between walls with artistic value and those that constituted visual disturbance. In contrast, simpler models exhibited higher error rates in certain categories, highlighting the importance of selecting appropriate architectures for analyzing participatory data.
One of the most significant components of the study is the validation of citizen-generated data through the human–machine feedback mechanism. In this process, the label assigned by the citizen is compared with the output generated by the deep learning model. When the two labels are consistent, the data point is considered reliable. In cases of inconsistency, the data point is flagged as requiring review and can be reintroduced into a correction cycle. This process enables the gradual elimination of unreliable data and improves the overall quality of the dataset. Simultaneously, the deep learning model benefits from receiving corrected data, thereby enhancing its performance over time. As a result, a dynamic feedback loop is established that concurrently improves data quality and model accuracy.
Analysis of the results reveals that the highest level of agreement between citizen labels and model predictions occurs in clearer categories, such as clean walls or walls with illegal advertisements. Conversely, the greatest discrepancies are observed in distinguishing unauthorized graffiti from artistic wall paintings. This finding reflects the perceptual complexity of these categories and underscores the influence of cultural and social context on citizens’ interpretations.
Overall, the findings demonstrate that citizen-generated data, when supported by deep learning models and appropriate validation mechanisms, can serve as a reliable source for monitoring urban visual pollution. The proposed framework enables the production of timely information, identification of visual pollution hotspots, and prioritization of urban management interventions.
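The label comparison at the core of the validation mechanism can be sketched as follows. This is a minimal illustration under stated assumptions: the `(record_id, citizen_label, model_label)` tuple layout and both function names are hypothetical, and grouping agreement by the citizen's label is one possible design choice rather than the paper's reported procedure.

```python
from collections import defaultdict


def validate(records):
    """Split records into reliable ones and ones flagged for review.

    Each record is a (record_id, citizen_label, model_label) tuple.
    """
    reliable, needs_review = [], []
    for record_id, citizen_label, model_label in records:
        if citizen_label == model_label:
            reliable.append(record_id)
        else:
            needs_review.append(record_id)
    return reliable, needs_review


def agreement_by_class(records):
    """Agreement rate between citizen and model, grouped by citizen label."""
    total = defaultdict(int)
    agree = defaultdict(int)
    for _, citizen_label, model_label in records:
        total[citizen_label] += 1
        if citizen_label == model_label:
            agree[citizen_label] += 1
    return {label: agree[label] / total[label] for label in total}
```

Records returned in `needs_review` would go to the correction cycle described above, while the per-class agreement rates show which categories citizens and the model tend to disagree on.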
 
 
Conclusion
In conclusion, the study demonstrates that integrating citizen-generated data with deep learning offers a novel, cost-effective, and scalable approach to monitoring visual pollution. Despite limitations such as the focus on a single city and specific visual categories, the proposed framework shows strong potential for extension and application in future research. It can serve as a strategic tool for urban landscape management and the enhancement of environmental quality in cities.
 
Funding
This research received no funding.
 
Authors’ Contribution
The authors contributed equally to the conceptualization and writing of the article. All authors approved the content of the manuscript and agreed on all aspects of the work.
 
Conflict of Interest
The authors declare no conflict of interest.
 
Acknowledgments
We are grateful to all the scientific consultants of this paper.

Keywords [English]

  • Visual Pollution Assessment
  • Volunteered Geographic Information (VGI)
  • Deep Learning
  • Convolutional Neural Networks
  • Data Validation