Fusión temprana de descriptores extraídos de mapas de prominencia multi-nivel para clasificar imágenes

Eduardo Fidalgo Fernández; Enrique Alegre Gutiérrez; Laura Fernández Robles; Víctor González Castro

Fidalgo, E. [1]; Alegre, E. [1]; Fernández-Robles, L. [1]; González-Castro, V. [1]
[1] Universidad de León, León, Spain
Source: Revista iberoamericana de automática e informática industrial (RIAI), ISSN-e 1697-7920, Vol. 16, No. 3, 2019, pp. 358-368
Language: Spanish
DOI: 10.4995/riai.2019.10640
Parallel titles:
- Early Fusion of Multi-level Saliency Descriptors for Image Classification
Abstract
- Spanish (translated to English)
  In this article we propose a method that improves image classification on datasets in which each image contains a single object. To do so, we treat saliency maps as if they were topographic maps and filter out the features of the image background, thereby improving the encoding that a classical Bag of Visual Words (BoVW) model performs on the whole image. First, we evaluate six well-known algorithms for generating saliency maps and select GBVS and SIM, having determined that they retain most of the object's information. Using the information in these saliency maps, we remove the densely extracted SIFT descriptors that belong to the background by filtering features with binary images obtained at several levels of the saliency map. We filter the descriptors by obtaining layers at several levels of the saliency map, and we evaluate the early fusion of the SIFT descriptors contained in those layers on five different datasets. The results of our experiments indicate that the proposed method always improves on the baseline when the first two layers of GBVS or of SIM are combined and the dataset contains images with a single object.
- English
  In this paper, we propose a method that improves the classification of images. By treating saliency maps as if they were topographic maps and filtering out the features of the image's background, the Bag of Visual Words (BoVW) encoding is improved. First, we evaluated six known algorithms for generating saliency maps and selected GBVS and SIM because they retain most of the information of the object. Next, we eliminated the extracted SIFT descriptors belonging to the background by filtering features based on binary images obtained at various levels of the selected saliency maps. We filtered the descriptors by obtaining layers at various levels of the saliency maps, and we evaluated the early fusion of the SIFT descriptors contained in these layers on five different datasets. The results obtained indicate that the proposed method always improves on the reference method when the first two layers of GBVS or SIM are combined and the dataset contains images with a single object.
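The filtering and fusion steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several things are assumptions made for illustration only: the function names (`saliency_layers`, `filter_descriptors`, `early_fusion`), the use of uniformly spaced thresholds to cut the saliency map into layers, the convention that layer 0 is the most salient band, and the reading of "early fusion" as pooling the descriptors of the selected layers into one set before BoVW encoding. The saliency map and the 128-dimensional descriptors are toy stand-ins for a real GBVS or SIM map and dense SIFT output.

```python
import numpy as np


def saliency_layers(saliency, n_levels=4):
    """Cut a saliency map into binary layers, like height bands of a topographic map.

    Assumption: thresholds are uniformly spaced and layer 0 is the most salient band.
    """
    s = np.asarray(saliency, dtype=float)
    span = s.max() - s.min()
    # Normalise to [0, 1]; a constant map is treated as all background.
    s = (s - s.min()) / span if span > 0 else np.zeros_like(s)
    # Band index per pixel: 0 for the highest saliency values, n_levels - 1 for the lowest.
    band = np.minimum((1.0 - s) * n_levels, n_levels - 1).astype(int)
    return [band == k for k in range(n_levels)]


def filter_descriptors(points, descriptors, mask):
    """Keep only the descriptors whose (x, y) location falls inside a binary mask."""
    pts = np.asarray(points, dtype=int)
    keep = mask[pts[:, 1], pts[:, 0]]  # masks are indexed as [row, column] = [y, x]
    return np.asarray(descriptors)[keep]


def early_fusion(points, descriptors, layers, selected=(0, 1)):
    """Pool the descriptors of the selected layers into a single set before encoding."""
    parts = [filter_descriptors(points, descriptors, layers[k]) for k in selected]
    return np.vstack(parts)


# Toy 4x4 saliency map with a salient 2x2 "object" in the centre.
saliency = np.array([
    [0.0, 0.1, 0.1, 0.0],
    [0.1, 0.6, 0.9, 0.1],
    [0.1, 0.7, 1.0, 0.1],
    [0.0, 0.1, 0.1, 0.0],
])

# Dense grid: one stand-in descriptor per pixel (random values, not real SIFT).
ys, xs = np.mgrid[0:4, 0:4]
points = np.column_stack([xs.ravel(), ys.ravel()])
descriptors = np.random.default_rng(0).random((16, 128))

layers = saliency_layers(saliency, n_levels=4)
fused = early_fusion(points, descriptors, layers, selected=(0, 1))
print(fused.shape)
```

In this sketch only the descriptors located on the four central pixels survive the filtering, so the background descriptors never reach the encoding stage. The fused set would then be quantised against a visual vocabulary in the usual BoVW manner, a step left out here.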