Mapping Poverty and Well-Being through Natural Language Processing
No. 93 (2025-07-25)
Author(s)
- Guberney Muñetón-Santa, Universidad de Antioquia, Colombia. ORCID: https://orcid.org/0000-0002-5194-1914
- Carlos Andrés Pérez Aguirre, Grupo de Epidemiología, Facultad Nacional de Salud Pública - UdeA. ORCID: https://orcid.org/0000-0003-4937-1163
- Juan Rafael Orozco-Arroyave, Universidad de Antioquia, Colombia. ORCID: https://orcid.org/0000-0002-8507-0782
Abstract
Poverty and well-being indices cover a range of dimensions that reflect valuable and quantifiable aspects of people's lives. These dimensions reveal people's concerns and priorities and thus offer insight into their experiences. This study identifies the dimensions of poverty and well-being directly in everyday language and proposes a novel method for assigning a value to each one according to what people consider important. Using topic modeling techniques from natural language processing, we detect the main themes that people, in their own words, associate with poverty and well-being. We also apply transfer learning based on a zero-shot classification model to assign a value to the dimensions under study and rank them by their relevance to the population group examined. In our case studies, the most salient poverty-related dimensions were lack of opportunities, unemployment, demotivation, lack of money, and efforts to get ahead. Among the well-being-related dimensions, we found quality of life, satisfaction of basic needs, food, and health. This approach makes it possible to pinpoint the priority areas for intervention and resource allocation. Finally, we recommend applying topic modeling techniques when designing multidimensional indicators, since doing so allows researchers and policymakers to build social indicators tailored to the needs of the people they seek to serve.
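The ranking step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how classifier scores are aggregated, so averaging per-response scores and normalizing them to sum to one is an assumption made here. The names `keyword_scorer`, `dimension_weights`, and `rank_dimensions`, as well as the example responses and labels, are hypothetical. The `keyword_scorer` function is a toy stand-in that only counts word matches; in practice it would be replaced by a real zero-shot classifier that returns one score per candidate label.

```python
from typing import Callable, Dict, List

# Assumed interface: a scorer maps (text, candidate labels) to one score per label.
Scorer = Callable[[str, List[str]], Dict[str, float]]


def keyword_scorer(text: str, labels: List[str]) -> Dict[str, float]:
    """Toy stand-in for a zero-shot classifier (NOT a language model).

    Each label gets 1 plus the number of times it appears as a word in the
    text; the scores are then normalized to sum to 1 for this response.
    """
    words = text.lower().split()
    raw = {label: 1.0 + words.count(label.lower()) for label in labels}
    total = sum(raw.values())
    return {label: value / total for label, value in raw.items()}


def dimension_weights(
    responses: List[str], labels: List[str], scorer: Scorer
) -> Dict[str, float]:
    """Sum per-response scores for each label and normalize to sum to 1."""
    totals = {label: 0.0 for label in labels}
    for text in responses:
        scores = scorer(text, labels)
        for label in labels:
            totals[label] += scores[label]
    grand_total = sum(totals.values())
    return {label: totals[label] / grand_total for label in labels}


def rank_dimensions(weights: Dict[str, float]) -> List[str]:
    """Order labels from the highest weight to the lowest."""
    return sorted(weights, key=weights.get, reverse=True)


# Hypothetical responses and candidate dimensions, for illustration only.
example_responses = [
    "unemployment is everywhere here",
    "unemployment means no money",
    "no money and unemployment again",
]
example_labels = ["unemployment", "money", "health"]

example_weights = dimension_weights(example_responses, example_labels, keyword_scorer)
print(rank_dimensions(example_weights))
```

Passing the scorer as an argument keeps the aggregation independent of the model, so the same weighting and ranking code could be reused with any classifier that returns per-label scores.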
License
Copyright 2025 Guberney Muñetón-Santa, Carlos Andrés Pérez Aguirre, Juan Rafael Orozco-Arroyave

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.