Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques

Autores
Oppenheim, Abi; Albanese, Federico; Feuerstein, Esteban
Año de publicación
2023
Idioma
inglés
Tipo de recurso
documento de conferencia
Estado
versión publicada
Descripción
The homophily phenomenon in social networks causes users to interact primarily with others who share their interests and cultural backgrounds, leading to the formation of "echo chambers" [1–3]. The notion of cultural diversity among users and communities becomes relevant in this context. While previous studies have investigated diversity in interaction graphs, to the best of our knowledge, none have explored the degree of diversity based on community embedding, which has been proven effective in measuring the positioning of communities in various social dimensions [4–7]. Building on the work of [7], we propose characterizing and measuring diversity through an innovative algorithm based on community embedding. We propose a novel algorithm based on community embedding to characterize and measure diversity. Our approach builds upon prior work on diversity in social media and involves iteratively updating values for the diversity of communities and individual users. To demonstrate the effectiveness of our algorithm, we conduct a case study analyzing over over 800 million posts in 9 million discussion subreddits of different ethnic groups on Reddit. Next, we generated embeddings for each community using community2vec [8] and developed algorithms to quantify cultural diversity based on these embeddings.
Sociedad Argentina de Informática e Investigación Operativa
Materia
Ciencias Informáticas
Machine learning
Social Media
Reddit
Community Embedding
Diversity
Nivel de accesibilidad
acceso abierto
Condiciones de uso
http://creativecommons.org/licenses/by-nc-sa/4.0/
Repositorio
SEDICI (UNLP)
Institución
Universidad Nacional de La Plata
OAI Identificador
oai:sedici.unlp.edu.ar:10915/165745

id SEDICI_085bb85adc25f9edccc4a653e42d2005
oai_identifier_str oai:sedici.unlp.edu.ar:10915/165745
network_acronym_str SEDICI
repository_id_str 1329
network_name_str SEDICI (UNLP)
spelling Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniquesOppenheim, AbiAlbanese, FedericoFeuerstein, EstebanCiencias InformáticasMachine learningSocial MediaRedditCommunity EmbeddingDiversityThe homophily phenomenon in social networks causes users to interact primarily with others who share their interests and cultural backgrounds, leading to the formation of "echo chambers" [1–3]. The notion of cultural diversity among users and communities becomes relevant in this context. While previous studies have investigated diversity in interaction graphs, to the best of our knowledge, none have explored the degree of diversity based on community embedding, which has been proven effective in measuring the positioning of communities in various social dimensions [4–7]. Building on the work of [7], we propose characterizing and measuring diversity through an innovative algorithm based on community embedding. We propose a novel algorithm based on community embedding to characterize and measure diversity. Our approach builds upon prior work on diversity in social media and involves iteratively updating values for the diversity of communities and individual users. To demonstrate the effectiveness of our algorithm, we conduct a case study analyzing over over 800 million posts in 9 million discussion subreddits of different ethnic groups on Reddit. Next, we generated embeddings for each community using community2vec [8] and developed algorithms to quantify cultural diversity based on these embeddings.Sociedad Argentina de Informática e Investigación Operativa2023-09info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionResumenhttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf66-67http://sedici.unlp.edu.ar/handle/10915/165745enginfo:eu-repo/semantics/altIdentifier/url/https://publicaciones.sadio.org.ar/index.php/JAIIO/article/view/490info:eu-repo/semantics/altIdentifier/issn/2451-7496info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2025-09-17T10:26:36Zoai:sedici.unlp.edu.ar:10915/165745Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292025-09-17 10:26:36.718SEDICI (UNLP) - Universidad Nacional de La Platafalse
dc.title.none.fl_str_mv Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
title Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
spellingShingle Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
Oppenheim, Abi
Ciencias Informáticas
Machine learning
Social Media
Reddit
Community Embedding
Diversity
title_short Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
title_full Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
title_fullStr Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
title_full_unstemmed Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
title_sort Quantifying cultural diversity in social networks: a community embedding approach : Defining diversity measures through graph and machine learning techniques
dc.creator.none.fl_str_mv Oppenheim, Abi
Albanese, Federico
Feuerstein, Esteban
author Oppenheim, Abi
author_facet Oppenheim, Abi
Albanese, Federico
Feuerstein, Esteban
author_role author
author2 Albanese, Federico
Feuerstein, Esteban
author2_role author
author
dc.subject.none.fl_str_mv Ciencias Informáticas
Machine learning
Social Media
Reddit
Community Embedding
Diversity
topic Ciencias Informáticas
Machine learning
Social Media
Reddit
Community Embedding
Diversity
dc.description.none.fl_txt_mv The homophily phenomenon in social networks causes users to interact primarily with others who share their interests and cultural backgrounds, leading to the formation of "echo chambers" [1–3]. The notion of cultural diversity among users and communities becomes relevant in this context. While previous studies have investigated diversity in interaction graphs, to the best of our knowledge, none have explored the degree of diversity based on community embedding, which has been proven effective in measuring the positioning of communities in various social dimensions [4–7]. Building on the work of [7], we propose characterizing and measuring diversity through an innovative algorithm based on community embedding. We propose a novel algorithm based on community embedding to characterize and measure diversity. Our approach builds upon prior work on diversity in social media and involves iteratively updating values for the diversity of communities and individual users. To demonstrate the effectiveness of our algorithm, we conduct a case study analyzing over over 800 million posts in 9 million discussion subreddits of different ethnic groups on Reddit. Next, we generated embeddings for each community using community2vec [8] and developed algorithms to quantify cultural diversity based on these embeddings.
Sociedad Argentina de Informática e Investigación Operativa
description The homophily phenomenon in social networks causes users to interact primarily with others who share their interests and cultural backgrounds, leading to the formation of "echo chambers" [1–3]. The notion of cultural diversity among users and communities becomes relevant in this context. While previous studies have investigated diversity in interaction graphs, to the best of our knowledge, none have explored the degree of diversity based on community embedding, which has been proven effective in measuring the positioning of communities in various social dimensions [4–7]. Building on the work of [7], we propose characterizing and measuring diversity through an innovative algorithm based on community embedding. We propose a novel algorithm based on community embedding to characterize and measure diversity. Our approach builds upon prior work on diversity in social media and involves iteratively updating values for the diversity of communities and individual users. To demonstrate the effectiveness of our algorithm, we conduct a case study analyzing over over 800 million posts in 9 million discussion subreddits of different ethnic groups on Reddit. Next, we generated embeddings for each community using community2vec [8] and developed algorithms to quantify cultural diversity based on these embeddings.
publishDate 2023
dc.date.none.fl_str_mv 2023-09
dc.type.none.fl_str_mv info:eu-repo/semantics/conferenceObject
info:eu-repo/semantics/publishedVersion
Resumen
http://purl.org/coar/resource_type/c_5794
info:ar-repo/semantics/documentoDeConferencia
format conferenceObject
status_str publishedVersion
dc.identifier.none.fl_str_mv http://sedici.unlp.edu.ar/handle/10915/165745
url http://sedici.unlp.edu.ar/handle/10915/165745
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/https://publicaciones.sadio.org.ar/index.php/JAIIO/article/view/490
info:eu-repo/semantics/altIdentifier/issn/2451-7496
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.format.none.fl_str_mv application/pdf
66-67
dc.source.none.fl_str_mv reponame:SEDICI (UNLP)
instname:Universidad Nacional de La Plata
instacron:UNLP
reponame_str SEDICI (UNLP)
collection SEDICI (UNLP)
instname_str Universidad Nacional de La Plata
instacron_str UNLP
institution UNLP
repository.name.fl_str_mv SEDICI (UNLP) - Universidad Nacional de La Plata
repository.mail.fl_str_mv alira@sedici.unlp.edu.ar
_version_ 1843533019936194560
score 13.001348