nano-JEPA: Democratizing Video Understanding with Personal Computers
- Autores
- Rostagno, Adrián; Iparraguirre, Javier; Ermantraut, Joel; Tobio, Lucas; Foissac, Segundo; Aggio, Santiago; Friedrich, Guillermo Rodolfo
- Año de publicación
- 2024
- Idioma
- inglés
- Tipo de recurso
- documento de conferencia
- Estado
- versión publicada
- Descripción
- The Video Joint Embedding Predictive Architecture (V-JEPA) has shown great promise in self-supervised video representation learning. However, its substantial computational demands, often necessitates powerful GPU clusters, limit accessibility for many researchers. We introduce nano-JEPA, a streamlined adaptation of V-JEPA designed to run efficiently on resource-constrained personal computers, even those with only CPUs. Additionally, we present the nano-datasets repository, facilitating the creation of manageable subsets from large public video datasets. Our work aims to democratize research in this field, enabling broader participation and experimentation with V-JEPA-like models. We demonstrate that nano-JEPA, trained on smaller datasets and hardware, can still achieve reasonable performance on downstream tasks, opening doors for further exploration and innovation.
Red de Universidades con Carreras en Informática - Materia
-
Ciencias Informáticas
feature prediction
unsupervised learning
visual representations
video
joint-embedding predictive architecture - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc-sa/4.0/
- Repositorio
.jpg)
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/176281
Ver los metadatos del registro completo
| id |
SEDICI_62db3241a64308190c333941a8b081b5 |
|---|---|
| oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/176281 |
| network_acronym_str |
SEDICI |
| repository_id_str |
1329 |
| network_name_str |
SEDICI (UNLP) |
| spelling |
nano-JEPA: Democratizing Video Understanding with Personal ComputersRostagno, AdriánIparraguirre, JavierErmantraut, JoelTobio, LucasFoissac, SegundoAggio, SantiagoFriedrich, Guillermo RodolfoCiencias Informáticasfeature predictionunsupervised learningvisual representationsvideojoint-embedding predictive architectureThe Video Joint Embedding Predictive Architecture (V-JEPA) has shown great promise in self-supervised video representation learning. However, its substantial computational demands, often necessitates powerful GPU clusters, limit accessibility for many researchers. We introduce nano-JEPA, a streamlined adaptation of V-JEPA designed to run efficiently on resource-constrained personal computers, even those with only CPUs. Additionally, we present the nano-datasets repository, facilitating the creation of manageable subsets from large public video datasets. Our work aims to democratize research in this field, enabling broader participation and experimentation with V-JEPA-like models. We demonstrate that nano-JEPA, trained on smaller datasets and hardware, can still achieve reasonable performance on downstream tasks, opening doors for further exploration and innovation.Red de Universidades con Carreras en Informática2024-10info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf94-103http://sedici.unlp.edu.ar/handle/10915/176281enginfo:eu-repo/semantics/altIdentifier/isbn/978-950-34-2428-5info:eu-repo/semantics/reference/hdl/10915/172755info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-04-28T13:57:00Zoai:sedici.unlp.edu.ar:10915/176281Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-04-28 13:57:00.723SEDICI (UNLP) - Universidad Nacional de La Platafalse |
| dc.title.none.fl_str_mv |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| title |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| spellingShingle |
nano-JEPA: Democratizing Video Understanding with Personal Computers Rostagno, Adrián Ciencias Informáticas feature prediction unsupervised learning visual representations video joint-embedding predictive architecture |
| title_short |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| title_full |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| title_fullStr |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| title_full_unstemmed |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| title_sort |
nano-JEPA: Democratizing Video Understanding with Personal Computers |
| dc.creator.none.fl_str_mv |
Rostagno, Adrián Iparraguirre, Javier Ermantraut, Joel Tobio, Lucas Foissac, Segundo Aggio, Santiago Friedrich, Guillermo Rodolfo |
| author |
Rostagno, Adrián |
| author_facet |
Rostagno, Adrián Iparraguirre, Javier Ermantraut, Joel Tobio, Lucas Foissac, Segundo Aggio, Santiago Friedrich, Guillermo Rodolfo |
| author_role |
author |
| author2 |
Iparraguirre, Javier Ermantraut, Joel Tobio, Lucas Foissac, Segundo Aggio, Santiago Friedrich, Guillermo Rodolfo |
| author2_role |
author author author author author author |
| dc.subject.none.fl_str_mv |
Ciencias Informáticas feature prediction unsupervised learning visual representations video joint-embedding predictive architecture |
| topic |
Ciencias Informáticas feature prediction unsupervised learning visual representations video joint-embedding predictive architecture |
| dc.description.none.fl_txt_mv |
The Video Joint Embedding Predictive Architecture (V-JEPA) has shown great promise in self-supervised video representation learning. However, its substantial computational demands, often necessitates powerful GPU clusters, limit accessibility for many researchers. We introduce nano-JEPA, a streamlined adaptation of V-JEPA designed to run efficiently on resource-constrained personal computers, even those with only CPUs. Additionally, we present the nano-datasets repository, facilitating the creation of manageable subsets from large public video datasets. Our work aims to democratize research in this field, enabling broader participation and experimentation with V-JEPA-like models. We demonstrate that nano-JEPA, trained on smaller datasets and hardware, can still achieve reasonable performance on downstream tasks, opening doors for further exploration and innovation. Red de Universidades con Carreras en Informática |
| description |
The Video Joint Embedding Predictive Architecture (V-JEPA) has shown great promise in self-supervised video representation learning. However, its substantial computational demands, often necessitates powerful GPU clusters, limit accessibility for many researchers. We introduce nano-JEPA, a streamlined adaptation of V-JEPA designed to run efficiently on resource-constrained personal computers, even those with only CPUs. Additionally, we present the nano-datasets repository, facilitating the creation of manageable subsets from large public video datasets. Our work aims to democratize research in this field, enabling broader participation and experimentation with V-JEPA-like models. We demonstrate that nano-JEPA, trained on smaller datasets and hardware, can still achieve reasonable performance on downstream tasks, opening doors for further exploration and innovation. |
| publishDate |
2024 |
| dc.date.none.fl_str_mv |
2024-10 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia |
| format |
conferenceObject |
| status_str |
publishedVersion |
| dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/176281 |
| url |
http://sedici.unlp.edu.ar/handle/10915/176281 |
| dc.language.none.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/isbn/978-950-34-2428-5 info:eu-repo/semantics/reference/hdl/10915/172755 |
| dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| eu_rights_str_mv |
openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.format.none.fl_str_mv |
application/pdf 94-103 |
| dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
| reponame_str |
SEDICI (UNLP) |
| collection |
SEDICI (UNLP) |
| instname_str |
Universidad Nacional de La Plata |
| instacron_str |
UNLP |
| institution |
UNLP |
| repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
| repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
| _version_ |
1863817440555696128 |
| score |
13.039539 |