Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

nkkbr
/

ViCA

Video-Text-to-Text

text-generation

vision-language

video understanding

spatial reasoning

visuospatial cognition

Eval Results (legacy)

Model card Files Files and versions

ViCA / assets

5.35 MB

1 contributor

History: 3 commits

nkkbr's picture

.

e2d3083 10 months ago

training_record
update readme 10 months ago
banner.png

2.82 MB
xet

. 10 months ago
data-scale-csr-effect.svg

89.4 kB

update readme 10 months ago
table2.png

475 kB
xet

update readme 10 months ago
table3.png

587 kB
xet

update readme 10 months ago
vsi-bench-comparison.svg

62 kB

update readme 10 months ago
vsi-bench-table.png

673 kB
xet

update readme 10 months ago