[论文简析]DETR: End-to-End Object Detection with Transfromers[2005.12872]
[论文简析]Contrastive Learning for Unpaired Image-to-Image Translation[2007.15651]
[论文简析]DeiT: Data-efficient Image Transformers[2012.12877]
[论文简析]SimSiam: Exploring Simple Siamese Representation Learning[2011.10566]
[论文简析]VQ-VAE:Neural discrete representation learning[1711.00937]
[论文速览]NeRF-RL: Reinforcement Learning with Neural Radiance Fields[2206.01634]
[论文简析]SimCLR: A simple framework for contrastive learning[2002.05709]
[论文简析]VideoMoCo: ...Temporally Adversarial Examples[2103.05905]
[论文简析]MViT: Multiscale Vision Transformers[2104.11227]
[论文简析]TNT: Transformer in Transformer[2103.00112]
[论文简析]VAE: Auto-encoding Variational Bayes[1312.6114]
[论文简析]Barlow Twins:Self-Supervised Learning via Redundancy Reduction[2103.03230]
[论文简析]DAT: Vision Transformer with Deformable Attention[2201.00520]
[论文简析]SwAV: Swapping Assignments between multiple Views[2006.09882]
[论文简析]End-to-End Learning... from Uncurated Instructional Videos[1912.06430]
[论文速览]LLaRA: Supercharging Robot Learning Data for VLM Policy[2406.20095]
[论文速览]Visual Prompt Tuning / VPT[2203.12119]
[论文简析]Equivariant Contrastive Learning[2111.00899]
[论文简析]Towards Better Understanding of Self-Supervised Representation[2203.01881]
[论文速览]Open-vocabulary Object Segmentation with Diffusion Models[2301.05221]