Mar 2021: Paper accepted at CVPR 2021
Jan 2021: The 2nd ActivityNet Entities Object Localization Challenge at CVPR 2021
Dec 2020: Concept Generalization Benchmark on arXiv
Dec 2020: Serving as an Area Chair for CVPR 2021 and ICCV 2021
Sep 2020: Paper accepted at NeurIPS 2020
Aug 2020: Call for PhD position with CTU
Jul 2020: Paper accepted at ECCV 2020
Mar 2020: New chapter: NAVER LABS Europe
Dec 2019: Paper accepted at ICLR 2020
Dec 2019: Organizing the CV for Agriculture workshop at ICLR 2020
Oct 2019: Talk and tutorial at Data Science Africa 2019
Jul 2019: Paper accepted at ICCV 2019
Jul 2019: 4 papers accepted at CVPR 2019
Jan 2019: Organizing the CV4GC workshop at CVPR 2019
Jan 2019: Paper at TPAMI
Nov 2018: Paper accepted at AAAI 2019
Sep 2018: Paper accepted at NeurIPS 2018
Jul 2018: Paper accepted at ECCV 2018
May 2017: Interactive installation at ArtScience Museum in Singapore
Feb 2017: New chapter: Facebook Research
Feb 2017: Best paper at LSCVS Workshop @ NeurIPS 2016
Older: Older news here
Our work on Probabilistic Embeddings for Cross-Modal Retrieval was accepted at CVPR 2021. This is a collaboration with Sanghyuk Chun and Seong Joon Oh from the NAVER AI Lab in Korea as well as Rafael S. de Rezende and Diane Larlus from NAVER LABS Europe.
A pre-print for our work on Concept Generalization in Visual Representation Learning is now public. Check out our project page for code and results or this Twitter thread for a summary of results. Work led by the amazing Mert Bulent Sariyildiz and together with Diane Larlus and Karteek Alahari.
Our work on Hard Negative Mixing for Contrastive Learning was accepted as a poster presentation at NeurIPS 2020. Resources: [Project page] (with pretrained models), [Blog Post], [slides].
[Position has now been filled].
Seeking for a highly motivated candidate for a PhD position at CTU in Prague, sponsored by NAVER LABS Europe and co-supervised by Prof. Giorgos Tolias and myself.
Candidates belonging to underrepresented groups in AI research are highly encouraged to apply.
Our paper Learning to Generate Grounded Image Captions without Localization Supervision. was accepted at ECCV 2020. [Project Page], [code]
After three amazing years at Facebook AI, joined the Computer Vision Group at Naver Labs Europe in Grenoble as a research scientist.
Excited to announce that our paper Decoupling Representation and Classifier for Long-Tailed Recognition was accepted at the International Conference on Representation Learning (ICLR) 2020!
Moreover, the 1st workshop on Computer Vision for Agriculture (CV4A), the second workshop of the Computer Vision for Global Challenges initiative, will be held in April 2020 in conjunction with ICLR, in Addis Ababa, Ethiopia. Find more information at the CV4A Workshop webpage.
Excited to teach a short tutorial on "Image representations and fine-grained recognition" at Data Science Africa, in Accra, Ghana, on October 22nd. The tutorial slides can be found here in normal resolution (~6.2MB) or lower resolution (~1.6MB).
The slides from my DSA talk on "Learnings from the Computer Vision for Global Challenges (CV4GC) initiative" can be found here.
Our paper Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution. was accepted at ICCV 2019! Work with awesome collaborators from the National University of Singapore.
Official code is at the facebook research github repo, and will further be updated and extended soon. Numerous implementations of OctConv can be found on Papers With Code.
Delighted to announce that four papers (3 poster and 1 oral) were accepted at CVPR 2019!
The first Workshop on Computer vision for Global Challenges (CV4GC) was accepted at CVPR this year! Really excited about organizing an initiative to bring the computer vision community closer to socially impactful tasks, datasets and applications for the whole world.
More info:
- The CV4GC website
- Computer vision and global challenges: New research and applications
- Computer Vision for Global Challenges research award winners
Our paper Focal Visual-Text Attention for Memex Question Answering was accepted for publication at the IEEE Transactions on Pattern Analysis and Machine Inteligence (impact factor: 9.455). It introduces our MemexQA Dataset, the first publicly available multimodal question answering dataset consisting of real personal photo albums.
Our paper Large-scale Visual Relationship Detection was accepted at the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019, acceptance rate 16.2%). [Update Feb 2019]: Code is now on github.
Our paper A^2-Nets: Double Attention Networks was accepted at NeurIPS 2018. See you in Montreal! Work with awesome collaborators from the National University of Singapore.
Our paper Multi-Fiber Networks for Video Recognition was accepted at the European Conference on Computer Vision (ECCV) 2018. Work with awesome collaborators from the National University of Singapore. Code on GitHub.
Our interactive art installation entitled Do Silhouettes Dream? will be on display from July 26th till August 2nd 2017 at the ArtScience Museum in Singapore. More information at this page. You may watch a short interview with Cheng and me here.
Our work on visual similarity search over the whole Flickr corpus just launched! Try it yourselves by clicking on the magnifying glass icon at the top right corner of any photopage! Story covered in The Verge, Engadget, Petapixel, Digital Trends and Venture Beat.
After two amazing years at Yahoo Research, joined the Computer Vision Group at Facebook Research in Menlo Park.
Our paper "Tag Prediction in Flickr: A view from the darkroom" on large scale image classification with noisy training data received the best paper award at the 1st Workshop on Large Scale Computer Vision Systems at NeurIPS 2016.
Our paper "Multimodal Classification of Moderated Online Pro-Eating Disorder Content" was accepted at the ACM CHI 2017 conference (25% acceptance rate).
Will be a guest lecturer at Fei-Fei's and Juan Carlos' CS 131 Computer Vision: Foundations and Applications course at Stanford during the 2016-2017 Fall Semester.
Grew up and lived in Greece until 2015 with brief breaks in Sweden, Spain and the United States. Lived in San Francisco and Oakland from 2015 till 2020. I am currently based in Grenoble but probably spend as much time in Barcelona.
Got my PhD in late 2014 from the National Technical University of Athens under the supervision of Prof. Stefanos Kollias and Yannis Avrithis, working closely with my research brother Giorgos Tolias. My PhD was on large-scale geometry indexing, nearest neighbor search and clustering.
From 2015 to 2017 I was a research scientist at Yahoo Research in San Francisco. The large-scale visual similarity search work of my PhD came to a nice closure when we applied it on a trully web-scale real-time application, powering the visual search feature on Flickr. At the same time, my interests expanded towards modeling of vision and language and collaborated with Stanford on the Visual Genome project.
From 2017 and till 2020, I was a research scientist at Facebook AI in Menlo Park, California. During this time my interests expanded to video understanding and deep learning architecture modeling.
On March 2020 I started working as a research scientist at Naver Labs Europe in Grenoble, France. Currently, my research interests include self-supervised representation learning, continual and streaming learning, multi-modal learning, video understanding and vision & language.
I am also very passionate about urging the research community tackle more socially impactful problems. Together with Laura Sevilla-Lara, we lead the Computer Vision for Global Challenges initative. Under this umbrella, we organized the CV4GC workshop at CVPR 2019, and the Computer Vision for Agrigulture (CV4A) workshop at ICLR 2020.
email: ykalant(at)image.ntua.gr yannis.kalantidis(at)naverlabs.com
Full list at my Google Scholar profile.
2021