Klara Janouskova

Klára Janoušková

PhD Student · Computer Vision & Machine Learning · CTU Prague

I am a third-year computer vision and machine learning PhD student at the Czech Technical University in Prague (CTU), supervised by professor Jiří Matas.

My current research focuses on spatial understanding for vision-language models, ImageNet-scale recognition benchmarks and reannotation, and reinforcement learning for video object segmentation. Previously, I worked on fine-grained species classification and biodiversity benchmarks, test-time adaptation for segmentation, AI-assisted labelling for civil infrastructure inspection, and scene text detection and recognition.

I teach labs for the Machine Learning and Pattern Recognition course at CTU and co-supervise several BSc/MSc students.

During my undergraduate studies I interned at the Technion (Chaim Baskin, Alex Bronstein), IBM Research Zurich (Mattia Rigotti, Ioana Giurgiu, Cristiano Malossi), and the CVC at UAB (Dimosthenis Karatzas, Lluis Gomez).

Vision-Language Models
Spatial Understanding for Vision-Language Models

I am now interested in spatial understanding for VLMs — improving the spatial representation for VLMs and efficient VLMs, building on our prior work on context-aware object recognition and closed-form adaptation (Koo-Fu CLIP).

Recognition & Benchmarks
Aiming for Perfect ImageNet-1k

The aim is complete validation set reannotation — an ongoing, soon to be published project. Building on our analysis of ImageNet flaws and VLM-based recognition methods.

Video Understanding
RL for Video Object Segmentation

Investigating reinforcement learning for learned memory control in the Segment Anything Model 2 (SAM2), with the goal of improving long-form video object segmentation by dynamically managing the memory bank.

All projects (including past work) on the Projects page · Publications on the Publications page.