| DOI | Resolve DOI: https://doi.org/10.1109/CVPRW67362.2025.00050 |
|---|
| Author | Search for: Shah, Krish; Search for: Viswanath, Siddharth; Search for: Xi, Pengcheng1ORCID identifier: https://orcid.org/0000-0003-3236-5234; Search for: Wong, Alexander; Search for: Chen, Yuhao |
|---|
| Affiliation | - National Research Council of Canada. Digital Technologies
|
|---|
| Format | Text, Article |
|---|
| Conference | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 11-12, 2025, Nashville, TN, USA |
|---|
| Subject | training; measurement; computer vision; source coding; conferences; semantics; pose estimation; pattern recognition; monitoring; videos |
|---|
| Abstract | Food intake monitoring is a crucial area of research in food computing due to its complexity and significant potential for improving health outcomes. While traditional 2D image-based dietary assessments provide basic information, video offers a more detailed understanding of both the quantity of food consumed and the manner in which it is eaten. However, current video-based dietary analysis remains limited to coarse metrics, such as counting bites. In this paper, we introduce FoodVideoQA, a novel approach that leverages Vision-Language Models (VLMs) to analyze food intake videos comprehensively. We discuss the inherent limitations of a VLM-based approach to this problem, demonstrating the necessity for further novel approaches in this field. This work paves the way for future studies for more advanced multimodal food intake measurement and behavioral studies. Source code is available at https://github.com/isobarbaric/FoodVideoQA. |
|---|
| Publication date | 2025-06-11 |
|---|
| Publisher | IEEE |
|---|
| In | |
|---|
| Related data | |
|---|
| Language | English |
|---|
| Peer reviewed | Yes |
|---|
| Export citation | Export as RIS |
|---|
| Report a correction | Report a correction (opens in a new tab) |
|---|
| Record identifier | 937cac4a-a9fe-43a8-907b-b87937a4355e |
|---|
| Record created | 2025-10-16 |
|---|
| Record modified | 2025-10-16 |
|---|