Publications
Selected Publications
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) -- 2024 This paper introduces FIRE, a novel multimodal methodology for generating recipes from food images, contributing to the growing field of food computing. FIRE effectively produces food titles, ingredients, and cooking instructions using the BLIP model, a Vision Transformer with a decoder, and the T5 model. The paper also explores practical applications like recipe customization and recipe-to-code generation for automated cooking. | |
International Conference on Knowledge Capture (KCap) -- 2023 🏆🏆 Best Student Paper Award 🏆🏆 | |
ACL Workshop on Biomedical Natural Language Processing (BioNLP) -- 2023 This paper explores using pre-trained Language Models (LMs) for assessing mental health risk from social media data. A Question-Answering (QA) approach, utilizing the Unified-QA model, is proposed for analyzing two large mental health datasets. To ensure user privacy, the model is trained with differential privacy techniques. The results show that treating risk assessment as a QA task is effective for mental health scenarios, with minimal performance loss (less than 1%) due to privacy safeguards. This approach signifies a promising direction for creating privacy-conscious diagnostic systems in mental health. | |
NeurIPS Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models -- 2023 This paper examines the limitations of Multimodal Large Language Models (LLMs) in visual question answering (VQA), particularly their sensitivity to the size of visual details in images. The study finds that the zero-shot accuracy of these models decreases by up to 46% with smaller visual subjects. Human visual cropping is shown to mitigate this issue, indicating a causal relationship. The paper proposes three automatic visual cropping methods to enhance zero-shot performance in multimodal LLMs. These methods are evaluated on four VQA datasets and a VQAv2 subset focused on fine details. The results highlight the need for caution in using multimodal LLMs for detail-sensitive VQA tasks and suggest visual cropping as a viable solution for improving performance. | |
International Conference on Data Science & Management of Data (CODS-COMAD) -- 2023 This paper presents an automated pipeline for creating 3D virtual tours in real estate, addressing the time and cost challenges of manual annotation in traditional methods. It introduces a novel HSV-based coloring scheme for paper tags, placed in locations before capturing 360° equirectangular images. These tags are uniquely numbered and bi-colored, enhancing tag detection and digit recognition using YOLOv5 and a custom MobileNet architecture, respectively. The method links equirectangular images based on these detected tags, demonstrating its efficiency with a real-world dataset from Housing.com. | |
European Conference on Machine Learning (ECML) -- 2022 Real-estate image tagging is one of the essential use-cases to save efforts involved in manual annotation and enhance the user experience. This paper proposes an end-to-end pipeline (referred to as RE-Tagger) for the real-estate image classification problem. We present a two-stage transfer learning approach using custom InceptionV3 architecture to classify images into different categories (i.e., bedroom, bathroom, kitchen, balcony, hall, and others). |
All Publications
You can find all my publications at Google Scholar or ResearchGate.
Conferences / Workshops
CORE A*
J Zhang, M Khayatkhoei, P Chhikara, and F Ilievski. ‘Visual Cropping Improves Zero-Shot Question Answering of Multimodal Large Language Models’, 2023. NeurIPS Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models
[Published Version][Poster]P Chhikara, U Pasupulety, J Marshall, D Chaurasia, and S Kumari. ‘Privacy Aware Question-Answering System for Online Mental Health Risk Assessment’, 2023. ACL Proceedings of the 22nd Workshop on Biomedical Language Processing.
[Published Version][Presentation][Poster]P Chhikara, R Tekchandani, N Kumar, and S Tanwar. ‘Federated Learning-based Aerial Image Segmentation for collision-free Movement and Landing’, 2021. Proceedings of the 4th ACM MobiCom Workshops.
[Published Version]
CORE A
P Chhikara, D Chaurasia, Y Jiang, O Masur, and F Ilievski. ‘FIRE: Food Image to REcipe generation’, 2024. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
[Published Version][Poster][Presentation]P Chhikara, A Goyal, and C Sharma. ‘RE-Tagger: A light-weight Real-Estate Image Classifier’, 2022. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD).
[Published Version][Video]
CORE B
P Chhikara, J Zhang, F Ilievski, J Francis, and K Ma. ‘Knowledge-enhanced Agents for Interactive Text Games’, 2023. International Conference on Knowledge Capture (KCap). 🏆🏆 Best Student Paper Award 🏆🏆
[Published Version][Presentation]P Chhikara, H Kuhar, A Goyal, and C Sharma. ‘DIGITOUR: Automatic Digital Tours for Real-Estate Properties’, 2023. ACM CODS-COMAD.
[Published Version]P Chhikara, R Tekchandani, N Kumar, S Tanwar, and JJPC Rodrigues. ‘Federated Learning for Air Quality Index Prediction using UAV Swarm Networks’, 2021. IEEE Globecom.
[Published Version]S Arora, S Goel, P Chhikara, H Singh, N Kumar, and PS Rana. ‘An efficient scheme for wireless charging of electric vehicles using RFID with an optimal path planning’, 2019. IEEE Globecom.
[Published Version]
Unranked
- P Singh, P Chhikara, and J Singh. ‘An ensemble approach for extractive text summarization’, 2020. International Conference on Emerging Trends in Information Technology and Engineering.
[Published Version]
Journals
Quartile 1 (Q1)
A Barnawi, P Chhikara, R Tekchandani, N Kumar, and B Alzahrani. ‘A Differentially Privacy Assisted Federated Learning Scheme to Preserve Data Privacy for IoMT Applications’, 2024. IEEE Transactions on Network and Service Management.
[Impact Factor: 5.300] [Published Version]D Chaurasia and P Chhikara. ‘Sea-Pix-GAN: Underwater image enhancement using adversarial neural network’, 2024. Journal of Visual Communication and Image Representation.
[Impact Factor: 2.600] [Published Version]P Chhikara, R Tekchandani, N Kumar, M Guizani, and MM Hassan. ‘Federated learning and autonomous UAVs for hazardous zone detection and AQI prediction in IoT environment’, 2021. IEEE Internet of Things Journal.
[Impact Factor: 10.238] [Published Version]A Barnawi, P Chhikara, R Tekchandani, N Kumar, and B Alzahrani. ‘Artificial intelligence-enabled Internet of Things-based system for COVID-19 screening using aerial thermal imaging’, 2021. Future Generation Computer Systems.
[Impact Factor: 7.307] [Published Version]P Chhikara, R Tekchandani, N Kumar, V Chamola, and M Guizani. ‘DCNN-GA: A Deep Neural Net Architecture for Navigation of UAV in Indoor Environment’, 2020. IEEE Internet of Things Journal.
[Impact Factor: 10.238] [Published Version]P Chhikara, P Singh, R Tekchandani, N Kumar, and M Guizani. ‘Federated Learning meets Human Emotions: a Decentralized Framework for Human-Computer Interaction for IoT Applications’, 2020. IEEE Internet of Things Journal.
[Impact Factor: 10.238] [Published Version]P Chhikara, R Tekchandani, N Kumar, and MS Obaidat. ‘An Efficient Container Management Scheme for Resource Constrained Intelligent IoT Devices’, 2020. IEEE Internet of Things Journal.
[Impact Factor: 10.238] [Published Version]
Quartile 2 (Q2)
P Chhikara, R Tekchandani, and N Kumar. ‘Adaptive federated learning scheme for recognition of malicious attacks in an IoT network’, 2023. Computing.
[Impact Factor: 2.420] [Published Version]A Barnawi, P Chhikara, R Tekchandani, N Kumar, and M Boulares. ‘A CNN-based scheme for COVID-19 detection with emergency services provisions using an optimal path planning’, 2021. Multimedia Systems.
[Impact Factor: 2.603] [Published Version]P Chhikara, N Jain, R Tekchandani, and N Kumar. ‘Data dimensionality reduction techniques for Industry 4.0: Research results, challenges, and future research directions’, 2020. Software: Practice and Experience.
[Impact Factor: 3.200] [Published Version]
Quartile 3 (Q3)
- P Chhikara, P Gupta, P Singh, and T Bhatia. ‘A Deep Transfer Learning based model for Automatic Detection of COVID-19 from Chest X-Rays’, 2021. Turkish Journal of Electrical Engineering and Computer Sciences.
[Impact Factor: 0.853] [Published Version]
Book Chapter
- P Chhikara, P Singh, P Gupta, and T Bhatia. ‘Deep convolutional neural network with transfer learning for detecting pneumonia on chest X-rays ‘, 2019. Advances in Bioinformatics, Multimedia, and Electronics Circuits and Signals.
[Published Version]