research

2025

  1. Paper
    DLA: Dual Layer Aggregation for Squeezing Capacity of Multimodal Large Language Models for Subject-driven Generation
    Shuhong Zheng, Aashish Kumar Misraa, Yu-Teng Li, Yu-Jhe Li, and Igor Gilitschenski
    2025
    Under Review
  2. Paper
    Visual Persona: Foundation Model for Full-Body Human Customization
    Jisu Nam, Soowon Son, Zhan Xu, Jing Shi, Difan Liu, Feng Liu, Aashish Misraa, Seungryong Kim, and Yang Zhou
    2025
    A version of this work was published at CVPR 2025
  3. Patent
    Zero-shot person customization in diffusion model
    Aashish Kumar Misraa, Pranav Aggarwal, and Midhun Harikumar
    Jan 2025
    US App. 19/372,643

2024

  1. Patent
    Content customization and composition in diffusion
    Pranav Aggarwal, Aashish Kumar Misraa, He Zhang, Soo Ye Kim, Wei Xiong, Hareesh Ravi, Jing Shi, Midhun Harikumar, Zhe Lin, and Eli Shechtman
    Oct 2024
    US App. 18/913,107
  2. Patent
    One-click dynamic storyboarding using text guidance
    Pranav Aggarwal, Midhun Harikumar, and Aashish Kumar Misraa
    Oct 2024
    US App. 18/916,140
  3. Patent
    Multi-concept adaptor learning of multi-modal LLM for image diffusion model
    Yu-Jhe Li, Aashish Kumar Misraa, and Midhun Harikumar
    Oct 2024
    US App. 18/953,734
  4. Patent
    Stacked image generation for character consistency
    Pranav Aggarwal, Aashish Kumar Misraa, and Midhun Harikumar
    Oct 2024
    US App. 19/238,852
  5. Patent
    Self attention reference for improved diffusion personalization
    Nick Kolkin, Aashish Kumar Misraa, Midhun Harikumar, and Eli Shechtman
    Aug 2024
    US App. 18/187,915
  6. Patent
    Modality specific learnable attention for multi-conditioned diffusion models
    Hareesh Ravi, Aashish Kumar Misraa, and Ajinkya Kale
    Aug 2024
    US App. 18/817,692
  7. Patent
    Score based fine grained control of concept generation using DINO adapter
    Pranav Aggarwal, Aashish Kumar Misraa, Midhun Harikumar, Jing Shi, He Zhang, and Wei Xiong
    Jul 2024
    US App. 18/785,914

2022

  1. Patent
    Tracking unique face identities in videos
    Ali Aminian, Aashish Kumar Misraa, Kshitiz Garg, and Aseem Agarwala
    Nov 2022
    US 12,412,419
  2. Patent
    Processing framework for temporal-consistent face manipulation in videos
    Han Guo, Kshitiz Garg, Ali Aminian, Aashish Misraa, William Marino, and Nicolas Huynh Thien
    May 2022
    US App. 17/751,322

2021

  1. Patent
    Text-based framework for video object selection
    Shivam Nalin Patel, Kshitiz Garg, Han Guo, Ali Aminian, and Aashish Kumar Misraa
    Nov 2021
    US 12,266,181
  2. Patent
    Blur classification and blur map estimation
    Aashish Kumar Misraa and Zhe Lin
    Mar 2021
    US 11,816,181

2020

  1. Patent
    Unified framework for multi-modal similarity search
    Pranav Aggarwal, Ali Aminian, Ajinkya Kale, and Aashish Kumar Misraa
    Apr 2020
    US 11,500,939
  2. Paper
    Multi-Modal Retrieval using Graph Neural Networks
    Aashish Kumar Misraa, Ajinkya Kale, Pranav Aggarwal, and Ali Aminian
    Apr 2020
  3. Paper
    Waymo Driverless Car Data Analysis and Driving Modeling using CNN and LSTM
    Aashish Kumar Misraa, Naman Jain, and Saurav Singh Dhakad
    Apr 2020
    This work contributed to research acknowledged in the MDPI journal Applied Sciences

2017

  1. Paper
    An automatic detection of helmeted and non-helmeted motorcyclist with license plate extraction using convolutional neural network
    Jimit Mistry, Aashish K. Misraa, Meenu Agarwal, Ayushi Vyas, Vishal M. Chudasama, and Kishor P. Upla
    In 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Nov 2017
  2. Paper
    An analysis of non-immigrant work visas in the USA using Machine Learning
    Dhanasekar Sundararaman, Nabarun Pal, and Aashish Kumar Misraa
    Int. J. Comput. Sci. Secur. (IJCSS), Nov 2017