research

2025

  1. Paper
    dla_teaser.png
    DLA: Dual Layer Aggregation for Squeezing Capacity of Multimodal Large Language Models for Subject-driven Generation
    Shuhong Zheng, Aashish Kumar Misraa, Yu-Teng Li, Yu-Jhe Li, and Igor Gilitschenski
    2025
    Under Review
  2. Product
    layered_image_editing_teaser.png
    Layered Image Editing
    2025
    part of V5 release
  3. Product
    adobe_firefly_five.webp
    Firefly Image Generation & Editing V5
    2025
  4. Paper
    visual_persona_teaser.png
    Visual Persona: Foundation Model for Full-Body Human Customization
    Jisu Nam, Soowon Son, Zhan Xu, Jing Shi, Difan Liu, Feng Liu, Aashish Misraa, Seungryong Kim, and Yang Zhou
    2025
    a version published at CVPR 2025
  5. Patent
    cz_person_teaser.png
    Zero-shot person customization in diffusion model
    Aashish Kumar Misraa, Pranav Aggarwal, and Midhun Harikumar
    Jan 2025
    US App. 19/372,643

2024

  1. Product
    firefly_custom_models.webp
    Firefly Custom Models
    Nov 2024
  2. Patent
    cz_teaser.png
    Content customization and composition in diffusion
    Pranav Aggarwal, Aashish Kumar Misraa, He Zhang, Soo Ye Kim, Wei Xiong, Hareesh Ravi, Jing Shi, Midhun Harikumar, Zhe Lin, and Eli Shechtman
    Oct 2024
    US App. 18/913,107
  3. Patent
    storyboarding_teaser.png
    One-click dynamic storyboarding using Text Guidance
    Pranav Aggarwal, Midhun Harikumar, and Aashish Kumar Misraa
    Oct 2024
    US App. 18/916,140
  4. Patent
    llava_unet_teaser.png
    Multi-concept adaptor learning of multi-modal LLM for image diffusion model
    Yu-Jhe Li, Aashish Kumar Misraa, and Midhun Harikumar
    Oct 2024
    US App. 18/953,734
  5. Patent
    stacked_teaser.png
    Stacked image generation for character consistency
    Pranav Aggarwal, Aashish Kumar Misraa, and Midhun Harikumar
    Oct 2024
    US App. 19/238,852
  6. Patent
    selfattref_teaser.png
    Self attention reference for improved diffusion personalization
    Nick Kolkin, Aashish Kumar Misraa, Midhun Harikumar, and Eli Shechtman
    Aug 2024
    US App. 18,187,915
  7. Patent
    sepca_teaser.png
    Modality specific learnable attention for multi-conditioned diffusion models
    Hareesh Ravi, Aashish Kumar Misraa, and Ajinkya Kale
    Aug 2024
    US App. 18/817,692
  8. Product
    firefly2_style.png
    Generative Image Stylization
    Aug 2024
    part of V2 release
  9. Product
    firefly2_structure.png
    Generative Image Structure Match
    Aug 2024
    part of V2 release
  10. Product
    firefly2_photo.png
    Generative Image Content Control
    Aug 2024
    part of V2 release
  11. Product
    adobe_firefly_two.avif
    Firefly Image Generation V2
    Aug 2024
  12. Patent
    dinoscore_teaser.png
    Score based fine grained control of concept generation using DINO adapter
    Pranav Aggarwal, Aashish Kumar Misraa, Midhun Harikumar, Jing Shi, He Zhang, and Wei Xiong
    Jul 2024
    US App. 18/785,914

2023

  1. Product
    adobe_firefly.jpg
    Firefly Image Generation V1
    Jul 2023
  2. Product
    project_blink.jpg
  3. Product
    remove_background.avif
    Remove Video Background
    Jul 2023

2022

  1. Patent
    facesearch_teaser.gif
    Tracking unique face identities in videos
    Ali Aminian, Aashish Kumar Misraa, Kshitiz Garg, and Aseem Agarwala
    Nov 2022
    US 12,412,419
  2. Patent
    morpheus_teaser.png
    Processing framework for temporal-consistent face manipulation in videos
    Han Guo, Kshitiz Garg, Ali Aminian, Aashish Misraa, William Marino, and Nicolas Huynh Thien
    May 2022
    US App. 17/751,322
  3. Product
    project_morpheus.gif
    Morpheus: temporal-consistent face manipulations in videos
    May 2022
  4. Product
    facesearch_ppro.avif
    Face Detection & Identification in After Effects, Premiere Pro, and Elements workflows
    May 2022

2021

  1. Product
    culling.avif
    Selecting Best Photos with Assisted Culling
    Dec 2021
    contributions include blur estimation, eye focus, best frame detection and model compression
  2. Patent
    senselect_teaser.gif
    Text-based framework for video object selection
    Shivam Nalin Patel, Kshitiz Garg, Han Guo, Ali Aminian, and Aashish Kumar Misraa
    Nov 2021
    US 12,266,181
  3. Patent
    blur_teaser.png
    Blur classification and blur map estimation
    Aashish Kumar Misraa and Zhe Lin
    Mar 2021
    US 11,816,181
  4. Product
    stock_shot.jpeg
    Shot angle and size classifiers for stock videos and images
    Mar 2021

2020

  1. Patent
    sensage_inf_teaser.png
    Unified framework for multi-modal similarity search
    Pranav Aggarwal, Ali Aminian, Ajinkya Kale, and Aashish Kumar Misraa
    Apr 2020
    US 11,500,939
  2. Paper
    sensage_teaser.png
    Multi-modal retrieval using graph neural networks
    Aashish Kumar Misraa, Ajinkya Kale, Pranav Aggarwal, and Ali Aminian
    Apr 2020
  3. Paper
    waymo_teaser.png
    Waymo driverless car data analysis and driving modeling using CNN and LSTM
    Aashish Kumar Misraa, Naman Jain, and Saurav Singh Dhakad
    Apr 2020
    This work contributed to research acknowledged in MDPI Journal of Applied Sciences

2017

  1. Paper
    helmet_teaser.png
    An automatic detection of helmeted and non-helmeted motorcyclist with license plate extraction using convolutional neural network
    Jimit Mistry, Aashish K. Misraa, Meenu Agarwal, Ayushi Vyas, Vishal M. Chudasama, and Kishor P. Upla
    In 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA), Nov 2017
  2. Paper
    An analysis of non-immigrant work visas in the USA using Machine Learning
    Dhanasekar Sundararaman, Nabarun Pal, and Aashish Kumar Misraa
    Int. J. Comput. Sci. Secur. (IJCSS), Nov 2017
  3. OSS
    scilab.png
    Scilab: Memory & Performance Improvements
    Nov 2017