Add MoViNet model #2304

Open
innat opened this issue Jan 18, 2024 · 6 comments

Comments

@innat
Contributor

innat commented Jan 18, 2024

Short Description

MoViNets: Mobile Video Networks for Efficient Video Recognition

Mobile Video Networks (MoViNets) are efficient video classification models runnable on mobile devices. MoViNets demonstrate state-of-the-art accuracy and efficiency on several large-scale video action recognition datasets.

On Kinetics 600, MoViNet-A6 achieves 84.8% top-1 accuracy, outperforming recent Vision Transformer models like ViViT (83.0%) and VATT (83.6%) without any additional training data, while using 10x fewer FLOPs. And streaming MoViNet-A0 achieves 72% accuracy while using 3x fewer FLOPs than MobileNetV3-large (68%).

Papers

MoViNets

Existing Implementations

Other Information

The streaming version of this model is quite impressive, and it would be a valuable addition.
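For readers unfamiliar with what "streaming" means here, the following is a toy sketch, not the MoViNet code: it assumes each frame is reduced to a single scalar feature and uses a 1D causal temporal convolution. The names `causal_conv_full` and `StreamingCausalConv` are hypothetical. It shows the general idea that a small buffer of past frames lets a model consume video one frame at a time while producing the same outputs as processing the whole clip at once.

```python
from collections import deque


def causal_conv_full(frames, kernel):
    """Causal temporal convolution over a whole clip (zero-padded on the left)."""
    k = len(kernel)
    padded = [0.0] * (k - 1) + list(frames)
    return [
        sum(w * x for w, x in zip(kernel, padded[t:t + k]))
        for t in range(len(frames))
    ]


class StreamingCausalConv:
    """The same convolution, fed one frame at a time with a fixed-size buffer."""

    def __init__(self, kernel):
        self.kernel = list(kernel)
        # The buffer holds the previous k-1 frames; it starts as zeros,
        # which plays the role of the left padding in the full-clip version.
        size = len(self.kernel) - 1
        self.buffer = deque([0.0] * size, maxlen=size)

    def step(self, frame):
        window = list(self.buffer) + [frame]
        out = sum(w * x for w, x in zip(self.kernel, window))
        self.buffer.append(frame)  # the oldest frame is dropped automatically
        return out


frames = [1.0, 2.0, 3.0, 4.0]   # hypothetical per-frame features
kernel = [0.5, 0.25, 0.25]      # hypothetical temporal kernel

full_outputs = causal_conv_full(frames, kernel)
stream = StreamingCausalConv(kernel)
stream_outputs = [stream.step(f) for f in frames]
```

Memory use in the streaming path depends only on the kernel length, not on the clip length, which is what makes this style of inference attractive on mobile devices.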

@divyashreepathihalli
Collaborator

Hi @innat, thank you for this suggestion. We will keep this issue open, but at this point it is a low priority for the team.

@innat
Contributor Author

innat commented Feb 1, 2024

@divyashreepathihalli Thanks for the confirmation. I pulled MoViNet out of the TF Model Garden and am maintaining it in a dedicated repo (private for now). The codebase is somewhat complex due to the large number of configurations. I will keep updating it, so please let me know when keras-cv is ready to take it.

@divyashreepathihalli
Collaborator

If you have code ready to go that works well across all backends, please feel free to open a PR. We will review it and add it.

@sachinprasadhs
Collaborator

Thanks for reporting the issue! We have consolidated the development of KerasCV into the new KerasHub package, which supports image, text, and multi-modal models. Please read the announcement. KerasHub will support all the core functionality of KerasCV.

KerasHub can be installed with !pip install -U keras-hub. Documentation and guides are available at keras.io/keras_hub.

With our focus shifted to KerasHub, we are not planning any further development or releases in KerasCV. If you encounter a KerasCV feature that is missing from KerasHub, or would like to propose an addition to the library, please file an issue with KerasHub.

@innat
Contributor Author

innat commented Jan 16, 2025

@sachinprasadhs

If you encounter a KerasCV feature that is missing from KerasHub, or would like to propose an addition to the library, please file an issue with KerasHub.

Firstly, identifying which feature is required or missing can be done effectively, either by practitioners or the Keras team. Tickets in keras-cv were created only when such features were NOT available or when a specific need was encountered. Commenting after a significant delay and requesting the recreation of the same issue is neither efficient nor appropriate. Kindly move this ticket (and any other similar tickets with the same context) from keras-cv to keras-hub.

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

@github-actions github-actions bot added the stale label Feb 14, 2025