Version 3.6.0 of my Image Description Toolkit is now available. The release features a new --redescribe option as well as support for two Microsoft Florence-2 models via Hugging Face.
Redescribe
The redescribe option can be used with the command line version of the toolkit to describe the same set of images again without repeating the video frame extraction or the HEIC to JPG conversion. This saves time and lets you quickly test different prompts on the same set of images. To use the option, add --redescribe as the first option in an “idt workflow” command, followed by the standard --provider, --model and --prompt-style options. Use the workflow directory from a previous image description workflow as the file directory for the images to be described. The product documentation has full details.
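As a rough illustration of the option order described above, here is a small Python sketch that assembles such a command line. The helper name, the provider, model and prompt style values, and the directory name are all hypothetical placeholders, and placing the workflow directory at the end of the command is an assumption on my part; check the product documentation for the exact syntax.

```python
import shlex


def build_redescribe_command(workflow_dir, provider, model, prompt_style):
    """Assemble the argument list for a redescribe run (hypothetical helper)."""
    return [
        "idt", "workflow",
        "--redescribe",                  # first option, as described above
        "--provider", provider,
        "--model", model,
        "--prompt-style", prompt_style,
        workflow_dir,                    # assumed position for the previous workflow directory
    ]


# Placeholder values only; substitute a real provider, model and prompt style.
cmd = build_redescribe_command(
    "wf_previous_run", "example-provider", "example-model", "example-style"
)
print(shlex.join(cmd))
```

The printed line can be pasted into a terminal once the placeholders are replaced, or the list can be passed to subprocess.run to launch the workflow from a script.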
Hugging Face and Florence Models
Hugging Face is described as “The platform where the machine learning community collaborates on models, datasets, and applications.” It contains a wide range of AI models, datasets and apps across the full spectrum of AI tasks.
The Florence-2 AI model is described in part as, “an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.” It runs locally and requires no pre-installation. Note that the model downloads on first use, so expect the first image description run to take longer. Also be aware that, for a variety of reasons, the Florence-2 models are only supported in the command line version of the Image Description Toolkit.
You can get the latest update for the Image Description Toolkit from my projects page at www.theideaplace.net/projects or the GitHub release page.
The link on the GitHub Releases page is broken. When I click to download the installer, I get a “not found” error.
Thank you for letting me know. This should be corrected now.