Feature List of EVERYTHING #36

x-CK-x · 2024-06-16T16:29:19Z

A Feature List of everything being added in the short to long term future.

Suggestions Welcome

I will move things off this list into individual tickets to be worked on as time permits.:

- new models for tagging and/or captioning
- new augmentation options
- new image board support (anime & gelbooru)
- two new image tagging modes for both tag based models and caption based models
- a new UI for both modes
- a new layout
- a better way of outlining tags and words in different categories
- faster response times with tag/word suggestions
------------
- segmentation & object detection model/s integration
- a new feature to automatically curate data for the user according to a few questions the user has to answer to determine the heuristics of the model
- cogVLM support
- LLM support for optionally improving captions i.e. cogVLM models struggle with NSFW tagging, so it might be better to use tags resulting from a tagging model and have an LLM re-format the tags into a caption; and then as a second pass have the VLM act as validation to those captions. And/or use an LLM to also merge the LLM caption and the cogVLM caption together into 1 singular caption that takes certain semantics of both captions to make a better one.
- more user configurable options to caption their data
- new methods of pruning tags, again based on various heuristics from Q&A with the user/s and an LLM and/or VLM
- a new WIKI for the tutorials
- super resolution model & denoising model support to (remove) noise artifacts or adversarial attacks from images the user may want to train with; i.e. mitigating some of the effects of Nightshade & Glaze
---------
- new tag, image, & caption statistics and visualization tools for the user to use from a data scientist perspective on how to best choose their data and augment their data
- a custom trained vision classifier on images with [nightshade, glaze, both, nothing]. To allow the user to know which data has been poisoned by artists etc. and if they need to be de-noised / upscaled on to mitigate the effects to some extent
---------
- tagging/captioning model/s will be downloaded on the fly if the option on the dropdown menu is selected, instead of having to go to the download tab to grab it beforehand.

Update the existing Version_3 WebUI WIKI Page for the Version_4 WebUI
Finish Code Refactor
Conda setup instructions
CSV load time optimization with the pandas framework
.sh & .bat installer scripts for conda
Image Board manager class object
PNG Info & tag combination options

NEW Features Paused as of (09/05/2023) :: unless there are willing contributors to develop any of the other features.

New image board specific tagging/captioning models will be supported as they are released :: (There is "no" current eta. on the progress of those models being developed by others)

Contributors are welcome to open a Pull Request for their developments & I will promptly review it to be added

The text was updated successfully, but these errors were encountered:

ProxyPlayerHD · 2024-11-23T10:07:00Z

one feature i'd love to see added is the ability to abort the current operation.
currently that is only possible by closing/quitting the terminal running the program, which isn't great.

x-CK-x added documentation Improvements or additions to documentation help wanted Extra attention is needed TICKET_ACTIVE labels Jun 16, 2024

x-CK-x removed the TICKET_ACTIVE label Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature List of EVERYTHING #36

Feature List of EVERYTHING #36

x-CK-x commented Jun 16, 2024 •

edited

Loading

ProxyPlayerHD commented Nov 23, 2024

Feature List of EVERYTHING #36

Feature List of EVERYTHING #36

Comments

x-CK-x commented Jun 16, 2024 • edited Loading

ProxyPlayerHD commented Nov 23, 2024

x-CK-x commented Jun 16, 2024 •

edited

Loading