Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature List of EVERYTHING #36

Open
7 of 17 tasks
x-CK-x opened this issue Jun 16, 2024 · 1 comment
Open
7 of 17 tasks

Feature List of EVERYTHING #36

x-CK-x opened this issue Jun 16, 2024 · 1 comment
Labels
documentation Improvements or additions to documentation help wanted Extra attention is needed

Comments

@x-CK-x
Copy link
Owner

x-CK-x commented Jun 16, 2024

A Feature List of everything being added in the short to long term future.

Suggestions Welcome

I will move things off this list into individual tickets to be worked on as time permits.:

- new models for tagging and/or captioning
- new augmentation options
- new image board support (anime & gelbooru)
- two new image tagging modes for both tag based models and caption based models
- a new UI for both modes
- a new layout
- a better way of outlining tags and words in different categories
- faster response times with tag/word suggestions
------------
- segmentation & object detection model/s integration
- a new feature to automatically curate data for the user according to a few questions the user has to answer to determine the heuristics of the model
- cogVLM support
- LLM support for optionally improving captions i.e. cogVLM models struggle with NSFW tagging, so it might be better to use tags resulting from a tagging model and have an LLM re-format the tags into a caption; and then as a second pass have the VLM act as validation to those captions. And/or use an LLM to also merge the LLM caption and the cogVLM caption together into 1 singular caption that takes certain semantics of both captions to make a better one.
- more user configurable options to caption their data
- new methods of pruning tags, again based on various heuristics from Q&A with the user/s and an LLM and/or VLM
- a new WIKI for the tutorials
- super resolution model & denoising model support to (remove) noise artifacts or adversarial attacks from images the user may want to train with; i.e. mitigating some of the effects of Nightshade & Glaze
---------
- new tag, image, & caption statistics and visualization tools for the user to use from a data scientist perspective on how to best choose their data and augment their data
- a custom trained vision classifier on images with [nightshade, glaze, both, nothing]. To allow the user to know which data has been poisoned by artists etc. and if they need to be de-noised / upscaled on to mitigate the effects to some extent
---------
- tagging/captioning model/s will be downloaded on the fly if the option on the dropdown menu is selected, instead of having to go to the download tab to grab it beforehand.
  • Update the existing Version_3 WebUI WIKI Page for the Version_4 WebUI
  • Finish Code Refactor
  • Conda setup instructions
  • CSV load time optimization with the pandas framework
  • .sh & .bat installer scripts for conda
  • Image Board manager class object
  • PNG Info & tag combination options

NEW Features Paused as of (09/05/2023) :: unless there are willing contributors to develop any of the other features.

New image board specific tagging/captioning models will be supported as they are released :: (There is "no" current eta. on the progress of those models being developed by others)

Contributors are welcome to open a Pull Request for their developments & I will promptly review it to be added

@x-CK-x x-CK-x added documentation Improvements or additions to documentation help wanted Extra attention is needed TICKET_ACTIVE labels Jun 16, 2024
@ProxyPlayerHD
Copy link

one feature i'd love to see added is the ability to abort the current operation.
currently that is only possible by closing/quitting the terminal running the program, which isn't great.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants