-
Yahoo Japan Corporation
- Japan
Stars
Official Code Base for KDD 2024 paper Extreme Meta-Classification for Large-Scale Zero-Shot Retrieval
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
QLoRA: Efficient Finetuning of Quantized LLMs
A CLI interface for Marp and Marpit based converters
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Guidance on documentation, scripts and integration steps on using the EDC project results
EDC core services including data plane and control plane
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of β¦
Framework for benchmarking vector search engines
π¦π Build context-aware reasoning applications
Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C , C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram π
Implementation of Continuous k-Nearest Neighbors in Python
Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.
State-of-the-Art Text Embeddings
Unsupervised text tokenizer for Neural Network-based text generation.
High-Resolution Image Synthesis with Latent Diffusion Models
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
An Engine-Agnostic Deep Learning Framework in Java
Open source platform for the machine learning lifecycle