
Organizations

@NTMC-Community


LLM101n: Let's build a Storyteller

28,963 1,584 Updated Aug 1, 2024

The official Meta Llama 3 GitHub site

Python 26,355 2,970 Updated Aug 12, 2024

Open weights LLM from Google DeepMind.

Python 2,410 305 Updated Sep 20, 2024

Paragraph-by-paragraph close readings of classic and recent deep learning papers

26,447 2,403 Updated Aug 8, 2024

Manipulating Python Programs

Python 344 20 Updated Sep 28, 2024

ACL2020 Tutorial: Open-Domain Question Answering

834 85 Updated Jan 1, 2021

This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.

Shell 456 65 Updated Apr 15, 2020

Overview of venues, research themes and datasets relevant for conversational search.

141 19 Updated Aug 9, 2022

Protocol Buffers - Google's data interchange format

C 65,333 15,452 Updated Sep 28, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Python 5,987 631 Updated Sep 28, 2024

Composable transformations of Python NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 30,058 2,753 Updated Sep 28, 2024

Trax — Deep Learning with Clear Code and Speed

Python 8,058 814 Updated Sep 10, 2024

Benchmarks of approximate nearest neighbor libraries in Python

Python 4,876 734 Updated Sep 2, 2024

TFX is an end-to-end platform for deploying production ML pipelines

Python 2,106 707 Updated Sep 27, 2024

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,240 887 Updated Sep 22, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C 10,116 1,166 Updated Sep 1, 2024

A curated question answering research dataset of factoid questions

HTML 49 18 Updated Nov 9, 2019

Standalone Krovetz stemmer

C 4 1 Updated Apr 12, 2018

Scripts to download and standardize TREC query and document sets

Makefile 46 4 Updated Aug 7, 2019

A clone of indri-5.12 with minor customizations.

C 25 4 Updated Sep 23, 2024

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C 4,785 583 Updated Sep 4, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C 13,136 1,160 Updated Jul 29, 2024

A library for efficient similarity search and clustering of dense vectors.

C 30,685 3,576 Updated Sep 26, 2024

Dataset accompanying the SPECTER model

Python 127 17 Updated Dec 19, 2022

SPECTER: Document-level Representation Learning using Citation-informed Transformers

Python 511 55 Updated Jun 12, 2023

Code for the ACL 2019 paper: Matching Article Pairs with Graphical Decomposition and Convolutions

Python 235 60 Updated Nov 30, 2020

Introductory deep learning tutorials and recommended articles (Deep Learning Tutorial)

Jupyter Notebook 14,108 3,509 Updated Apr 21, 2022

Detailed explanations of the formulas in "Machine Learning" (the "Watermelon Book")

23,843 4,739 Updated Aug 20, 2024

Dense Passage Retriever is a set of tools and models for the open-domain Q&A task.

Python 1,701 299 Updated Apr 6, 2023