Lists (1)
Sort Name ascending (A-Z)
Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.
curl-impersonate: A special build of curl that can impersonate Chrome & Firefox
A WebSocket (RFC6455) library written in Rust
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Lexbor is development of an open source HTML Renderer library. https://lexbor.com
Fast HTML5 Parser with CSS selectors. This is successor of myhtml and expected to be faster and use less memory.
The fast Rust-based web bundler with webpack-compatible API 🦀️
DeepSeek Coder: Let the Code Write Itself
Lightweight, super fast C/C (& Python) library for sequence alignment using edit (Levenshtein) distance.
Library for fast text representation and classification.
ChatGPT with superpowers! Search chat history, create folders, export all chats, pin messages, access thousands of community prompts, incognito mode, language and tone selection, and many more feat…
📚🔥收集全网最热门的技术书籍 (GO、黑客、Android、计算机原理、人工智能、大数据、机器学习、数据库、PHP、java、架构、消息队列、算法、python、爬虫、操作系统、linux、C语言),不间断更新中♨️
Stable Diffusion with Core ML on Apple Silicon
The C Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C
low-cost, high-efficiency, easy-to-implement
An open-source C library developed and used at Facebook.
Yet Another Python Profiler, but this time multithreading, asyncio and gevent aware.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Java implementation of the Internet Research Lab Web Crawler (IRLbot) as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in their paper "IRLbot: Scaling to 6 Billion …
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用