Skip to content
View xuwenyihust's full-sized avatar

Block or report xuwenyihust

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

DeepSeek LLM: Let there be answers

Makefile 1,405 92 Updated Feb 4, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,467 1,680 Updated Sep 27, 2024

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

28,573 3,283 Updated Mar 25, 2024

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

Python 440 120 Updated Sep 16, 2024

MLeap: Deploy ML Pipelines to Production

Scala 1,500 310 Updated Jul 3, 2024

Streamlit — A faster way to build and share data apps.

Python 34,804 3,015 Updated Sep 27, 2024

PawMark is a platform for developers to build, schedule and monitor data pipelines.

JavaScript 29 Updated Sep 22, 2024

Practice machine learning/deep learning.

Jupyter Notebook 1 Updated Oct 9, 2023

Practice and tutorial-style notebooks covering wide variety of machine learning techniques

Jupyter Notebook 3,075 1,794 Updated May 22, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 166,956 44,169 Updated Sep 27, 2024

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 65,511 14,588 Updated Aug 7, 2024

A natural language interface for computers

Python 52,347 4,620 Updated Sep 26, 2024

Open source platform for the machine learning lifecycle

Python 18,399 4,159 Updated Sep 27, 2024

Jupyter handsontable integration

Python 540 68 Updated Jan 4, 2024

The official Notion API client library, but rewritten in Python! (sync async)

Python 1,756 140 Updated Jul 13, 2024

A Jupyter - Leaflet.js bridge

TypeScript 1,485 364 Updated Aug 21, 2024

Interactive Widgets for the Jupyter Notebook

TypeScript 3,137 948 Updated Sep 12, 2024

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 61,964 13,588 Updated Sep 27, 2024

A better notebook for Scala (and more)

Jupyter Notebook 4,514 393 Updated Aug 1, 2024

Jupyter Interactive Notebook

Jupyter Notebook 11,617 4,877 Updated Sep 9, 2024

Jupyter metapackage for installation, docs and chat

Python 14,873 4,072 Updated Sep 26, 2024

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,342 2,418 Updated Sep 27, 2024

Koalas: pandas API on Apache Spark

Python 3,329 357 Updated Mar 20, 2024

Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…

Scala 82 14 Updated Apr 2, 2024

Examples for High Performance Spark

Scala 498 233 Updated Aug 27, 2024

A data pipeline developing kit.

Java 1 Updated Apr 8, 2021

Flink 中文视频课程(持续更新...)

4,529 1,150 Updated Jun 18, 2020

VIP cheatsheets for Stanford's CS 229 Machine Learning

17,535 3,940 Updated May 20, 2020

A Python library for creating Github-style badges

Python 467 143 Updated Jun 6, 2024
Next