Features

Searchable database for unstructured data

Quickstart | Docs | Tutorials | Chat

Check out our blog post to grasp what we have been doing for the last months.

NucliaDB is a distributed search engine built from the ground up to offer high accuracy and semantic search on unstructured data. By mere mortals for mere mortals, NucliaDB's architecture is as simple as possible to be scalable and deliver what an NLP Database requires

NucliaDB is written in Rust and Python and built on top of the mighty tantivy library. We designed it to index big datasets and provide multi-teanant suport.

Features

Store original data, extracting and understanding data on object and blob storage
Index fields, paragraphs, and semantic sentences on index storage
Cloud extraction and understanding with Nuclia Understanding API™
Cloud connection to train ML models with Nuclia Learning API™
Container security based with Reader, Manager, Writer Roles
Resources with multiple fields and metadata
Text/HTML/Markdown plain fields support
File field support with direct upload and TUS upload
Link field support
Conversation field support
Blocks/Layout field support
Eventual consistency transactions based on Nats.io
Distributed source of truth with TiKV and Redis support
Blob support with S3-compatible API and GCS
Replication of index storage
Distributed search
Cloud-native: Kubernetes only

Upcomming Features

Blob support with Azure Blob storage
Index relations on index storage

Architecture

Quickstart

Get a NucliaDB token to connect to Nuclia Understanding API™

Only needed if you want to use Nuclia Understanding API™ and Nuclia Learning API™

Start NucliaDB minimal

First we need object storage and blob storage

docker run redis
docker run minio

TODO

Create a Knowledge box container

curl http://localhost:8080/v1/kb \
  -X POST \
  -H "X-NUCLIADB-ROLES: MANAGER" \

Upload a file

After starting NucliaDB and creating a Knowledge Box you can upload a file:

curl http://localhost:8080/v1/kb/<your-knowledge-box-id>/upload \
  -X POST \
  -H "X-NUCLIADB-ROLES: WRITER" \
  -T /path/to/file

Search a file

After starting NucliaDB and creating a Knowledge Box you can upload a file:

curl http://localhost:8080/v1/kb/<your-knowledge-box-id>/search \
  -X GET \
  -H "X-NUCLIADB-ROLES: READER" \

API Tutorials

Upload a file

💬 Community

Chat with us in Discord
📝 Blog Posts
Follow us on Twitter

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

The core difference and advantage of NucliaDB is its architecture built from the ground up for cloud and unstructured data. Its vector index plus standard keyword and fuzzy search provide an API to use all extracted and learned information from Nuclia, understanding API and provide super NLP powers to any application with low code and peace of mind.

What license does NucliaDB use?

NucliaDB is open-source under the GNU Affero General Public License Version 3 - AGPLv3. Fundamentally, this means that you are free to use Quickwit for your project, as long as you don't modify NucliaDB. If you do, you have to make the modifications public.

What is Nuclia's business model?

Our business model relies on our Nuclia Learning API and Nuclia Understanding API. We also offer NucliaDB as a service at our multi-cloud provider infrastructure: https://nuclia.cloud.

🤝 Contribute and spread the word

We are always super happy to have contributions: code, documentation, issues, feedback, or even saying hello on discord! Here is how you can get started:

Have a look through GitHub issues labeled "Good first issue".
Read our Contributor Covenant Code of Conduct
Create a fork of NucliaDB and submit your pull request!

✨ And to thank you for your contributions, claim your swag by emailing us at info at nuclia.com.

Name		Name	Last commit message	Last commit date
Latest commit History 460 Commits
.github		.github
charts		charts
config		config
docs		docs
mypy_stubs		mypy_stubs
nucliadb		nucliadb
nucliadb_byte_rpr		nucliadb_byte_rpr
nucliadb_cluster		nucliadb_cluster
nucliadb_dataset		nucliadb_dataset
nucliadb_fields_tantivy		nucliadb_fields_tantivy
nucliadb_ingest		nucliadb_ingest
nucliadb_models		nucliadb_models
nucliadb_node		nucliadb_node
nucliadb_one		nucliadb_one
nucliadb_paragraphs_tantivy		nucliadb_paragraphs_tantivy
nucliadb_protos		nucliadb_protos
nucliadb_reader		nucliadb_reader
nucliadb_relations		nucliadb_relations
nucliadb_search		nucliadb_search
nucliadb_service_interface		nucliadb_service_interface
nucliadb_services		nucliadb_services
nucliadb_telemetry		nucliadb_telemetry
nucliadb_train		nucliadb_train
nucliadb_utils		nucliadb_utils
nucliadb_vectors		nucliadb_vectors
nucliadb_writer		nucliadb_writer
.dockerignore		.dockerignore
.gitignore		.gitignore
.license_header.txt		.license_header.txt
.licenserc.yaml		.licenserc.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CODE_STYLE_PYTHON.md		CODE_STYLE_PYTHON.md
CODE_STYLE_RUST.md		CODE_STYLE_RUST.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
Dockerfile.basenode		Dockerfile.basenode
Dockerfile.cluster_monitor		Dockerfile.cluster_monitor
Dockerfile.ingest		Dockerfile.ingest
Dockerfile.ingest.MacOs		Dockerfile.ingest.MacOs
Dockerfile.node		Dockerfile.node
Dockerfile.node_local		Dockerfile.node_local
Dockerfile.node_sidecar		Dockerfile.node_sidecar
Dockerfile.one		Dockerfile.one
Dockerfile.reader		Dockerfile.reader
Dockerfile.search		Dockerfile.search
Dockerfile.train		Dockerfile.train
Dockerfile.writer		Dockerfile.writer
LICENSE.txt		LICENSE.txt
LICENSE_AGPLv3.0.txt		LICENSE_AGPLv3.0.txt
Makefile		Makefile
NucliaDB_individual_CLA.md		NucliaDB_individual_CLA.md
README.MacOs.md		README.MacOs.md
README.md		README.md
VERSION		VERSION
code-requirements.txt		code-requirements.txt
deny.toml		deny.toml
docker-compose-deps.yaml		docker-compose-deps.yaml
docker-compose-distributed.yaml		docker-compose-distributed.yaml
docker-compose-one.yaml		docker-compose-one.yaml
docker-compose.yaml		docker-compose.yaml
mypy.ini		mypy.ini
openapi.yaml		openapi.yaml
rustfmt.toml		rustfmt.toml
search.key		search.key
test-requirements.txt		test-requirements.txt
test_logging.ini		test_logging.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Searchable database for unstructured data

Quickstart | Docs | Tutorials | Chat

Check out our blog post to grasp what we have been doing for the last months.

Features

Upcomming Features

Architecture

Quickstart

Get a NucliaDB token to connect to Nuclia Understanding API™

Start NucliaDB minimal

Create a Knowledge box container

Upload a file

Search a file

API Tutorials

💬 Community

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

What license does NucliaDB use?

What is Nuclia's business model?

🤝 Contribute and spread the word

Reference

Meta

About

Releases

Packages

Languages

License

fossabot/nucliadb

Folders and files

Latest commit

History

Repository files navigation

Searchable database for unstructured data

Quickstart | Docs | Tutorials | Chat

Check out our blog post to grasp what we have been doing for the last months.

Features

Upcomming Features

Architecture

Quickstart

Get a NucliaDB token to connect to Nuclia Understanding API™

Start NucliaDB minimal

Create a Knowledge box container

Upload a file

Search a file

API Tutorials

💬 Community

🙋 FAQ

How is NucliaDB different from traditional search engines like Elasticsearch or Solr?

What license does NucliaDB use?

What is Nuclia's business model?

🤝 Contribute and spread the word

Reference

Meta

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages