Skip to content
View dayyass's full-sized avatar
πŸš€
Rocket Science
πŸš€
Rocket Science

Block or report dayyass

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dayyass/README.md




Hi, my name is Dani πŸ‘‹ and I ❀️ AI and Open-Source

Field of interests: LLM, NLP, RL, Graphs, Distributed Systems

My telegram channel: Cat's Shredinger

Skills πŸ› οΈ

  • Languages:Β  Python, SQL
  • DS/ML/DL: Β Β  SkLearn, PyTorch, Transformers
  • Big Data: Β Β Β Β Β  Hadoop, Spark
  • DevOps:  Β Β Β Β  Linux, Git, Docker

Work experience πŸ‘”

Job Position Company Field Work Period
Head of AI Transformation Social Discovery Group LLM, Conversational AI 2024-05 β€” now
Research Scientist Lead SberDevices LLM, GigaChat 2023-04 β€” 2024-05
NLP Team Lead SberDevices Search, Information Retrieval 2022-10 β€” 2023-04
NLP Tech Lead Sber AI Lab NLP, MLOps, Mentoring 2021-05 β€” 2022-10
Senior NLP Engineer Tinkoff AI Lab Virtual Assistant "Oleg" 2021-02 β€” 2021-04
Middle NLP Engineer MTS AI Lab NER with Pseudo-Labeling 2020-05 β€” 2021-02
Junior Data Scientist Sberbank ML with Tabular Data, CV 2018-07 β€” 2020-05

Education πŸŽ“

Projects 🐾

  • MUSE TF -> PT - convert Multilingual Universal Sentence Encoder from TensorFlow to PyTorch and ONNX
  • QaNER - unofficial implementation of QaNER paper (NER via Extractive Question Answering)
  • RLLib - Reinforcement Learning library
  • MUSE as Service - REST API for sentence embedding using Multilingual Universal Sentence Encoder
  • PyTorch NER - pipeline for training NER models using PyTorch
  • Text Classification Baseline - pipeline for building text classification TF-IDF LogReg baselines
  • Graph-Based Clustering - clustering using graph connected components and spanning trees

Public talks πŸ—£

Certifications πŸ“œ

Hackathon participation πŸ’»

Achievements πŸ†

  • Key contributor to GigaChat: Russian most advanced LLM
  • 500 stars on GitHub and 10 packages in PyPI with 38k downloads
  • Contributor to PyTorch, Scikit-Learn, SciPy
  • Open Data Science Best Contributor 2020

GitHub Stats ⭐

Dani El-Ayyass' github stats

More information in my LinkedIn πŸš€

Pinned Loading

  1. QaNER QaNER Public

    Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.

    Python 65 6

  2. rllib rllib Public

    Reinforcement Learning Library.

    Python 29

  3. pydfs pydfs Public

    Distributed File System written in Python

    Python 15

  4. muse-as-service muse-as-service Public

    REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.

    Python 52 5

  5. text-classification-baseline text-classification-baseline Public

    Pipeline for fast building text classification TF-IDF LogReg baselines.

    Python 63 4

  6. pytorch-ner pytorch-ner Public

    Pipeline for training NER models using PyTorch.

    Python 54 7