📢 Exciting News! DSFSI is at #EMNLP2024! We're thrilled to announce that two of our papers are being presented, highlighting our commitment to advancing #AfricanNLP and pushing the boundaries of #LowResourceLanguages in NLP. 📄 Paper 1: "From N-grams to Pre-trained Multilingual Models For Language Identification" Authors: Thapelo Sindane, Vukosi Marivate Summary: This work explores N-gram models and pre-trained multilingual models (like mBERT, RemBERT, XLM-r, and Afri-centric models such as AfriBERTa, Afro-XLMr, AfroLM, and Serengeti) for Language Identification (LID) across 11 South African languages. Through extensive experiments, we demonstrate that model choice and dataset selection are crucial, revealing Serengeti as a superior performer and introducing a lightweight BERT-based LID model (za_BERT_lid) that shows promising results. 📎 Read the full paper - https://lnkd.in/dr6AeEyn 📄 Paper 2: "Correcting FLORES Evaluation Dataset for Four African Languages" Authors: Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse Mbooi, Shamsuddeen Hassan Muhammad, PhD, Ibrahim Said Ahmad, Neo Putini, Mathebula Miehleketo, Matimba Shingange, Tajuddeen Gwadabe, Vukosi Marivate Sumamry: This paper outlines the corrections to the FLORES dataset for Hausa, Northern Sotho, Xitsonga, and isiZulu. Through a careful review by native speakers, we identified and corrected inaccuracies, enhancing the data’s reliability and improving the integrity of NLP evaluations, especially in machine translation. We emphasize the importance of native speakers' involvement for linguistic accuracy in future NLP endeavors. 📎 Read the full paper https://lnkd.in/dvxmVZmG Thapelo Sindane, an incoming PhD student (2025) and recent Masters graduate at DSFSI, will be at #EMNLP all week. Feel free to reach out to him to learn more about our work and our commitment to elevating African languages in NLP! 🌍✨ #EMNLP2024 #LanguageTechnology #MachineTranslation #AIforAfrica #DataScienceforSocialImpact
Data Science for Social Impact
Higher Education
Hatfield, Gauteng 1,921 followers
Data Science for Social Impact Research Group at the CS Department, University of Pretoria, South Africa.
About us
Data Science for Social Impact Research Group at the CS Department, University of Pretoria, South Africa.
- Website
-
https://dsfsi.github.io/
External link for Data Science for Social Impact
- Industry
- Higher Education
- Company size
- 11-50 employees
- Headquarters
- Hatfield, Gauteng
- Type
- Partnership
- Founded
- 2018
- Specialties
- Machine Learning, Data Science, Natural Language Processing, Web Mining, Social Media Mining, and African Natural Language Processing
Locations
-
Primary
Hatfield, Gauteng 0002, ZA
Employees at Data Science for Social Impact
-
Idris Abdulmumin
Postdoctoral Fellow, University of Pretoria DSFSI | Lecturer, Ahmadu Bello University, Zaria | Co-Founder, HausaNLP & ArewaDS | Member, MasaKhaneNLP
-
Seani Rananga
Computer Science Lecturer at the University of Pretoria and Senior Lab member(Python developer) at Data science for Social Impact(DFSI) Research…
-
Dawit Shibabaw
Data scientist | data analyst| Generative AI practitioner |Country ambassador of Zindi Data scientist Team | Data science for social impact research…
-
Christiaan Lombard
Computer Science Student
Updates
-
The deadline has been extended to 18 November 2024. Spread the news and share with your networks.
Calling all researchers and language enthusiasts! 🌟 Applications are now open for the Hundzula: NLP<>Linguistics Retreat 2025! Join us at the University of Johannesburg from Feb 18/19-22, 2025, for this unique event where African languages meet cutting-edge Natural Language Processing (NLP) technologies. 🔗 More information and apply https://lnkd.in/dWqEbii9 ⏳ Deadline: 11 November 2024 Don’t miss this opportunity to shape the future of language technologies.
Apply Now for the Hundzula: NLP<>Linguistics Retreat 2025 [Deadline: 11 Nov 2024]
dsup.substack.com
-
Together Data Science Law Lab 🚀
One night; two awards!!!! Incredibly thankful to have received the Exceptional Young Researcher Award and the certificate awarded in recognition of my The National Research Foundation of South Africa (NRF) C2 rating University of Pretoria ‘s Academic Achievers’ Awards ceremony on Tuesday (05 November 2024) evening. I have said it before and I will continue to say that God has truly blessed me to not just receive these huge honour but to be blessed by so many people who hold me up, who support and root for me. Such people are way too numerous to mention but I am happy that I directly give them their flowers whenever I can. 🌸🌸 My family and those whom I have the blessing to call friends continue to show up for me and I thank each of them and pray that God will always show up for them. Ndi oma, ndi oma ❤️ Grateful to everyone at the Data Science Law Lab for all the ways they root for and bring the African context into the room when it comes to technological innovation. Thankful to all our fantabulous advisors, collaborators and funders alike. Because of you all, I personally continue to grow as a researcher and thinker and our work continues to advance. I truly appreciate each of you. Unu adika! I must make special mention of everyone at Data Science for Social Impact and recently everyone at CIPIT for their super supportive and close collaboration as a result of which we have done and will continue to do the incredible and inspiring work. More deets about recent and future work in a bit. For now, let me say that: Onye nwelu unu, nwelu mmadu! Thank you to everyone at University Of Pretoria, Faculty of Law and to the University of Pretoria in general who show up for me. It means a lot to be able to count on your support. I can look to the future with excitement and enthusiasm for the work ahead and for the positive impact we will make in our world.
-
Data Science for Social Impact reposted this
📢💼 We Are Hiring! Attention University of Pretoria full-time 3rd year, honours/4th year or masters students! DSFSI is seeking a part-time Research assistant and/or Data engineer who will be working (10-20 hrs/week), starting January 2025. You will be involved with several innovative research projects currently underway within the group. Check out our past projects at https://buff.ly/3Avrg4J Ideal candidates should have or be willing to develop skills in machine learning, software development for pipelines, and data analysis. Individuals with diverse backgrounds in computational, mathematical, statistical, or engineering fields are invited to apply. Apply now to join our team and gain hands-on experience. Application deadline: 20 November 2024 For more info and to apply:
Google Forms: Sign-in
accounts.google.com
-
📢💼 We Are Hiring! Attention University of Pretoria full-time 3rd year, honours/4th year or masters students! DSFSI is seeking a part-time Research assistant and/or Data engineer who will be working (10-20 hrs/week), starting January 2025. You will be involved with several innovative research projects currently underway within the group. Check out our past projects at https://buff.ly/3Avrg4J Ideal candidates should have or be willing to develop skills in machine learning, software development for pipelines, and data analysis. Individuals with diverse backgrounds in computational, mathematical, statistical, or engineering fields are invited to apply. Apply now to join our team and gain hands-on experience. Application deadline: 20 November 2024 For more info and to apply:
Google Forms: Sign-in
accounts.google.com
-
We keep building upward 🚀
UP ABSA Chair of Data Science: University of Pretoria, Co-Founder: Lelapa AI, Co-Founder Deep Learning Indaba &Masakhane NLP. Interest in Data Science, Natural Language Processing & AI in general
Honoured to finally share this news. Last night, I had the profound privilege of receiving the Exceptional Young Researcher Award at the University of Pretoria ’s research awards. It’s an honour that reflects not just my journey, but the journey of so many remarkable people who have stood by me, lifted me up, and inspired me every step of the way. To my family, friends, and each of you who have walked this path with me—thank you. Your support has been my foundation, and I am endlessly grateful. I am especially indebted to the incredible team at the Data Science for Social Impact (DSFSI) lab, past and present members, who have poured their time, effort, and passion into advancing our research and creating real, lasting impact. Research is truly a collective endeavour, and each of you has contributed a vital piece to the work we do together. I am profoundly thankful for your dedication and camaraderie. My heartfelt gratitude also extends to our partners/collaborators at Lelapa AI, Masakhane, Deep Learning Indaba, the Data Science Law Lab, and so many others who have joined forces with us. Together, we are working to shine a light on African Machine Learning, AI, and Data Science—harnessing these fields to address the challenges that matter most to our communities. By focusing on solutions for our society, we’re building a legacy that I am proud to be part of. A special thanks to the Absa Group—Gavin Cope, Anna Nascimento, Christine Wu, and their entire team—who have championed our vision through the ABSA UP Chair of Data Science. Your belief in us has been invaluable. And to the UP Faculty of Engineering, Built Environment and Information Technology and to all the University of Pretoria staff who work with us, thank you for fostering an environment where innovation and purpose intersect. Though we’ve achieved so much together, I know there is still so much more to accomplish. I look forward to what lies ahead as we continue to push boundaries, create solutions, and work towards a brighter future for us all. We look forward in the next few months sharing what we have bene working on in 2024/2025 #OnwardsUpwards #AfricanMachineLearning
-
Data Science for Social Impact reposted this
Last month, The Center for Digital Humanities at Princeton University kicked off the African Languages in the Age of AI Speaker Series with a full house for Vukosi Marivate's talk, "A New Agenda for African Languages x AI: Everything, Everywhere, All At Once". ➡ 📹 The recording is now available online at: https://lnkd.in/exd4qT-V. In his talk, Professor Marivate (Chair of Data Science, Professor of Computer Science, University of Pretoria) discussed the crucial role of community building in developing technologies for African languages in the age of AI and touched upon the unique challenges and opportunities in fostering collaboration for African languages, developing technologies that respect and empower communities, and his vision for the future of technology and community engagements in African languages. Simon Gikandi (Chair of Princeton University's Department of English) served as respondent and moderator of the Q&A. The African Languages in the Age of AI speaker series is co-sponsored and supported by: the Africa World Initiative, Program in African Studies, and the African Humanities Colloquium / Princeton Institute for International and Regional Studies. Photography by Alison Nugent.
-
Catch up on the insightful #DS4SocietySeminar presented by Moseli Mots'oehli from the University of Hawaii at Manoa. The seminar, "Towards Safer Driving on Southern African Roads Through Assistive and Autonomous Annotation and Driving" focuses on enhancing assistive and autonomous perception software to promote road safety for drivers in Southern Africa. This initiative addresses the urgent need to reduce road accidents, with statistics showing a high rate of traffic incidents in the region. The full video recording is now available: https://lnkd.in/dfbrsYms #UPTuks #DSFSI #RoadSafety
DS4Society Seminar 2024: Moseli Mots'oehli - Towards Safer Driving on Southern African Roads
https://www.youtube.com/
-
On the 30th of September 2024 the Embassy of Sweden in Pretoria and the Computer Science department at the University of Pretoria UP Library hosted an enlightening AI Workshop. The focus was on Low Resource Languages and Large Language Models in Africa: https://lnkd.in/ddTb6vRS and The role of Academia, Industry and government in AI and R&D: https://lnkd.in/d6AKGibQ. Lesego Makhafola and Vukosi Marivate moderated the sessions. The 1st session focused on developing LLMs for low-resource languages in Africa, focusing on limited data and resource challenges. Emphasis was placed on fostering collaboration to create inclusive AI that supports diverse languages and preserves linguistic diversity. The 2nd session focused on AI's transformative potential in healthcare and education, alongside challenges for small businesses in adopting AI. This session emphasized collaboration between academia, industry, and government to boost AI education and policy support, while addressing ethical and sustainability concerns for inclusive, context-specific AI solutions in Africa. The panel included accomplished individuals such as Vukosi Marivate from DSFSI, Paul dos Santos from AI Sweden, Dr. Chijioke Okorie Okorie from Data Science Law Lab, Nomonde Katlego Khalo from the University of Cape Town, Martin Svensson from AI Sweden, Saidah Nash Carter from Bright Insight Global LLC and Greg Desilla from Melio AI. The discussions were insightful, sparking ideas for the future of AI in Africa. #UPEBIT #EmbassyofSwedenofPretoria #UPTuks UP Faculty of Engineering, Built Environment and Information Technology
-
Data Science for Social Impact reposted this
🚀 Exciting #INSPIRE FILM Update - Diabetes Management Through Innovation! 🚀 We are excited to share the highlights of our latest research project at Stellenbosch University's Division of Health Systems and Public Health. 🎥🌍 Discover how we are bridging #technology and #healthcare to create more inclusive solutions for global health. 📱🤝 Film Credits: 🎬 Film Production: The Dollie House 🎤 Speakers: - Prof. Lynn Hendricks - Gabriela Carolus - Dr. Jacki O'Neill - Prof. Simone Titus Dawson - Dr. M. Motloung - Mercy Muchai 👥 Participants: Pricilla Anthony; Keanu Jansen Van Vuuren; M Grantham; Lynn Booysen; Dr. DC Joubert; Karabo Dinelelo; Dr. Lee-Ann Gillion; Monique de Wit; Melissa Kelly; Gadija Claasen; Bridget McNulty; Shiara Pillay; David Titus; Nophiwe Job; Monika Deschodt; Lourentia van Wyk; Priscilla Adams; Jodie Layman-Lemphane; Zisikazi Jardine; Amber Cupido. 🤝 Supported by Prof. Rene' English, the Division of Health Systems and Public Health, Department of Global Health, the INSPIRE COLLAB Team (Mandisa Mireldah Mashaba; Nikki Thomas; Nicala Zeeman; Amina Abdullah), and the collaboration with the Data Science for Social Impact team (Prof Vukosi Marivate and Dr Hope Mogale), Boyd Migisha (Swansea University) and the Microsoft Research Africa, Nairobi Team (Mercy Muchai; Stephanie Nyairo; Najeeb G. Abdulhamid, PhD). #GenerativeAI #HealthcareInnovation #DiabetesCare #AIforGood #DigitalHealth #StellenboschUniversity #Storytelling #CoDesignWorkshop #TechForHealth #AIInHealthcare