WSL Research Workshop May 2024

The Web Science Lab (WSL), IIIT-B conducts a biannual research workshop where research scholars share their knowledge and the latest developments in their work. The event includes interactive brainstorming sessions and encourages discussions that give a fresh perspective on the ongoing research problems.

Date : May 14, 2024

Venue: Hybrid (Web Science Lab, A-132 & Online)

Schedule:

Sl.No.SpeakerTimeTitle
1Praseeda10:30 – 10:50Representing individualistic assimilation patterns through learning map
2Pooja10:50 – 11:10Intervention Science for Sustainable Development
3Asilata11:10 – 11:30What Makes Consent Meaningful? Situating meaningful consent within a social contract framework for data privacy
Break
4Bhoomika11:40 – 12:00Video Based Event Detection and Captioning for Vehicular Traffic to aid Scenario Search
5Anurag12:00 – 12:20Eduembedd – Knowledge Graph Embedding for Education domain
6Aparna12:20 – 12:40Retrieval Augmented Generation using Community Knowledge Corpus
Lunch Break
7Balambiga2:00 – 2:20Policy-based Consent Management Service for open ended dissemination of data in Digital Public Infrastructures
8Rohith2:20 – 2:40Ownership and Information Flow Primitives for Digital Public Infrastructures
Break
9Apurva2:50 – 3:10Accessing Data Through the Lens of SDGs
10Sarvesh
Manavi
3:10 – 3:30Dashboard for Learning Map
11Prof. Srinath & Prof. Sushree3:30 – 4:30Closing Remarks

Online attendees can join using the following link;

https://teams.microsoft.com/l/meetup-join/19%3ameeting_MTNhN2VjZjYtODdhMS00NWFiLTlkMzAtOTA3ZDgwZjJmNWI0%40thread.v2/0?context=%7b%22Tid%22%3a%2282a84c22-47b2-4612-b9f7-860f39eb9b12%22%2c%22Oid%22%3a%22c0cab96e-1626-4396-8188-c75dea19f8af%22%7d

Meeting ID: 457 818 670 744

Passcode: wfBifg

IndicNLP

IndicNLP project focuses on building an knowledge management framework for oral community knowledge in low-resource and colloquial Kannada language.

Background

Knowledge in rural communities is largely created, preserved, and is transferred verbally, and it is limited. This information is valuable to these communities, and managing and making it available digitally with state-of-the-art approaches enriches awareness and collective knowledge of people of these communities. The large amounts of data and information produced on the Internet are inaccessible to the population in these rural communities due to factors like lack of infrastructure, connectivity, and limited literacy. Knowledge internal to rural communities is also not conserved and made available in any global Big Data information systems. Artificial Intelligence (AI) technologies such as Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) provide substantial assistance when vast quantities of data, like Big Data, are available to build solutions. In the case of low-resource languages like Kannada and rural colloquial dialects, publicly available corpora are significantly less. Building state-of-the-art AI solutions is challenging in this context, and we address this problem in this work. Knowledge management in rural communities requires a low-cost and efficient approach that social workers can use. Organizations such as Namma Halli Radio have collected an audio corpus of a few hours containing community interactions spoken in colloquial language. We propose an architecture for oral knowledge management for rural communities speaking colloquial Kannada using audio recordings.

Funding Agency

Mphasis F1 foundation

Publications

M. Aparna, Sharath Srivatsa, G. Sai Madhavan, T. B. Dinesh, and Srinath Srinivasa. AI-based Assistance for Management of Oral Community Knowledge in Low-Resource and Colloquial Kannada language. International Conference on Big-Data-Analytics in Astronomy, Science and Engineering, BASE 2023, Springer LNCS. [to-appear]

Sharath Srivatsa, Aparna M, Sai Madhavan G, and Srinath Srinivasa. 2024. Knowledge Management Framework Over Low Resource Indian Colloquial Language Audio Contents. In Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD) (CODS-COMAD ’24). Association for Computing Machinery, New York, NY, USA, 553–557. https://doi.org/10.1145/3632410.3632483 

Aparna M and Srinath Srinivasa. 2023. Active learning for Named Entity Recognition in Kannada. TechRxiv. Preprint. https://doi.org/10.36227/techrxiv.24580582.v1

Media Mentions

Demo

Graama-Kannada Audio Search webapp : http://103.156.19.244:33035/,
(username : guest, password : guest123)

Graama-Kannada demo video:

People

Research Scholars

Project Students

  • Goutham U R
  • Ram Sai Koushik Polisetti
  • Sai Madhavan G
  • Kappagantula Lakshmi Abhigna
  • Manuj Malik
  • Debmalya Sen
  • Vikram Adithya C P
  • Venumula Sai Sumanth Reddy