Workshop on Big Data Engineering (BDE 2017)

This workshop is part of the project Co-creation of a Center of Excellence in Big Data Engineering, a collaboration between the International Institute of Information Technology (IIIT-B) and City University London to set up a Centre of Excellence in Big Data Engineering. The objectives of the centre are to co-create research agendas, curricula and outreach programmes in Big Data.

The abstracts of the talks can be accessed here.

The main workshop web page can be accessed here.

Workshop agenda:

Activity log 2017

5 September 2017. Gooru Labs was formally inaugurated at IIIT Bangalore by Prof. Rajagopalan, Dr. Sridhar Mitta and Dr. Prasad Ram.

4 August 2017. Asha Subramanian completed her open seminar entitled “Semantic Integration and Knowledge Representation of Open Data Powered by Linked Open Data”, a prerequisite to the submission of her PhD thesis.

21-30 June 2017. Srinath Srinivasa visited Gooru HQ at Redwood City, California as part of the ongoing collaboration for setting up Gooru Labs at IIIT Bangalore.

5–8 June 2017. Asha Subramanian presented her work at the Data Science Congress 2017, held at the CIDCO Convention Centre, Vashi, Navi Mumbai, Maharashtra, India. The abstract of the paper can be found here.

27 April 2017. Raksha P S finished her PhD Comprehensive exam.

24 April 2017. Final Project reviews for Semester Jan-May 2017 at Web Science Lab IIIT Bangalore.

19–20 April 2017. Workshop on Big Data Engineering at IIIT Bangalore. This workshop is part of the project Co-creation of a Center of Excellence in Big Data Engineering, a collaboration between IIIT-B and City University London to set up a Centre of Excellence in Big Data Engineering.

8 April 2017. Asha Subramanian and Raksha P S presented a poster and demo of their work at the RISE “Open House,” IIIT Bangalore.

7 April 2017. Asha Subramanian and Raksha P S presented their work at PhD Colloquium, IIIT Bangalore.

29 March 2017. Srinath Srinivasa, Dean (R&D), IIIT Bangalore. Attended the European Research Council (ERC) meeting in Delhi, representing IIIT Bangalore.

15 March 2017. Visit of Prasad Ram (Pram), founder and CEO of Gooru.org, to the Web Science Lab, IIIT Bangalore.

13 March 2017. Project Review 2 of the projects at Web Science Lab, IIIT Bangalore.

8 February 2017. Srinath Srinivasa. Took office as the Dean (R&D) of IIIT Bangalore.

6 February 2017. Project Review 1 of the projects at Web Science Lab, IIIT Bangalore.

9–13 January 2017. Srinath Srinivasa. Visited Gooru HQ at Redwood City, California, as part of the continuing collaboration initiative.

Web Sciences Lab Workshop – 19th December 2016

WSL Workshop Dec 2016

Date: 19th December 2016

Venue: Room no 226, IIIT Bangalore

Time: 9:30 AM to 3:30 PM

We are conducting a one-day workshop to collate and present research work by research scholars at the Web Sciences Lab, IIIT Bangalore. Research scholars will present their work, discuss ideas, share problems encountered, reflect on their work and provide updates on their progress.

The schedule for the workshop is as follows:

Time Task
9:30 – 9:45 Overview of the work done by the lab in the past 6 months – Prof. Srinath Srinivasa
10:30 – 11:00 Inferencing in the Large: Towards Automation of Semantic Integration and Knowledge Representation of Open Data – Presenter: Asha Subramanian
11:00 – 11:30 A talk on Trust and Mediation – Presenter: Praseeda
11:30 – 12:00 Narratives Plot Comparison – Presenter: Sharath Srivatsa
12:00 – 12:30 Framework for Mediation Driven Learning – Presenter: Chaitali Diwan
12:30 – 1:30 Break for lunch
Afternoon
1:30 – 2:00 A talk on The Marketplace of Opinions – Presenter: Raksha
2:00 – 2:30 Semantic Summarization from User Generated Short Reports – Presenter: Jaya
2:30 – 3:30 Open discussion with all the participants on “Research and Me”

The abstracts of various talks are given below.

Title: Inferencing in the Large: Towards Automation of Semantic Integration and Knowledge Representation of Open Data

Abstract: Data available in the public domain, especially through open data initiatives such as data.gov, data.gov.in and data.gov.uk, provides useful information on various aspects of government policies and administration. One could derive immense insights by semantically integrating such datasets across various domains. Semantic integration involves extracting the common domains or themes that explain a collection of datasets, by identifying unique resources for data values and relations amongst rows of data across these datasets using known or custom vocabularies and knowledge bases. The natural taxonomy and classification of the entities, instances and properties in the vocabularies allow for extraction of themes relevant to the datasets. Multiple research efforts have addressed the problem of semantic annotation of web tables and CSV tables, which mainly involves interpreting tabular data by linking them to relevant vocabularies; however, they have not focussed on the problem of semantic integration of tables. Linking government data is an active research interest. The current process for semantically linking such datasets is largely manual: it involves identifying vocabularies, classes and properties for each dataset by hand and creating templates, which then automate the process of mapping the data to the identified vocabularies.
Our work presents two models: 1) the generation of semantically linked data for the open datasets using vocabularies from the LOD cloud such as DBpedia, YAGO, Schema.org and UMBEL, and 2) the representation of the data in an intuitive home-grown Knowledge Representation Framework called MWF (Many Worlds on a Frame), a framework loosely modelled on Kripke Semantics. MWF allows for rich representation of data across two aspects, the type hierarchy (is-a) relationship and the containment hierarchy (is-in) relationship, supported by roles and associations, to transform the open datasets into a web of semantically interlinked themes and their associations.

Title: Understanding trust in mediation

Abstract: Intermediaries have always been a part of society. It was individuals who played the role of broker to orchestrate and facilitate transactions between various parties. Click here for more

Title: Narratives Plot Comparison

Abstract: Narratives are an extremely versatile way of telling imaginary or fictional and true or empirical incidents, whereas expositions are simple and concise documentation based on true and well-researched content. Writing narratives is not bound by any style; it is limited only by the author’s intention to entertain, experience and effort to compose. A similar message can be conveyed in varying grades of style and illustrative cases, and hence comparing two narratives and scoring their similarity is non-trivial. Narratives have two aspects: the flow of events, called the Fabula, and the expression style, called the Discourse. Both aspects affect the reading experience and the impact of the intention or message to be conveyed by the author. Our hypothesis is that two narratives can be compared by matching the verbs and nouns of events of each subject. Click here for more
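As a rough illustration of the hypothesis above, the sketch below scores two narratives by the overlap of their (verb, noun) events per subject. It is only one possible reading, not the method of the talk: the Jaccard overlap measure, the function names and the hand-written example stories are all assumptions, and the extraction of subjects, verbs and nouns from text is assumed to have been done already.

```python
def jaccard(a, b):
    """Overlap of two sets: |a & b| / |a | b| (1.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def narrative_similarity(n1, n2):
    """Average, over all subjects, of the overlap of their (verb, noun) events.

    Each narrative is a dict mapping a subject to a set of (verb, noun) pairs.
    This representation is a hypothetical simplification for illustration.
    """
    subjects = set(n1) | set(n2)
    if not subjects:
        return 1.0
    total = sum(jaccard(n1.get(s, set()), n2.get(s, set())) for s in subjects)
    return total / len(subjects)


# Hypothetical, hand-annotated events for two short stories.
story_a = {
    "hero": {("leave", "home"), ("fight", "dragon"), ("win", "treasure")},
    "dragon": {("guard", "treasure")},
}
story_b = {
    "hero": {("leave", "home"), ("fight", "dragon")},
    "dragon": {("guard", "treasure"), ("burn", "village")},
}

score = narrative_similarity(story_a, story_b)
```

A score of 1.0 means every subject has the same set of events in both stories. The sketch compares only the events themselves and ignores their order and the expression style.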

Title: Framework for Mediation Driven Learning

Abstract: Learning is a complex process in which the learner experiences permanent and lasting changes in knowledge, behaviour, or ways of processing the world. Every learner is unique and learns and perceives things differently, at a different pace. In a classroom environment, which is designed for an average student, the same content is delivered to all the students in the same way. There is a fundamental flaw in designing the curriculum in this way for an average student, since there are virtually no students who fit into this category of average [1]. Hence, there is a need to address the individuality of the student for effective learning. A learning theory called “Independent Learning” addresses this. Independent learning encourages and enables students to become self-directed in their learning experiences and to have more autonomy and control over their learning. In addition, it has been found that learning is very effective when there is collaboration with other learners. In our work, we propose the concept of “mediation driven learning”, which builds upon the theories of independent learning and collaborative learning and uses the power of the Web to mediate or facilitate learning. We create a framework for mediation driven learning in which we bring learners and tutors together on one platform and provide a mediation algorithm that finds an optimal matching between the learners and tutors for a particular learning concept. Click here for more
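The abstract does not describe the mediation algorithm itself. As a minimal sketch of what "an optimal matching between learners and tutors" can mean, the example below treats it as an assignment problem and solves it by brute force. The suitability scores, the function name and the one-tutor-per-learner constraint are assumptions made for illustration only.

```python
from itertools import permutations


def best_matching(scores):
    """Return (total, assignment) maximising the summed suitability.

    scores[i][j] is a hypothetical suitability of tutor j for learner i.
    assignment[i] is the tutor index given to learner i; each tutor is
    used at most once. Brute force, so only suitable for tiny inputs.
    """
    n_learners = len(scores)
    n_tutors = len(scores[0])
    best_total, best_assignment = float("-inf"), None
    for perm in permutations(range(n_tutors), n_learners):
        total = sum(scores[i][perm[i]] for i in range(n_learners))
        if total > best_total:
            best_total, best_assignment = total, perm
    return best_total, best_assignment


# Hypothetical scores for three learners (rows) and three tutors (columns).
scores = [
    [9, 8, 2],
    [8, 3, 1],
    [3, 6, 5],
]
total, assignment = best_matching(scores)
```

In this example, giving each learner their individually best available tutor in turn yields a total of 17, while the optimal matching pairs learner 0 with tutor 1 and learner 1 with tutor 0 for a total of 21. For larger groups, a polynomial-time method such as the Hungarian algorithm would replace the brute-force search.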

Title: Understanding the Marketplace of Opinions

Abstract: Our understanding of the web has been evolving from that of a passive repository to a participatory socio-cognitive space, where human beings are participants rather than users. More than affecting daily transactions, this space has had a huge impact on how thoughts are shaped at the individual level and also in a community. To interpret how society is transforming, it is very important to understand how the web is impacting social cognition… Click here for more

Title: Semantic Summarization from User Generated Text Reports

Abstract: Text summarization is an active research area in the Natural Language Processing research community. The community has developed diverse paradigms for generating summaries from long documents, although there has been minimal effort on creating summaries from large collections of short and noisy documents. Here, short documents refer to user generated social media activity messages or any short reports generated as part of a closed domain. The proposed research aims to (semi-)automate the process of summary generation from a given set of short documents, with more emphasis on the semantics of the document content. The research starts with completely unsupervised techniques. The entire document collection is represented as an undirected graph of key phrases, and then graph clustering, graph centrality based measures and Markov Random Field based factor computation techniques are used to glean the important information. Further, simple natural language generation techniques and natural language specific heuristics are applied to generate the candidate sentences for the final summary.
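The graph representation mentioned in the abstract can be sketched as follows, under simplifying assumptions: key phrases are taken as already extracted, an edge links two phrases that occur in the same short document, and importance is measured by plain degree centrality. The function names and example reports are hypothetical, and the clustering and Markov Random Field steps are not shown.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(docs):
    """Build an undirected graph as an adjacency dict of sets.

    Each document is a list of key phrases; two phrases are linked
    when they appear together in at least one document.
    """
    graph = defaultdict(set)
    for phrases in docs:
        for a, b in combinations(sorted(set(phrases)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def degree_centrality(graph):
    """Fraction of the other nodes each node is directly linked to."""
    denom = max(len(graph) - 1, 1)
    return {node: len(nbrs) / denom for node, nbrs in graph.items()}


# Hypothetical key phrases from three short user reports.
docs = [
    ["road", "flooding", "traffic"],
    ["flooding", "power cut"],
    ["traffic", "flooding"],
]
graph = cooccurrence_graph(docs)
centrality = degree_centrality(graph)
top_phrase = max(centrality, key=centrality.get)
```

Here "flooding" is linked to every other phrase and so has the highest centrality, which marks it as a candidate for inclusion in the summary.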

Open Discussion:

During the open discussion, all the participants will briefly share their individual views and comments on whether research pursuits have changed their approach in life towards achieving their passions or goals, and if yes, share their experiences.

 

Talk on “Web Annotation, Community Narratives and Familiarizing Stories” by Dr. Dinesh from Servelots

The speaker will visit the idea of the Renarration Web with examples from the Bio Diversity Protocol and the Intangible Heritage of Hampi. He will then look at the ongoing Web Annotation standards work at the W3C Web Annotation Working Group. We will then spend some time discussing how the work of the Web Sciences Lab can help in finding Similar Stories.

Date: 24th August 2016
Time: 3:00 PM
Venue: IIIT-Bangalore

About the speaker: Dinesh is the technical director at Janastu (janastu.org, 2002) and Servelots (servelots.com, 1999) in Bangalore, India, which have been providing free and open source (FOSS) solutions and support, including R&D, to SMEs and NPOs/NGOs. They have introduced the concept of the SWeeT Web architecture and used it with platforms such as “re-narration web” to address the contextualisation needs of web content, in particular for low-literate web users who need a multi-lingual, re-narration-capable Web. He is a member of the W3C Working Group on Web Annotations as an Invited Expert.

Their work in recent years can be captured by these subject tags:
web annotations, social semantic web, location intelligence interpretation, 3d augmenting real spaces, re-narration, community radio, wifi-mesh and anthillhacks

Click here for more information about the speaker

Activity log 2016

21 December 2016. Sumant Kulkarni successfully defended his PhD thesis entitled “Mining Labeled Semantics across Terms and Documents in User Generated Text Corpora.”

19 December 2016. WSL. “Web Science Lab Workshop” at IIIT Bangalore.

11–17 December 2016. Srinath Srinivasa. Visited Gooru HQ at Redwood City, California, as part of the continuing collaboration initiative.

29 November 2016. Web Science Lab. WebScience@10: celebrating 10 years of Web Science. Events were run in Berlin, London and Chicago and on the WebScience@10 ‘TV channel’, and were followed on the hashtag #websci10. The event showed the world a day in the life of Web Science through videos from different countries; IIIT Bangalore was featured among them. The video is available at https://youtu.be/wz48FT4hcrI

21 November 2016. Srinath Srinivasa. Participated as an external member of the thesis exam committee of an MS thesis defence at the Supercomputer Education and Research Centre, IISc, Bangalore.

9 November 2016. Praseeda. Presented a talk on “Trust, CBAC and Blockchains” at Web Science Lab, IIIT Bangalore.

26 October 2016. Asha Subramanian. Presented her paper at the 15th International Conference on Ontologies, Databases and Applications of Semantics (ODBASE 2016), Rhodes, Greece.
Paper details: Subramanian, Asha, Ved Kurien Mathai, Vikkurthi Manikanta, Janaki Vinesh Joshi, and Srinath Srinivasa. “Semantic Integration of Open-Data Tables.” In OTM Confederated International Conferences “On the Move to Meaningful Internet Systems”, pp. 589-607. Springer International Publishing, 2016.

10–14 October 2016. Srinath Srinivasa. Visited the Gooru HQ at Redwood City, California, as part of a collaboration initiative.

16 September 2016. Prasad (Pram) Ram, founder and CEO of Gooru. Presented a talk on “Technology for Social Justice” at IIIT Bangalore.

5–9 September 2016. Raksha S Patel, Asha Subramanian. Attended the 42nd International Conference on Very Large Databases (VLDB 2016), New Delhi, India.

24 August 2016. T B Dinesh, Technical Director, Servelots and Janastu. Presented a talk on “Web Annotation, Community Narratives and Familiarizing Stories” at Web Science Lab, IIIT Bangalore.

10 August 2016. Srinath Srinivasa. Participated as a selection committee member for selection of proposals from a call by NWO-DST, at SERC, IISc, Bangalore.

15 July 2016. Nirmal Kumar Sivaraman and Srinath Srinivasa. Presented a talk entitled, “Modeling Online Groups” and presented an introduction to Web Science, at hasgeek.in monthly meetings, Bangalore.

15 June 2016. Anoop Kumar Pandey. Successfully defended MS thesis entitled, “Entity Resolution using Co-occurrence Graph and Continuous Learning.”

7 June 2016. Nirmal Kumar Sivaraman. Successfully defended MS thesis entitled, “A Cognitive Model of Groups and A Computational Model to Analyze Online Groups.”

31 May 2016. Chaitali Diwan. Successfully defended MTech thesis entitled, “Autonomous Spectrum Assignment of Whitespace Devices.”

29 Apr 2016. WSL members interacted with Infosys CorpTech group on research opportunities in Social Network Analysis and Web Science.

1 Apr 2016. Aastha Madaan. Rejoined WSL at IIITB after a six-month research exchange at the Web Science Trust, University of Southampton.

12 Mar 2016. Tahir Dar. Presented a demonstration paper entitled “UCliDSS: An Unsupervised Clinical Decision Support System for text” at the International Conference on Management of Data (COMAD2016), Persistent Systems, Hinjewadi, Pune.

24 Feb 2016. Srinath Srinivasa. Delivered a public lecture entitled, “The Web and the Mind” as part of the Bangalore Science Forum, a 53-year-old tradition featuring public science lectures, at National College, Basavanagudi, Bangalore.

21 Jan 2016. Srinath Srinivasa. Delivered an invited talk entitled, “Towards Mathematical Foundations for Social Machines” at the Ramanujan Mathematics for IT conference at IIIT Bangalore.

20 Jan 2016. Srinath Srinivasa. Participated as an invited guest at the launch of Carl Zeiss R&D Center at Bommasandra, Bangalore, India.

6 Jan 2016. Sumant Kulkarni. Presented his Open Seminar entitled “Assorted Algorithms for Semantics Extraction Based on a Co-occurrence Graph Corpus Model” before thesis submission.

Activity log 2015

24 Nov 2015. Sweety Vinod Agrawal. Successfully defended MS thesis entitled “Integrity Management For Open-ended Dissemination Of Utilitarian Data”

6 Nov 2015. Srinath Srinivasa. Delivered an invited talk entitled, “Big Social Machines: Architecture and Challenges” at Amrita University, Bangalore Campus, Bangalore, India.

29 Oct 2015. WSL members hosted a talk entitled “Social Sensing” by Prof. Nishanth Sastry from King’s College, London, UK.

16–25 Oct 2015. Asha Subramanian. Visiting scholar at the Ohio Centre of Excellence in Knowledge Enabled Computing, Wright State University, Dayton, Ohio, USA.

11–15 Oct 2015. Asha Subramanian. Attended the International Semantic Web Conference (ISWC) at Bethlehem, Pennsylvania, USA. She presented her paper entitled, “Inferencing in the Large: Characterizing Semantic Integration of Open Tabular Data” at the conference.

9 Oct 2015 — 8 Jan 2016. Aastha Madaan. Visiting scholar at the Web Science Trust at University of Southampton, UK.

5 Oct 2015. Aastha Madaan. Delivered an invited talk entitled “The Promise of Web Science” at Delhi University, New Delhi.

30 September 2015. WSL members launched the Web Science India Endowment to promote the discipline of Web Science in India.

7 Aug 2015. Asha Subramanian. Won a travel grant of US$1570 from the conference organizers for attending the International Semantic Web Conference (ISWC 2015) to present her paper.

31 July 2015. Srinath Srinivasa, Aastha Madaan. Visited City University London, UK, as part of the TRUMP and REACH project collaboration.

27 July 2015. Srinath Srinivasa. Participated in a discussion at the City Council of Aberdeen on a vision for Smart Cities.

27 — 30 July 2015. Srinath Srinivasa, Aastha Madaan. Visited the Digital Economy Hub at the University of Aberdeen, Scotland, UK, as part of the TRUMP project collaboration.

23 July 2015. Srinivasan Muthuswamy. Successfully defended MS thesis entitled “Computational Models to Identify Celebrities from Social Media”

19 July 2015. Srinath Srinivasa. Participated in AntHill Summer School and Hack Days 2015. Devarayanadurga Hills, Tumkur, India.

13 — 14 July 2015. Asha Subramanian. Participated in the 5th International Conference on Web Intelligence, Mining and Semantics (WIMS 2015), Limassol, Cyprus.

28 June — 1 July 2015. Srinath Srinivasa. Participated in the ACM Web Science conference, at Keble College, Oxford, United Kingdom.

22 — 24 June 2015. WSL members. Participated in the “PivotPoints” workshop hosted by dot.rural in collaboration with the University of Aberdeen on the theme of participatory frameworks for Smart Cities.

20 June 2015. Srinath Srinivasa. Delivered an invited talk entitled, “Abstractions, Expressions and Online Collectives,” as part of ACM Goa invited talk series.

19 June 2015. Srinath Srinivasa. Visited University of Goa as part of an Academic Audit team to review their MTech and MCA programs.

12 June 2015. Chinmay Jog. Successfully defended his MS thesis entitled, “Distributing a Trust Framework for Utilitarian Data Exchange in Inter-Organizational Collaborations.”

3 June 2015. Srinath Srinivasa. Visited the International School of Information Management (ISIM), University of Mysore, as part of their PhD colloquium.

2 June 2015. Srinath Srinivasa. Participated in a focused group discussion on Gamification and Organizational Behaviour at Trancendix Pvt. Ltd., Koramangala, Bangalore.

1 June 2015. Srinath Srinivasa. Visited the Air Force Command and Training Center in Hebbal, Bangalore, on invitation, to introduce WSL, CDS and IIITB.

25-26 May 2015. Srinath Srinivasa. Participated in the first meeting of all work-package heads of the REACH project, IIT Mumbai, India.

8 May 2015. Srinath Srinivasa, Jayati Deshmukh. Presented WSL at the Complex Systems Lab (SosLab) at Bogazici University, Istanbul, Turkey.

4-8 May 2015. Srinath Srinivasa, Jayati Deshmukh. Participated in the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 2015), Istanbul, Turkey.

15 April 2015. Srinath Srinivasa. Participated in a workshop on “How can European Institutions and Industry engage with India’s new Research Universities?” organized by the EU STI (Science and Technology Initiative) Platform in India, New Delhi.

11 March 2015. Srinath Srinivasa. Participated in the “Graph Day” Workshop at IBM Research, to bring together researchers working on graph data management.

18 Feb 2015. WSL organized a hackathon in conjunction with the launch event of the Web Observatory.

17 Feb 2015. WSL members organized the launch event of the first Indian node in the global Web Observatory grid.

5 Feb 2015. Srinath Srinivasa. Participated in the online kickoff meeting of an EU-India project named “REACH: Rural Broadband Internet Access using Cooperative Mesh Networking in White Space Spectrum.”

Activity log 2014

  • WSL members participated in the CDS six-monthly workshop at IIIT-B. November 26 2014.
  • Sweety Agrawal. Presented a paper entitled “Integrity Management in a Trusted Utilitarian Data Exchange Framework” at ODBASE 2014, Amantea, Italy, October 29 2014.
  • Srinath Srinivasa. Presented WSL and CDS at Ericsson Research, Bangalore. Sep 23 2014.
  • Sumant Kulkarni. Successfully completed the comprehensive exam and thesis proposal seminar, qualifying him for candidature towards a PhD. September 18 2014.
  • Sumant Kulkarni, Srinath Srinivasa. Attended review meeting for the project on Intelligent Workflow Management. EMC Bangalore HQ, Mahadevapura. Aug 18 2014.
  • Srinath Srinivasa. Presented WSL and CDS at KayBus Bangalore office. August 14 2014.
  • Srinath Srinivasa. Delivered an invited talk entitled Towards a “Mindful” Web at Trinity College, Dublin, Ireland. July 28 2014.
  • Srinath Srinivasa. Invited by the School of Computer Science and Statistics and the School of Digital Humanities at Trinity College, Dublin, to explore research collaborations. July 27–31 2014.
  • Aastha Madaan joined WSL as post doctoral research faculty on 23 July 2014.
  • Srinath Srinivasa. Attended the second NRDMS review meeting for the Sandesh project at NSDI office, New Delhi. 22 July 2014.
  • Jayati Deshmukh. Successfully defended MTech thesis entitled, “Evolution of Cooperation with Entrenchment Effects.” 16 June 2014.
  • OSL members hosted CrossCurrents, an Indo-UK workshop on the use of digital technology for culture preservation. May 12-14 2014. Photos from the workshop.
  • Srinath Srinivasa. Presented a paper at IIWeb 2014 at Chicago, Illinois, USA. March 31 2014.
  • Chinmay Jog, Sweety Agrawal. Attended ACM CoDS 2014 at New Delhi to present a paper. March 21-23 2014.
  • OSL members hosted a talk, “Towards Intelligent Information Infrastructure” by Prof. Pete Edwards, University of Aberdeen, Scotland, 6 Feb 2014.
  • Srinath Srinivasa. Delivered an invited talk entitled: Utilitarian Aggregation of Open Data at the Social Media Workshop, organized by the British High Commission at IIIT Bangalore. Feb 5-6, 2014.
  • Nisha Bhasia, Tuli Kundu, Divya Maharshi. Organizational volunteers for Social Media Workshop, organized by the British High Commission at IIIT Bangalore. Feb 5-6, 2014.
  • OSL members hosted a talk, “From Big Data to Smart Data” by Prof. Amit Sheth, Wright State University. 6 Jan 2014.
  • Srinath Srinivasa. Program Committee member for the 10th ICDE Workshop on Information Integration on the Web (IIWeb), Chicago, Illinois, USA, March 2014.
  • Srinath Srinivasa. Program Committee member for XRCI 2014, Xerox Research Center India Research Symposium, Bangalore, India, March 2014.
  • Srinath Srinivasa. PC Member and Publicity Chair of 7th India Software Engineering Conference (ISEC 2014), Chennai, India, February 2014.