Sharing and learning

Phil Barker's work

Menu

Skip to content
  • Phil Barker
  • work blog
  • publications
  • contact

New publication analysing LRMI metadata on the web

24 April 2017LRMIlrmi, metadata, schema.org, semantic technologiesPhil Barker

I have a new publication: “Analysing and Improving Embedded Markup of Learning Resources on the Web,” which Stefan Dietze and Davide Taibi have presented at the 2017 International World Wide Web Conference in Perth Australia. I played a minor role in the “analysing” part of this work, the heavy lifting was done by my co-authors. They analysed data from the Common Crawl to identify sites that were using LRMI terms in their schema.org markup.  The analysis provides answers to important questions such as: who is using LRMI metadata and which terms are they using? How many resources have been marked up with LRMI metadata? Are the numbers of users growing? What mistakes are being made in implementing LRMI?

We also see how once a term is in schema.org it is interpreted in ways that may not have been anticipated by those who created it, with any implicit assumptions held within a community of practice being ignored. Thus terms that have a specific meaning within the learning, education and training field are construed in their more generic meaning. The result of this is that some LRMI terms are used for resources that we in LRMI did not have in mind when creating them. Consequently the presence of LRMI metadata on a web resource may not be a good indicator that a resource is intended for education–this is true of some properties more than others. To avoid this when making additions to schema.org (if you see it as a problem), the domain to which a term applies should be in the term name.

A second observation that seems important to me is the strong inverse relationship between sophisticated data structures and amount of usage. Yes, I’m talking about the AlignmentObject: potentially very expressive, but either it solves a problem no one has (which I don’t think is the case) or it is so complex that few people understand it well enough to use it. In general, properties with simple text/literal values get much more use than entity-valued properties.

Publication Details

The official reference is: Stefan Dietze, Davide Taibi, Ran Yu, Phil Barker, and Mathieu d’Aquin. 2017. Analysing and Improving Embedded Markup of Learning Resources on the Web. In Proceedings of the 26th International Conference on World Wide Web Companion (WWW ’17 Companion). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 283-292. DOI: 10.1145/3041021.3054160

It is licensed CC:BY, but the ACM version seems to be behind a paywall, so here is a local post-publication copy (pdf).

Here’s the abstract.

Web-scale reuse and interoperability of learning resources have been major concerns for the technology-enhanced learning community. While work in this area traditionally focused on learning resource metadata, provided through learning resource repositories, the recent emergence of structured entity markup on the Web through standards such as RDFa and Microdata and initiatives such as schema.org, has provided new forms of entity-centric knowledge, which is so far under-investigated and hardly exploited. The Learning Resource Metadata Initiative (LRMI) provides a vocabulary for annotating learning resources through schema.org terms. Although recent studies have shown markup adoption by approximately 30% of all Web pages, understanding of the scope, distribution and quality of learning resources markup is limited. We provide the first public corpus of LRMI extracted from a representative Web crawl together with an analysis of LRMI adoption on the Web, with the goal to inform data consumers as well as future vocabulary refinements through a thorough understanding of the use as well as misuse of LRMI vocabulary terms. While errors and schema misuse are frequent, we also discuss a set of simple heuristics which significantly improve the accuracy of markup, a prerequisite for reusing learning resource metadata sourced from markup.

 

Related

Post navigation

← Thoughts on Support for Technology Enhanced Learning in HE CMALT – Another open portfolio →

About me

I am  Phil Barker. I offer consultancy in the area of technology to enhance learning, and information systems for education. I am particularly interested in supporting the sharing, discovery and selection of appropriate learning resources. Much of the work I do is with Cetis LLP, a cooperative consultancy for innovation in educational technology. more…
#LRMI #cetis #teaching #other

Recent Posts

  • Fruitful RDF vocabularies are like cherries not bananas
  • What am I doing here? 4. Learning Resource Metadata
  • What am I doing here? 3. Tabular Application Profiles
  • What am I doing here? 2. Open Competencies Network
  • What am I doing here? 1. Credential Engine

Topics

  • annotation
  • assessment
  • books
  • ClassicPress
  • competencies
  • course data
  • course description
  • Credential Engine
  • critical thinking
  • DCTAP
  • delores
  • Design for Online Learning
  • DublinCore
  • eBooks
  • epub
  • IEEELOM
  • ISOMLR
  • JobPosting
  • JSON LD
  • legalities
  • lrmi
  • metadata
  • OAIORE
  • oer
  • online exams
  • open education
  • practice
  • PressBooks
  • python
  • random musings
  • rasppi
  • RDF
  • repositories
  • resource description
  • resource discovery
  • schema.org
  • semantic technologies
  • SHACL
  • subject coding
  • sustainability
  • talent pipeline
  • tap2shacl
  • What am I doing
  • wikidata
  • WordPress

Me, elsewhere

  • Mastodon
  • Twitter
  • LinkedIn
  • ORCID
  • Flickr
  • Wikipedia, Wikisource, Wikidata
  • RSS - Posts
  • RSS - Comments

Licence

Creative Commons License
Sharing and Learning - Phil Barker's work. by Phil Barker is licensed under a Creative Commons Attribution 4.0 International License .
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
Proudly powered by WordPress