Tag Archives: Andrew Gray

Matching ORCID and other authority control identifiers in Wikidata BEACON

Further to my previous post on finding ORCID identifiers used in Wikidata & Wikipedia, Magnus Manske has released another useful gadget. “Wikidata BEACON” is a new tool that matches individuals’ (or other subjects’) entries in two different authority control systems. One of these, of course, can be ORCID.

For example, to find people who are listed in Wikidata with an ORCID identifier recorded there, and who also have, say, a VIAF identifier or a MusicBrainz artist profile, choose one of those properties, then the other, from the two drop-down menus, then select “Get BEACON data”.

Screenshot of Beacon, with ORCID and VIAF identifiers selected.

The result is returned as a pipe (“|”)-separated list, with the middle of the three columns being the Wikidata ID (in the format “Qnnn”) of the item concerned. (For the technically inclined, the format is BEACON, used to enable third-party data re-users to automate the conversion of identifier values into web links. You can see the part-URLs, to which the values must be appended, at the head of the results page, labelled #PREFIX and #TARGET.)

So, Bill Thompson, for instance, appears as:

4426461|Q4911143|0000-0003-4402-5296

showing, respectively, his VIAF (4426461), Wikidata (Q4911143), and ORCID (0000-0003-4402-5296) identifiers.
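For anyone who wants to process such a list in a script, here is a minimal Python sketch of splitting one line into its three columns. The names `BeaconRow`, `left_id`, `right_id` and `parse_beacon_line` are my own, chosen for illustration, and the sketch assumes that header lines (such as #PREFIX and #TARGET) begin with “#” and that every data line has exactly three pipe-separated fields.

```python
from typing import NamedTuple, Optional


class BeaconRow(NamedTuple):
    left_id: str      # first column (the VIAF identifier in the example above)
    wikidata_id: str  # middle column, in the format "Qnnn"
    right_id: str     # third column (the ORCID identifier in the example above)


def parse_beacon_line(line: str) -> Optional[BeaconRow]:
    """Split one line of results; return None for blank lines and '#' header lines."""
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    parts = line.split("|")
    if len(parts) != 3:
        # Assumption: every data line has exactly three fields.
        raise ValueError(f"expected 3 pipe-separated fields, got {len(parts)}: {line!r}")
    return BeaconRow(*parts)


row = parse_beacon_line("4426461|Q4911143|0000-0003-4402-5296")
print(row.wikidata_id)  # Q4911143
```

Returning `None` for header lines lets you loop over a whole results page and simply skip anything that is not a data row.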

A query can also be made in the form of a URL, for example this one:

https://tools.wmflabs.org/wikidata-todo/beacon.php?prop=496&source=214

in which “496” comes from P496, Wikidata’s property code for an ORCID identifier, and “214” from P214, its code for a VIAF identifier.

Another example is:

https://tools.wmflabs.org/wikidata-todo/beacon.php?prop=661&source=373

which shows the identifiers of chemicals in the Royal Society of Chemistry’s ChemSpider database and the matching Wikimedia Commons categories.

Similarly:

https://tools.wmflabs.org/wikidata-todo/beacon.php?prop=827&source=345

matches the BBC and Internet Movie Database (IMDb) identifiers of television programmes.
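The three query URLs above all follow the same pattern, so they can be generated rather than typed. This is only a sketch: the function name `beacon_url` is hypothetical, and the only thing it assumes about the tool is the `prop` and `source` parameters visible in the example URLs.

```python
from urllib.parse import urlencode

# Base address taken from the example URLs above.
BEACON_ENDPOINT = "https://tools.wmflabs.org/wikidata-todo/beacon.php"


def beacon_url(prop: int, source: int) -> str:
    """Build a query URL from two Wikidata property numbers (without the 'P')."""
    query = urlencode({"prop": prop, "source": source})
    return f"{BEACON_ENDPOINT}?{query}"


print(beacon_url(496, 214))
# https://tools.wmflabs.org/wikidata-todo/beacon.php?prop=496&source=214
```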

Beacon is a good illustration of the way in which Wikidata has become a hub linking disparate datasets about people and other things, as described by Andrew Gray in “Wikidata identifiers and the ODNB – where next?”.

Finding ORCID identifiers used in Wikidata & Wikipedia

As you may know, I was appointed Wikipedian in Residence at ORCID in June this year.

I’ve previously written a guide to using ORCID identifiers in Wikipedia.

A new tool, ‘Resolver’, by my friend Magnus Manske, who has awesome coding skills and is generous with them, allows you to find whether a particular ORCID identifier is used in Wikidata (and thus in one or more Wikipedia projects, in any language).

Enter the property “P496” (the Wikidata property for an ORCID ID) and the ORCID ID value (the short form, e.g. “0000-0003-4402-5296”, not the full identifier, “http://orcid.org/0000-0003-4402-5296”) into Resolver, and the relevant Wikidata page, if any, is returned. At the foot of that page are links to Wikipedia articles (again, if any).

An ORCID identifier query in Resolver

Alternatively, you may compile a URL in the format https://tools.wmflabs.org/wikidata-todo/resolver.php?prop=P496&value=0000-0003-4402-5296 – which will automagically redirect.

Note that this works for articles, but not identifiers used on Wikipedia editors’ user pages, which have no Wikidata equivalent.
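If you have a batch of ORCID identifiers to check, the URL format described above can be assembled in a few lines of Python. Treat this as a sketch: `short_orcid` and `resolver_url` are names I have made up, and the handling of an “https://” prefix is my assumption (only the “http://” form is mentioned above). The first function trims a full identifier down to the short form that Resolver expects.

```python
from urllib.parse import urlencode

# Base address taken from the example URL above.
RESOLVER_ENDPOINT = "https://tools.wmflabs.org/wikidata-todo/resolver.php"
# Assumption: the full form may appear with either scheme.
ORCID_URL_PREFIXES = ("http://orcid.org/", "https://orcid.org/")


def short_orcid(value: str) -> str:
    """Return the short form of an ORCID identifier, dropping any URL prefix."""
    value = value.strip()
    for prefix in ORCID_URL_PREFIXES:
        if value.startswith(prefix):
            return value[len(prefix):]
    return value


def resolver_url(prop: str, value: str) -> str:
    """Build a Resolver URL for a property such as 'P496' and an identifier value."""
    query = urlencode({"prop": prop, "value": value})
    return f"{RESOLVER_ENDPOINT}?{query}"


print(resolver_url("P496", short_orcid("http://orcid.org/0000-0003-4402-5296")))
# https://tools.wmflabs.org/wikidata-todo/resolver.php?prop=P496&value=0000-0003-4402-5296
```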

Resolver works with other unique identifiers, too, such as VIAF, or BBC Your Paintings artist identifiers, and many more. If you want to know why that’s important, see Andrew Gray’s post, “Wikidata identifiers and the ODNB – where next?”. Resolver is not just for people, though. It will also resolve unique identifiers for other types of subjects, such as BBC programme IDs or ChemSpider IDs for chemical compounds.

Requesting open-licensed, open-format recordings of the voices of Wikipedia subjects for Wikimedia Commons

The Idea

A little while ago, my friend and fellow Wikipedia editor Andrew Gray (he’s the Wikipedian in Residence at the British Library!) mentioned to me that Wikipedia could do with more sound files. We discussed recordings of music, industrial and everyday sounds (what does a printing press sound like? Or a Volkswagen Beetle? What do different kinds of breakfast cereal sound like when milk is added?), as well as people’s voices, so that we have a record of what they sound like.

A giant ear-trumpet

Beethoven’s Trumpet (With Ear) By John Baldessari, at the Saatchi Gallery.
Photo by Jim Linwood, on Flickr, CC-BY

In the spirit of Wikipedia, all such recordings would be open-licensed, to allow others to use them, freely. They can then be uploaded to Wikimedia Commons (the media repository for Wikipedia and its related projects) in an open format, namely Ogg Vorbis (that’s like mp3, but without patent encumbrances).

So I’m working on a new initiative to provide short (under ten-second) open-licensed audio clips of examples of the speaking voices of notable people (i.e. people who have Wikipedia articles about them).

What To Do

As a pilot, I’m asking some of my (cough) celebrity friends to kindly record the following, or a variation of their choice, with no background noise:

Hello, my name is [name]. I was born in [place] and I have been [job or position] since [year]

(but without mentioning Wikipedia!) They can do that in a quiet room, with a modern mobile phone or a computer.

[Stop Press: See update 4, below, for update regarding use of “Vocaroo”, to avoid this step]

Once they’ve done that, they can convert the file to Ogg Vorbis using this free tool and then upload it to Wikimedia Commons under an open licence with no “non-commercial (NC)” or “no derivatives (ND)” restrictions (e.g. CC-By or CC-By-SA), and add the category “Voice intro project”.

If that’s too much fuss, they can e-mail it, or its URL, to me (andy@pigsonthewing.org.uk), in a common file format such as .mp3 or .wav, stating that it’s under one of those licences, and CC the mail to permissions-en@wikimedia.org to formally record the open licence. Then I or other Wikipedia editors will make the conversion.

Alternatively, perhaps, they can point to a suitable, open-licensed, example of their speaking voice, which is already online.

Anyone Can Help

If you’re not the subject of a Wikipedia article, you can still help, by recording and uploading to Wikimedia Commons audio files, as described above, of machinery or everyday activities and occurrences.

Updates

  1. A couple of Wikipedia article subjects have asked why they would do this. In short, so that there is a public — and freely reusable — record of what they sound like, for current and future generations. And so that we know how they pronounce their names.
  2. The uploaded files are now gathered in a Wikimedia Commons category. Thank you to the early contributors.
  3. I’ve been asked about multi-lingual recordings. The best thing would be separate files, one in each language, please.
  4. If you have a microphone on your computer (doesn’t work on iPhone/iPad), it’s possible to record directly into the Vocaroo website, and just email or tweet me a link. But you still need to agree to an open licence!