Following a marathon session demoing Yahoo Pipes yesterday (the slides I didn’t really use but pretty much covered are available here) I thought I’d start to have a look at what would be involved in generating a Pipes2PHP, Pipes2Py, or Pipes2JS conversion tool as I’ve alluded to before (What Happens If Yahoo! Pipes Dies?)… | source : blog.ouseful.info
From my own perspective, I would also add that should consultants like Spikes Cavell create derived data works from open public data, there should be some transparency in declaring how the derived work was created (see for example: So Where Do the Numbers in Government Reports Come From? and Data is not Binary). Another example of how once open data is becoming “closed” behind a paywall comes from Paul Geraghty (“Closed Data Now” SOCITM does a “Times”): | source : blog.ouseful.info
I haven’t got very far with this, but I have found a few starting points (I think) as who owes whom in the most general of terms, so I thought I’d just link to them here in case for the sake of convenience and rediscoverability.
My starting point: a BBC report on Who owns the UK’s debt?
From there, I ended up finding loosely related data at:
National Statistics – UK Accounts
UK Debt Management Office Quarterly Review
Bank for International Settlements | source : blog.ouseful.info
With the publication via WriteToReply of a comment soliciting speech from Ed Vaizey on public libraries yesterday (Remodelling Libraries, [press release]), I started thinking again how we might support a highlighting approach in WriteToReply. In a post earlier this year (Skim.it – Like Digress.it, But With Ratings Rather than Comments?), I briefly considered how we might publish documents in a paragraph atomising way (as we do using the digress.it WordPress theme) and then allow readers to add ratings (rather than comments) to the document at a paragraph level. Something like this, maybe: | source : blog.ouseful.info
So what can we do with that CSV file? How about turning it into a database, simply by uploading it to Google docs? I had a quick play pruning the code on my Guardian Datastore explorer, and put together this clunky query tool that lets you explore the COI website data as if it was a database. | source : blog.ouseful.info
- student satisfaction data is available from the Direct Gov Unistats service (OU data [XLS]; general download list); - funding data about current grants is provided on research council sites. The EPSRC, for example, provide a way of accessing spreadsheets for funding received by various OU departments: OU Awards from the EPSRC (see more generally the full list of funded organisations; (if you know similar ways of getting similar data from other research councils, or funders such as JISC, please post a link in the comments to this post:-) - financial data, where already published; the OU’s public financial statements can be found on the Freedom of Information minisite, for example (OU FOI: financial statements); - organisational data, where already published. Again the OU seems to be ahead of the game on this one via the FOI site: OU FOI: organisational structure; - RAE | source : blog.ouseful.info
So what else came to mind? Gaining access to the electoral roll allows (I think) the domicile of voters to be identified by poll number, at least to the street level; which means that it would be possible to generate a crude heat map, in real time, of where votes had been cast from.Throughts on Telling…. | source : blog.ouseful.info
It’s not just individual data that we have access to, of course – there’s also mass action data, like some of the webrhythms I’ve collected on Trendspotting, or data collected from living buildings that you can peek at on Pachube – but I’ve written more than enough for now. Hopefully there are one or two ideas in here that act as a starting point, at least, for a data driven storytelling exercise… As ever, comments are appreciated. If I get a chance in the next week, I’ll try to refine this post to make it a little more assignment like. And please, if you’ve ever run a data driven digital storytelling activity, I’d love to hear about it:-) | source : ouseful.wordpress.com
Anyway – this is my starter for ten on how to make live datastore data available to the masses. It’ll be interesting to see whether this approach (or one like it) is used in favour of getting temps to write SPARQL queries and RDF parsers… The obvious problem is that my approach can lead to an explosion in the number of formulae and parameters you need to learn; the upside is that I think these could be quite easily documented in a matrix/linked formulae chart. The approach also scales to pulling in data from CSV stores and other online spreadsheets, using spreadsheets as a database via the =QUERY() formula (e.g. Using Google Spreadsheets Like a Database – The QUERY Formula), and so on. There might also be a market for selling prepackaged or custom formulae as script bundles via a script store within a larger Google Apps App store… | source : ouseful.wordpress.com
That is, how about defining a simple spreadsheet function that lets us look up a particular data value for a particular subject and a particular institution? How about being able to write a formula like:
=gds_education_unitable(“elecEng”,”Leeds”,”NSSTeachingPerCent”)
and get the national student satisfaction survey teaching satisfaction result back from students studying Electrical/Electronic Engineering at Leeds University? | source : ouseful.wordpress.com