Last modified: 2013-09-27 00:02:04 UTC
I have just attended Ryan Faulkner's talk in the Python devroom at FOSDEM 2013. He showed some metrics about user engagement on Wikipedia, and the general framework used to gather those metrics from the WP databases. It would be cool if that information could be made public and available to third parties, with proper anonymization, etc. I am not sure if this is the category where I should file this bug report. Thanks!
Not sure either; filing it under E3 for a start so the team is aware of it.
If there are datasets specific to Ryan's presentation, he is the best person to answer your question. The general answer is that we are very happy to publish datasets when we can, once they have been anonymized and approved by Legal. In the meantime, there are quite a few datasets hosted at http://datahub.io/en/dataset?tags=wikipedia&q=Wikipedia
Hey Israel, I'm on IRC now. The slides that I think you're referring to were our results on the Post Edit feedback experiment: http://meta.wikimedia.org/wiki/Research:Post-edit_feedback/PEF-1 The API data, on the other hand, is not yet public, only because the service is not ready yet. We are thinking about how we want to manage access to it, but it's becoming obvious that we'll need to do this sooner rather than later. I can keep you in the loop on the latest developments.
Thanks, Steven, for the info. Ryan, I am sorry, I was offline travelling back to Madrid :). I would highly appreciate it if you could keep me updated on any progress on the availability of the data and/or the API. Thanks!
[Resetting assignee as Ryan left WMF.]
Wikimetrics (metrics.wmflabs.org) is now available for public use; it is the API that Ryan was referring to in #3.
Marking fixed. If there are issues with Wikimetrics, they can be filed at https://bugzilla.wikimedia.org/enter_bug.cgi?product=Analytics&component=Wikimetrics . People can also discuss these issues on the https://lists.wikimedia.org/mailman/listinfo/analytics list.