Wikipedia:WikiProject Newspapers/Wikidata: Difference between revisions
→Query retrieval: add mapping example |
→Query retrieval and maps: add alternate map method |
||
Line 49: | Line 49: | ||
You can customize the queries above, or make your own. A [[:d:Wikidata:SPARQL_tutorial|tutorial]] and [[:d:Wikidata:SPARQL_query_service/queries/examples|examples]] are available to kick you off. |
You can customize the queries above, or make your own. A [[:d:Wikidata:SPARQL_tutorial|tutorial]] and [[:d:Wikidata:SPARQL_query_service/queries/examples|examples]] are available to kick you off. |
||
You can also generate a map of all newspapers in a given Category, if the newspapers all have coordinates in the articles. See the following example for Newspapers published in Minnesota. |
You can also generate a map of all newspapers in a given Category, if the newspapers all have coordinates in the articles. See the following example for Newspapers published in Minnesota: '''<nowiki>{{GeoGroupTemplate|article=Category:Newspapers published in Minnesota}}</nowiki>''' This will generate the box at the right. Clicking on the [[OpenStreetMap]] link in the box will bring up the map. Substitute the name of any Category you want to use. |
||
<nowiki>{{GeoGroupTemplate|article=Category:Newspapers published in Minnesota}}</nowiki> |
|||
{{GeoGroupTemplate|article=Category:Newspapers published in Minnesota}} |
{{GeoGroupTemplate|article=Category:Newspapers published in Minnesota}} |
Revision as of 22:45, 18 February 2020
About | Talk | Goals | Team | Tutorials | Data | Projects | Reviews & Alerts | Research |
Wikidata is a sister site to Wikipedia; it is a hybrid between a wiki and a database, so it's much more structured than Wikipedia. Each item is essentially a data entry, with links to other data entries; so the item for the New York Times will have elements like "instance of ... newspaper" and "located in ... New York, New York". (Sample Wikidata item for New York Times - this one has a ton of info, a small local paper might only have 3 or 4 statements.)
Wikidata is expected, over time, to play a greater and greater role in how information is organized on the Internet. Other web services can query it as a database, and pull out structured information.
Adding databases to Wikidata
Wikidata can offer great value simply by linking existing online databases (often websites). For instance, if one web site has a page for every lawyer in Nebraska, another has a page for every female published author in the U.S., and another has a page for everyone buried in a U.S. cemetery, then the Wikidata item for a deceased female lawyer-author from Nebraska could have an "identifier" linking to each of those pages, making it easier in the future for both humans and automated processes to "link" the scattered bits of online information about her.
99of9 has added the US NPL identifier to Wikidata, and linked thousands of U.S. newspapers to their US NPL pages. (For instance: visit the Portland Tribune's Wikidata entry and scroll down to near the bottom; then click the "2595" link.)
What other databases can we add? Here are some national ones:
- USNPL Done
- Chronicling America, a project of the U.S. Library of Congress, which seems to use the LCCN identifier in its URL scheme (as do some other online databases)
- Mondo Times
- SmallTownPapers.com (appears to be a commercial archiving venture -- must be behind archiving project like this one)
- Google's newspaper archive (not sure how useful it is as a data source, though it has tons of content)
- Newspapers.com is pay-to-play, but seems to have a strong URL scheme for its pages, and they have a ton of archives. (They're also a Wikipedia Library partner, so there might be valuable lines of communication available.)
- Podunk.com - many newspapers listed, requires more research to see how much useful info it has.
- Echo Media, same - needs more research.
Oregon
- Oregon Historical Newspapers archive (Univ. of Oregon) (uses LCCN as unique ID)
- Oregon Newspaper Publishers Association - this one could be problematic, curious what data folks think. Tons of useful info, but it only has separate pages for General Members (not for Associate or Collegiate members, or non-members). So, over time...what if a newspaper drops its membership? Presumably, the record dies. Not sure how to handle. Done
Infoboxen
One important example of how Wikidata will shift the way that information is organized is evident within the Wikimedia world: Wikidata is increasingly used in managing the kind of infoboxen that are a high priority for this WikiProject.
- There are many infoboxen that already rely on information as published in Wikidata. {{Infobox newspaper}} is not currently one of them, but sooner or later it probably will be.
- On Wikimedia Commons, many categories use infoboxen that are automatically generated from Wikidata. (example)
There is an Infobox Tutorial on Wikidata that might be worth reviewing.
There were 8,408 articles using the {{Infobox newspaper}}, as of February 16, 2020. See Link for the current count and Special:WhatLinksHere/Template:Infobox_newspaper for the current articles using this template. The data that should be included in this Infobox should include, at minimum: name=, type= (Daily, Weekly or monthly newspaper), foundation=, language=, ceased publication= (for defunct newspapers), headquarters= (address of newspaper), publishing_city=, publishing_country=, ISSN= (when known), oclc= (when known), and website= (when known).
Query retrieval and maps
When facts are stored in databases, you can ask questions about the whole set of facts at once. One way this is done on wikidata is using the Wikidata query service.
Here are some examples of queries relevant to this project:
- Map of all newspapers on wikidata if they have a recorded place of publication and that place has recorded coordinates. The map is colour coded according to whether there is an en-wiki article, and if so, the link is available by clicking on the point.
- USA newspapers without a place of publication please provide P291 if you can find it.
You can customize the queries above, or make your own. A tutorial and examples are available to kick you off.
You can also generate a map of all newspapers in a given Category, if the newspapers all have coordinates in the articles. See the following example for Newspapers published in Minnesota: {{GeoGroupTemplate|article=Category:Newspapers published in Minnesota}} This will generate the box at the right. Clicking on the OpenStreetMap link in the box will bring up the map. Substitute the name of any Category you want to use.
Personalized automatically updating lists
If there is a specific subset of newspapers that you are interested in, and you can specify this with a query, you can get a personalized automatically updating list.
Here is an example by wikidata:User:Sic19 that lists a whole lot of information stored in wikidata about all Welsh newspapers. --99of9 (talk) 07:56, 10 August 2018 (UTC)
Things to do
- Every newspaper (whether or not it's notable enough for a Wikipedia article) should have a Wikidata entry.
- There is now a Mix'n'match set 1655 for Australian Newspapers you'd be welcome to help with. --99of9 (talk) 01:43, 7 August 2018 (UTC)
Related WikiProject on Wikidata
There is a closely related WikiProject on Wikidata; please consider reviewing their pages and/or joining that project.
(work in progress...please feel free to build out this page)