{"id":6587,"date":"2018-05-10T20:48:56","date_gmt":"2018-05-10T20:48:56","guid":{"rendered":"http:\/\/studentwork.prattsi.org\/infoshow\/?p=6587"},"modified":"2018-08-07T11:13:31","modified_gmt":"2018-08-07T11:13:31","slug":"peshawar-scrapin-producing-a-better-index-to-cia-documents-on-the-soviet-occupation-of-afghanistan-1979-1989","status":"publish","type":"post","link":"https:\/\/studentwork.prattsi.org\/infoshow\/2018\/peshawar-scrapin-producing-a-better-index-to-cia-documents-on-the-soviet-occupation-of-afghanistan-1979-1989","title":{"rendered":"Peshawar Scrapin\u2019: Producing a better index to CIA documents on the Soviet occupation of Afghanistan, 1979-1989"},"content":{"rendered":"<p><span>Peshawar Scrapin&#8217; is an exercise in rapid subject tagging of poorly described textual material. Using automatic and human-curated methods, I scraped 7,000+ PDF documents on the Soviet-Afghan War from the CIA&#8217;s website, expanding the CIA&#8217;s deficient metadata with the names of relevant persons, factions, places, and concepts.<\/span><\/p>\n<p>Slides: https:\/\/docs.google.com\/presentation\/d\/1ND-sEmw5zBjerO3t3x68xGrJHtk9Ar9rhu0GOTHgxzE\/edit?usp=sharing<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Peshawar Scrapin&#8217; is an exercise in rapid subject tagging of poorly described textual material. 
Using automatic and human-curated methods, I scraped 7,000+ PDF documents on the Soviet-Afghan War from the CIA&#8217;s website, expanding the CIA&#8217;s deficient metadata with the names of relevant persons, factions, places, and concepts.<\/p>\n","protected":false},"author":259,"featured_media":6971,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[282],"tags":[7,162,131],"coauthors":[525],"class_list":["post-6587","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-282","tag-information-visualization","tag-policy","tag-research"],"_links":{"self":[{"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/posts\/6587","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/users\/259"}],"replies":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/comments?post=6587"}],"version-history":[{"count":2,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/posts\/6587\/revisions"}],"predecessor-version":[{"id":6913,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/posts\/6587\/revisions\/6913"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/media\/6971"}],"wp:attachment":[{"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/media?parent=6587"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/categories?post=6587"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/tags?post=6587"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/studentwork.prattsi.org\/infoshow\/wp-json\/wp\/v2\/coauthors?post=6587"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}