6 Publication place
MARC: 260a
6.1 Complete Dataset Overview
4947 unique publication places; available for 956556 documents (81%).
ambiguous publication places; some of these can be possibly resolved by checking that the the synonyme list does not contain multiple versions of the final name (case sensitive).
unknown place names These terms do not map to any known place on the synonyme list; either because they require further cleaning or have not yet been encountered in the analyses. Terms that are clearly not place names can be added to stopwords; borderline cases that are not accepted as place names can be added as NA on the synonyme list.
discarded place names These terms are potential place names but with a closer check have been explicitly rejected on the synonyme list
6.2 Publication countries
- 46 unique publication countries; available for 889790 documents (75%).
- 4239 places with unknown publication country (85.7% of the unique places; can be added to country mappings)
- potentially ambiguous region-country mappings (these may occur in the data in various synonymes and the country is not always clear when multiple countries have a similar place name; the default country is listed first). NOTE: possible improvements should not be done in this output summary but instead in the country mapping file.
6.3 Geocoordinates
- 74.6% of the documents were matched to geographic coordinates (based on COMHIS geomapping process).
- 4299 unique places (86.9% of all unique places and 25.36% of all documents) are missing geocoordinates. See list of places missing geocoordinate information.
6.4 Subset Analysis: 1809-1917
477 unique publication places; available for 52733 documents (82%).
Unique publication country for a period 1809-1917: 32; available for 51416 documents (80%).
Top-20 publication places are shown together with the number of documents.
Country | Documents (n) | Fraction (%) |
---|---|---|
Finland | 46461 | 72.0 |
Sweden | 1447 | 2.2 |
Russia | 1241 | 1.9 |
USA | 930 | 1.4 |
Germany | 558 | 0.9 |
England | 173 | 0.3 |