Difference between revisions of "User:Adewalker/Gigs Reconciliation Jan 2015"
m (Protected "User:Adewalker/Gigs Reconciliation Jan 2015" ([Edit=Allow only administrators] (indefinite) [Move=Allow only administrators] (indefinite))) |
|
(No difference)
|
Revision as of 09:25, 11 January 2015
Introduction
Decided to reconcile the Main Gigs page listings with the various categories currently in use, namely Category:Gigs_by_country and Category:Gigs_in_location, to make sure that we have correctly categorised all the gigs. Thanks to the Gigbox template, this categorisation is taken care of automatically and will result in a correct categorisation (unless there is a location or country error in the Gigbox). However, quite a few gig pages do not use the Gigbox, and therefore this automatic categorisation doesn't take place.
Methodology
Here's what I did.
Data from Gigs templates
- Copied all the 1991 to 2014 gig template contents from the Main Gigs page into a spreadsheet
- In my spreadsheet, added additional columns for Year, Redlink and Cancelled so that I could filter the data as required.
- Year is useful for sorting as some dates are incomplete, eg 1996-10-??, and are therefore treated by Excel as text rather than a date.
- Redlink gigs will not appear in any of the gig categories, cos the gig pages don't exist, so I added a flag in the relevant rows in my spreadsheet.
- Cancelled gigs do not get categoried either (as per the Gigbox code), so these too needed to be flagged.
- I decided to ignore 2015 as this list is still being added to and there is still updating going on with these gigs.
Analysis of Gigs template data
Results so far:
- Total number of Gig template gigs = 1306
- 1991 = 2
- 1992 = 1
- 1993 = 1
- 1994 = 6
- 1995 = 14
- 1996 = 4
- 1997 = 33
- 1998 = 20 (Redlink = 2)
- 1999 = 147 (Redlink =12)
- 2000 = 146 (Redlink = 42)
- 2001 = 120 (Redlink = 12)
- 2002 = 38
- 2003 = 53
- 2004 = 147 (Redlink = 2) (Cancelled = 4)
- 2005 = 23
- 2006 = 106 (Cancelled = 2)
- 2007 = 111 (Cancelled = 10)
- 2008 = 17
- 2009 = 56
- 2010 = 101 (Cancelled = 4)
- 2011 = 18
- 2012 = 42 (Cancelled = 3)
- 2013 = 96 (Cancelled = 1)
- 2014 = 4 (Cancelled = 1)
- Total = 1306
- Total number of Redlink gigs = 70
- Total number of Cancelled gigs = 25
- None of the Cancelled gigs are Redlinks
From this raw data:
- Exclude Redlinks and Cancelled gigs, ie 70 + 25 = 95 to be excluded
- Theoretical number of gigs categorised by country and location should be 1306 - 95 = 1211, based on Gig/year boxes on Main Gigs page
Check quality of Gigs template Country data
Errors with the naming of country (location) data in the Gigs/year templates. These errors probably arise because these tables are keyed manually, and therefore naming of countries or provinces/countries are not forced to comply with the naming standards we use in actual Gig pages (thanks to the Gigbox).
Fixes required (in Gigs/year templates):
- Gigs/2000 - 2000-08-18 Flughafen - NW Germany (should be NW, Germany)
- Gigs/2000 - 2000-08-27 Lowlands Festival - Holland (should be Netherlands)
- Gigs/2001 - 2001-08-04 Witnness - Rep of Ireland (should be Ireland)
- Gigs/2001 - 2001-08-24 Lowlands Festival - The Netherlands (should be Netherlands)
- Gigs/2004 - 2004-01-19 Metro Theatre - NS Australia (should be NS, Australia)
- Gigs/2004 - 2004-05-31 Megaland - LI Netherlands (should be LI, Netherlands)
- Gigs/2004 - 2004-07-10 Balado Park - shown as Scotland (should be UK)
- Gigs/2008 - 2008-07-30 Vivo Rio - Brasil spelling issue (should be Brazil)
- Gigs/2010 - 2010-07-09 Balado Park - shown as Scotland (should be UK)
- Gigs/2013 - 2013-09-14 Parque Olimpico - Brasil spelling issue (should be Brazil)
Note - these errors have no bearing on the actual categorisation of these gigs by country, as the actual category is established by the Gig Page's Gigbox, not the Gigs/year template listing. The reason they need fixing is (a) to be consistent with naming conventions and country names as generated by the Gigbox and (b) to allow me to reconcile my spreadsheet with the actual Gigs in {country} categories.
Other issues noticed:
- Another issue noted with country names in the Gigs/year templates is inconsistent use of province/state/region initials. For example, some Netherlands gigs have the state initials, others don't. Same with Germany gigs.
- Brasil vs Brazil. This is an issue at Category level too (to be discussed further down).
Reconcile Excel with Gigs in Country categories
Notes about the below table:
- Gig tables = number of gigs as per the analysis in my spreadshett, which is based on the contents of the various Gigs/year templates (as displayed on the main Gigs page)
- Redlinks = number of gigs (as per gig page analysis) which are redlinks, ie the Gig Page itself doesn't yet exist.
- Gig pages = 1st column minus 2nd column. This is the figure that we need to reconcile with the category data.
- No. gigs in actual Category = self-explanatory, ie how many in each "Gigs in {country}" category.
Country | Gig tables | Redlinks | Gig pages | No. gigs in actual Category |
---|---|---|---|---|
Argentina | 6 | 6 | ||
Australia | 57 | 4 | 53 | |
Austria | 13 | 13 | ||
Belgium | 18 | 18 | ||
Brazil | 8 | 8 | ||
Canada | 30 | 30 | ||
Chile | 2 | 2 | ||
China | 2 | 2 | ||
Colombia | 1 | 1 | ||
Croatia | 1 | 1 | ||
Czech Republic | 2 | 2 | ||
Denmark | 15 | 1 | 14 | |
Estonia | 1 | 1 | ||
Finland | 11 | 1 | 10 | |
France | 120 | 7 | 113 | |
Germany | 93 | 10 | 83 | |
Greece | 4 | 1 | 3 | |
Hungary | 4 | 4 | ||
Iceland | 1 | 1 | ||
Indonesia | 1 | 1 | ||
Ireland | 19 | 3 | 16 | |
Italy | 39 | 2 | 37 | |
Japan | 44 | 44 | ||
Latvia | 3 | 3 | ||
Luxembourg | 3 | 3 | ||
Malaysia | 1 | 1 | ||
Mexico | 10 | 10 | ||
Monaco | 1 | 1 | ||
Netherlands | 24 | 1 | 23 | |
New Zealand | 6 | 6 | ||
Norway | 17 | 3 | 14 | |
Poland | 3 | 3 | ||
Portugal | 12 | 12 | ||
Romania | 1 | 1 | ||
Russia | 7 | 7 | ||
Serbia | 1 | 1 | ||
Singapore | 2 | 2 | ||
South Africa | 2 | 2 | ||
South Korea | 5 | 5 | ||
Spain | 26 | 1 | 25 | |
Sweden | 12 | 2 | 10 | |
Switzerland | 25 | 1 | 24 | |
Taiwan | 1 | 1 | ||
Turkey | 4 | 1 | 3 | |
UK | 330 | 11 | 319 | |
Ukraine | 2 | 2 | ||
UAE | 2 | 2 | ||
USA | 289 | 21 | 268 | |
TOTAL | 1281 | 70 | 1211 |
To be continued...