AV1000
Fundamentals of the digital humanities
Distribution of words across various sources: Millennium Dome Textbase for “time”

The following table shows the distribution of the word “time” across various sources concerned with the Millennium Dome: the Guardian (G); Hansard debates in the House of Commons (HC) and House of Lords (HL); the London Evening Standard (LES); the Select Committee on Culture, Media and Sport, reports 2 (SCCMS2) and 3 (SCCMS3); and the London Times (T). Note particularly that the size of the textbase varies significantly from source to source, from 17242 (SCCMS2) to 55238 (LES).

The problem here is to chart the distribution so as to show the relative densities of occurrence for the given word.

source    occurrences    size of source
G                  62             35218
HC                 83             35129
HL                169             48653
LES               107             55238
SCCMS2             38             17242
SCCMS3             72             33402
T                  74             47511

Total occurrences: 605. Total size of the database: 272393.
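
One possible way to approach the problem, offered as a minimal sketch rather than as the intended solution: divide each source's occurrences by its size, so that sources of different sizes can be compared on the same footing, and then chart those densities instead of the raw counts. The scaling to occurrences per 1000 units of source size, the names `density` and `text_chart`, and the plain-text bar chart are all assumptions made for illustration; a spreadsheet chart would serve equally well.

```python
# Figures copied from the table above: source -> (occurrences, size of source).
data = {
    "G": (62, 35218),
    "HC": (83, 35129),
    "HL": (169, 48653),
    "LES": (107, 55238),
    "SCCMS2": (38, 17242),
    "SCCMS3": (72, 33402),
    "T": (74, 47511),
}

# Assumption: express density as occurrences per 1000 units of source size.
PER = 1000


def density(occurrences, size, per=PER):
    """Relative density: occurrences scaled to a common source size."""
    return occurrences / size * per


# Density of the word in each source.
densities = {source: density(occ, size) for source, (occ, size) in data.items()}

# Density over the whole database, useful as a baseline for comparison.
total_occurrences = sum(occ for occ, _ in data.values())
total_size = sum(size for _, size in data.values())
overall = density(total_occurrences, total_size)


def text_chart(values, scale=10):
    """Return one text bar per source, densest first.

    Each '#' stands for 1/scale of a density unit (hypothetical choice).
    """
    ranked = sorted(values.items(), key=lambda item: item[1], reverse=True)
    lines = []
    for source, value in ranked:
        bar = "#" * round(value * scale)
        lines.append(f"{source:<7}{value:6.2f}  {bar}")
    return lines


for line in text_chart(densities):
    print(line)
print(f"{'overall':<7}{overall:6.2f}")
```

Charting the densities rather than the raw occurrences matters because the ranking changes: LES has the second-highest raw count (107) but also the largest source, so its relative density falls below that of several smaller sources.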

revised January 2008