|
Peter Teibenbacher
Demographic transition - a relational data-base and statistical
oriented solution
This paper deals with the topic of demographic
transition in a small parish in Upper Styria/Austria between 1880 and
1940. The parish`s population mainly was an agrarian one, partially
engaged in handicraft and traffic. Besides that the parish consisted of
67% catholics and 33% protestants. Thus also the question concerning the
role of so-called cultural and so-called social-economic factors
determining demographic behaviour can be tested. The analysis is done with
the help of a relational data-base in SPSS-format. This solution does not
want to present itself as the best one but as a functional one among
others. The advantage is, that SPSS is a wide-spread program, well-known
and usable in many other concerns. The data-matrix primarily wants to
represent female demographic biographies - but also contains fathers and
bridegrooms -, it depicts different events like cases of births, deaths
and marriages and different persons with different roles involved. The
process of identifying persons is done in a semi-automatic way with the
held of stepwise oriented sorting-routines, selecting routines comprising
different variables like maiden-names, first and last names, dates of
birth, marriage or death of a person etc. The decisions at least have been
made case-by-case, cause there is no real satisfying automatic
record-linkage-procedure at our disposal. Using this mix of methods the
central questions concerning demographic behaviour like starting,
stopping, spacing, the amounts of births for each woman, series of
sequential events in a woman`s demographic biography etc. can be tested in
a satisfactory way. But with the help of different routines, especially
the lag and lead-functions in SPSS it is also possible to insert
automatically "demographic family"-codes, comprising all persons which are
involved in common demographic events like births (mother, father, child)
and marriages (brides and bridegrooms).
Onno Boonstra, Leen Breure & Peter Doorn
Past, Present and Future of Historical Information Science
The roots of historical information science are grounded in quantitative socio-economic history on the one hand and in computerized analysis of historical texts on the other. In the second half of the 1980s 'history and computing' got a strong impulse by the advent of the PC. In the 1990s a debate on source-oriented versus problem-oriented computing was important. The Internet stimulated many heritage institutes (archives, libraries and museums) to digitize collections. Since the late 1990s and in the first years of the21st century, the 'H&C movement' seems to have lost it's momentum: humanities computing courses and departments in universities are under pressure or cut back; journals or series are being discontinued; the number of new publications is diminishing; H&C conferences are less frequent and attract less participants.One might even speak of a 'crisis' in historical computing.
It is useful to reflect on what has been accomplished in the past 15 years, what the present situation is, and to set out lines for the required future development of the field. To this aim, we have formulated a research project, which should result in a conference, to be held in Amsterdam on 12-13 December 2003, and later in a book. The research questions have been formulated as follows:
What has been the effect of computing on history as a dicipline? Which new challenges lay ahead for history and computing?
Which innovations did computing bring the historical discipline? What innovations for the future are possible and desirable? What are the requirements of the historical field on computing?
What were the main topics for discussion? What new topics require discussion? Future agenda?
We intend to focus the research questions focus on the one hand on the quantitative approach to history (socio-economic history), on the other to the Qualitative approach to history (cultural history), text & images. Moreover, attention will be paid to the role of digitization and digital archiving in heritage institutions for research, on teaching in universities, and on the communication of the History and Computing specialists with the sectors of historical research at large, heritage institutions, and 'general' informatics.
Serguei Kachtchenko and Ekaterina Kostrigina
Population of Karelia (1800-1917). Computer Analysis of
individual records.
In 2000-2003 the work on finding and studying parish
registers of Olonetz and Pudozsk parishes of the Province of Olonetz with
karelian and russian population has been continued. The research has been
carried out by the chair of Source Studies Of Russian History of
St.Petersburg State University, with the support of Russian Humanitarian
research Fund of Russian Academy of Science (Grant ?01-01-78002 ?/?).
As a result of search works at The Central State Archives of The
Republic Kareliya (Petrozavodsk) and The Central State Historical Archives
of St. Petersburg numerable volumes of parish registers (since 90-th of
the 18th century to 1917), confession records, revision materials and
censuses of the Province of Olonetz were found.
The found items of information were formalized and entered into
computer databases, those files, which contain the information on births
(baptisms), marriages (weddings), deaths (burial services). Among the
issues discussed the most attention was drawn to the principles of
creating the computer databases on historical demography. The developments
were towards the adaptation of the standard statistical programs and
creation of the original. The analysis of parish registers, first of all
issues of holes in infants' births and deaths registration is of a great
importance. We managed to define the exact time when the registration of
perinatal, neonatal and postneonatal mortality had begun. A number of
questions, connected with the analysis of so-called "small selected sets"
were investigated. The items, connected with mortality were studied (the
peaks of deaths (connected with epidemics), the reasons of deaths, life
expectancy (the dependence on sex and social status); rates of births and
marriages (particular attention was paid to the changes in the marriage
age during 150 years). Some pre-liminary results were compared with the
data from the different regions of Russia and Netherlands at the
international seminars, which were held in 1999 - 2001 in St. Petersburg
and Groningen. Several people were busy in the research, apart from the
authors of the letter, K. Valegina, M. Markova, S.Smirnova and a number of
students of the History Faculty of St. Peterburg State University have
taken part in the investigations
Siegfried Gruber
Migration in Albania in the early 20th century
There is hardly anything known about migration in
Albania in the early 20th century on the basis of quantitative research.
This is the consequence of a lack of appropriate data. A group of
researchers has made a sample of the Albanian census of 1918 available for
research and this sample can be used for research about migration since
the census includes data about place of birth and place of present
residence. This is of course only data about one point in time and does
not allow for analysing migration histories of individuals, but it
provides us with information about the percentage of people born in the
place where they were staying at the time of the census. It allows also to
analyse residence patterns following marriage and traces of migration
histories can be found when children or siblings were born in different
places. All places will be linked to a map for creating a historical GIS
of Albania in 1918. This historical GIS enables research about the places
of origin of the urban population and therefore about the different
hinterland of the urban centres. The exact information about the place of
residence in the cities can be used for finding the places within the
cities where the rural immigrants lived and whether they tended to live
near other immigrants from the same region or not.
Robert M. Schwartz
Making the Grade: An Application of Digital Elevation Modeling to
the Study of Railways and Environment in Victorian England
This paper uses a digital elevation model to study the
changing impact of railway construction on the landscape of England and
Wales from the 1840s to the eve of World War I. It examines the degree to
which the increasing power of locomotives and other technical advances
after 1860 enabled the later rail system to negotiate the landscape less
by cutting through hills and mountains via huge excavations than by
steaming over it. In this way, branch lines to remote and topographically
difficult areas became less costly and more feasible, and the geographical
linkage of England and Wales was extended and enhanced. To examine this, I
use a digital elevation model to calculate and compare the slopes of
rail-line segments constructed up to 1855 and those built between 1856 and
1876. The initial results bear out my working hypothesis and illustrate
the usefulness of this approach. I also plan to present a real-time
fly-through of the virtual landscape of Wales to demonstrate another
feature of GIS visualization.
Onno Boonstra
Information technology in history teaching. Learning history by
writing history: the Low Countries History Museum
Undergraduate students at the University of Nijmegen
need to present dozens of papers in order to pass on to the graduate
courses. In 99 percent of the time, the papers they have written do not
get full attention: neither from their fellow students, nor from their
teachers. Although students realize that it is worthwile to put a lot of
effort in writing papers, it is frustrating to experience that the merit a
paper gets is in no proportion to the hours of hard work that have gone
into it. In order to solve the discrepancy, the Department of History has
set up a web site, called the Low Countries History Museum, which is
hosting a collection of hundreds of items, each item relating to a major
event in the history of the Low Countries. Instead of writing a paper,
students are invited to write a lemma for the museum, thus learning
history by adding to its historiography.
Mats Hayen
The coding of 3.000.000 titles into HISCO at the Stockholm
Historical Database
Abstract: This paper presents the work with title
standardisation and HISCO-coding at the Stockholm Historical Database.
HISCO (Historical International Standard Classification of Occupation) is
a collaborative project involving different institutions all over the
world. In Stockholm work with HISCO began in 1999 and at the present time
we have managed to code approximately 90-95% of all titles that occur in
our database (currently containing more than 3 million person entries).
The paper discusses the source material, coding methods, coding problems
and how the material may be used by historians and other academics.
Marco van Leeuwen, Amsterdam
HISCO and the History of Work Website
Marco van Leeuwen, Amsterdam: HISCO and the History of Work Website
The International Institute for Social History will soon make public
the History of Work Website. This website contains information on
occupational titles, descriptions of the work involved, images on the
world of work, essays about these images, measures of class and prestige
as well as a bibliography. Ten thousands occupational titles from a dozen
countries and languages all over the (as yet Western) world are made
available in a comparable and systematic way. This has been made possible
by the work of scholars from about 20 research institutes. The coding
system used is HISCO, a structured coding schedule able to capture
relevant details of the world of work in the past, and with short
descriptions of the tasks and duties involved. Next to this the History of
Work Website has at present about a thousand images on the world of work,
as well as a number of essays by art historians on these images. The
website offers HISCO-based measures of class and prestige in the past,
recode jobs to current class and occupational schemes in use by
statistical agencies, as well as a computer assisted module coding
occupational titles into HISCO.
Marco Sunder
On the Biological Standard of Living of the Middle Class in 19th
Century U.S.
Average physical stature of a population is often used
as an indicator of (biological) living standards, insofar as human height
reflects net nutritional status during the first two decades of life. The
"Antebellum Puzzle" refers to the unusual situation in 19th century United
States, when the average height of (male, non-slave) cohorts born in the
decades preceding Civil War declined, although the economy was growing in
terms of real per capita output. This has been called the "Antebellum
Puzzle". The literature argues that economic forces during the American
industrialization can explain a substantial proportion of this puzzle. The
main argument is that increasing relative prices of foodstuffs induced an
adverse dietary change among the average population. We address this issue
on the basis of a newly collected sample on human heights from passport
applications. It allows us to assess trends in net nutritional status of
the richer parts of the society for the birth cohorts c. 1820-90. It
primarily represents the urban North Eastern United States and covers not
only males but also females. About 17,000 individuals have been linked to
counties of birth so that differences in local conditions, especially in
agricultural production, can be taken into account. Our analysis indicates
that the local configuration did influence the biological living standard
of the people in our sample, however the time trend suggests that they
were more immune against the adversities of those years.
Markus Heller
Web Access to Classic Text Retrieval
The data format "Beta Code" in which the PHI and TLG corpora are
encoded has ceased to be developed more than a decade ago. Being optimized
for saving space, compactness is no longer a primary requirement for
modern technology. Instead many new technologies in the field of
"Information Retrieval" have developed and data exchange and ease of
access are key factors of any corpus project.
Being topic of a Masters' Thesis, the current work was initially meant
to be a proval of concept to show that it was possible to analyze, extract
and process the corpus data into an index using inverted indexing
technology as known from many search engines.
As a secondary step an access tool was built as to enable Web users to
issue search commands using the highly versatile Perl Regular Expressions.
Direct text access including translation into the appropriate fonts
provided comfortable access for researchers interested in searching as
well browsing the classical texts available.
Ongoing research which is to be presented at the AHC conference in
Tromsoe concentrates on optimizing the corpus index and on transferring
the index data into a relational model on the basis of a relational
database.
The result will be searchable and browseable Web access to the corpus
through efficient search technology on basis of a powerful relational
database.
Luuk Schreven
Providing access to the Dutch population census of 1971
Although the first Dutch population census was held in 1795 by the
occupying French, it wasn't until 1829 that the Dutch picked up on the
idea and institutionalised the concept. From then on, there was a
decennial census until 1930. The 1940 census was cancelled due to World
War II, but soon thereafter the thread was picked up again, resulting in
general population censuses in 1947 and 1960 and a housing census in 1956.
The late sixties and seventies showed an increasing public concern with
the protection of privacy. This led to a limited public ban on the 1971
census, only some .18 percent actually refused to cooperate. The 1981
census on the other hand was first postponed and later cancelled because
of an average non-response of 26 percent during census trials.
Since 1997 the Netherlands Institute for Scientific Information
Services (NIWI) and Statistics Netherlands (CBS) have been working on
several projects to digitise the Dutch population censuses. The first
results, consisting of two sets of CD-ROMs, a Website (http://www.volkstellingen.nl/),
were presented in 1999. Through these CD-ROMs and the website images of
the census publications from 1795 to 1971 were presented, also some 10,000
pages of published data were manually converted for the 1899 census, these
are available through the CBS Statline system (http://www.cbs.nl/en/figures/statline/index.htm).
More recently NIWI and CBS are cooperating in a project aimed to do the
same for all the other censuses. What's more, the individual data of the
last two censuses (1960 and 1971) will become available for research as
well.
My paper will deal with the some of the problems encountered while
examining the 1971 individual data, as well as the actions taken to ensure
that individual citizens cannot be identified within the data. Furthermore
I will present the end results of the project, as it is to be finished by
the end
Donald Spaeth
Structured or Semi-Structured? Applying XML to Historical
Data
Most historical sources take the form of semistructured data. Most
computer-based historical research, however, represents sources as
structured data, whether statistical or relational database management
software is used. This is not a new problem. Proponents of
'source-oriented data processing' have argued since the late 1970s that,
rather than fitting the data to software, the software should be fitted to
the data. This paper will consider the suitability of Extensible Markup
Language (XML) and related tools (such as XSLT) for representing and
analysing historical data, using a database of probate inventories from
seventeenth-century Thame (England) as a testbed.
Computer scientists have embraced XML as a means of representing
semi-structured data. Developed initially to publish information on the
World Wide Web, it is becoming clear that XML has an important role in
representing and interrogating data. Documents on the Web present computer
scientists with the problem of making sense of a heterogenous body of
information, unlike the more regular business data used by commercial
applications. Like historical documents, semistructured data are
characterised by
- irregularities in structure
- missing data
- duplicated fields
- minor changes in representation
- ambiguous data
- information that does not conform to strict data types
- data from heterogenous sources
XML provides a flexible
user-definable approach to represent the, often implicit, structures of
historical documents. Rather than copying data into the fields and records
of a relational table, tags are inserted into a transcript of the text.
The user may define any tags that he or she wishes, so long as a limited
number of conventions are followed. The tagging system may be documented
in a Document Type Definition (DTD), if desired. For those familiar with
developing data models, the development of DTD, which can be used to
validate and document the data, seems an obvious step. The DTD also
facilitates analysis by providing a consistent and predictable structure.
Yet proponents of semi-structured data have argued that the DTD itself
imposes a structure, and thus a set of constraints, on the document. One
must still balance ease of analysis against being true to the source.
This paper will compare the relative advantages of treating data as
structured or semi-structured data. The Thame inventory database was
devised using a DTD, so that the document confirms to a structure,
although some slackness in definition loosens the constraints (e.g. by
permitting some tags to appear at more than one level of the hierarchy).
The data can be analysed in native XML, using XSLT, or can be exported to
a relational database. Indeed, the ease of exporting the data confirms the
structured form of its representation in XML; it is possible, for example,
to relate information from one portion of the hierarchy (or indeed from a
separate hierarchy) with another. The relational analogy can be used to
cope with other complications created by source-orientation, e.g. the
large numbers of variant spellings found in early modern documents.
The alternative is to code the data as they appear, without using a
DTD. Since tags may then potentially appear at any level of the hierarchy,
the data may be more difficult to analyse in one or more 'flat' matrices.
XSLT and XPath provide means for retrieving content from fields wherever
they may appear, but if one wishes to retrieve and relate several fields
from different portions of the hierarchy navigation may be more difficult.
It is necessary to develop new research strategies, both to explore the
location of tags that have been used after markup is complete and to
perform multi-dimensional analysis. The forms such analysis will take are
only beginning to take shape, and this paper will provide a preliminary
statement.
Marjan Balkestein, Annelies van Nispen and Peter
Doorn
Round table on data archiving
Topics 1. The (changing?) role of historical
data-archives in national data-infrastructures Are there
developments that change(d) the role and position of data-archives in your
country? Does public awareness of the need for digital preservation
influence centralized data-archives? What are the pros and cons of
centralized and decentralized data-storage? How do you see the future of
data-archives?
2. Trends in data acquisition How are your data-collections
developing? Is there (still) a strong emphasis on quantitative historical
datasets? What kind of trends do you see when you look at the datasets you
acquire? What are the trends in computer-based historical research at the
moment?
3. New developments What new developments are of interest to
our work as data-collectors and providers? XML, OAI, DDI?
4. Projects and opportunities for cooperation What
(inter)national projects do you participate in? Are there opportunities
for data-archives to participate in projects together?
Heinz Berger
Teaching History Online
I would like to present a paper about a project called Geschichte
Online - History Online. In this project we are dealing with the
possibilities of online teaching in various fields of history studies at
the university. We ask the question what basic knowledge and skills could
be taught by e-learning and online teaching. What is the didactic surplus
of online teaching compared to face to face teaching or the reading of
books? How can we find the best way of alternating between phases of
presence and online phases? Can we expect progress in improvement and
standardisation of qual-ity through public and transparent teaching? Is it
possible to create metadata for historical con-tents?
Questions like these are focused in the web based project History
Online started in October 2002 at the University of Vienna and supported
by the Austrian Ministry for Education, Science and Culture. Our project
group consists of historians of two departments of the Viennese University
and free lance collaborators. Important parts of our work are frequent
discussions about our conception with colleagues all over Austria. They
also undertake the permanent evaluation of the resulting "virtual learning
objects" in their history lessons at various Universities.
Elementary skills and knowledge should be presented and taught in the
five following modules: Introduction into academic historical work,
investigation of historical information and literature, edu-cation and
training in the field of didactics for history teachers, education in
didactics of media based on examples like TV-news and online teaching
combined with seminars and lectures (based on a specific examples in the
field of historical cultural studies).
I would also like to present the structure of the project (sitemap and
requirements) and the system of metadata we use (based on Dublin core).
The courses we offer start with a site of guidance fol-lowed by the main
site which is divided into basic text, examples, exercises, literature and
a glos-sary for historical, didactical and technical terms. In a short
demonstration I will show one of the prototypes of History Online
including the whole functionality.
Various didactic ways will be used in History Online like guided web
investigation in traditional face-to-face lessons, interactive online
learning where the users are introduced to the topic, online phases of
practice with response about the success and tests, team oriented courses,
presentation of historical subjects in lectures with a limited number of
participants and in seminars and courses for training of multipliers.
Sergey Kornienko
Teaching the history of Stalinism in Internet The problem
of our research is the opportunity to use Internet resources in history
teaching. One of the historical themes in which the using of Interrnet
technology and resources is "Stalin and Stalinism". The successful usage
of Internet resources in the teaching of this problem demands to reveal,
analyze and consider them. Method and procedures. The result of searching
Internet recourses on the problem with "Yandex-system" (Russian segment of
the net) and "Alta-Vista" permitted to obtain the whole amount of
resources, their distribution according to types and problems. The
quantitative results of research are organized in the tables which data
are presented with different diagrams.
Results and Discussion. The comparative analysis of number data and
graphics shows that the general amount of resources in Russian and
world-wide Internet are approximately equal and that is why the
educational potentials the both parts of Internet are equal. The analysis
of types of documents on the problem "Stalin and Stalinism" shows that
they can be classified as texts, articles, portraits and drawings of
different kinds (Yandex: "Stalin" 1663). There are also audi and video
documents, which are of special value in the educational process. The
analysis of topics of the theme "Stalin and Stalinism" shows that they
include the most valuable problems and aspects, such as Stalin's
biography, the Stalin's regime, repressions, terror, the USSR in Stalin's
time, foreign and inner policy, Stalin and bolshevism, Stalin and nuclear
bomb, Stalin and cold war, Stalin's death and others. The detailed
thematic distribution of resources is accompanied by tables and graphics.
The Internet materials allow to make a conclusion that the Internet
resources are not only rather large, but unique as well. It is especially
important for teaching the history of Stalinism. The more full and
adequate discussion of the educational Internet resources of the history
of Stalinism must be added by the inner quality analysis. We continue our
research just in this direction. Nevertheless the quantitative analysis of
Internet resources on the problem "Stalin and Stalinism" illustrates that
it must be an important component in teaching the History of the 20-th
century.
Janet Delve and Mark Allen
Data warehousing - a new approach for historians?
Relational databases are a standard tool for historical research
[Harvey and Press, 1996; Hudson, 2000], with object-oriented databases
receiving the occasional mention. [Bradley, 1994] highlighted the
difficulties associated with relational modelling and [Breure, 1995]
pointed out the advantages of inputting data into a model which matches it
closely. [Burt (Delve) and James, 1996] considered the relative freedom
when using source-oriented data modelling as compared to relational
modelling with its restrictions due to normalisation, and drew attention
to the forthcoming possibilities of data warehouses (DWs).
This paper considers in depth the nature of DWs, exploring how they
differ from databases. DWs allow primitive data to be stored along with
summarised and related integrated data. Normalisation is not enforced as
it is with relational databases, but a degree of normalisation is possible
where desirable. DWs comprise fact and dimension tables, which differ from
relational tables. Fact tables are broadly geared towards numerical data
whilst the dimension tables contain relatively unchanging descriptive
data. DWs are loaded periodically with snapshots of data from a database,
and inherently contain a time element. The issues surrounding the use of
this primarily business-oriented technology for historical purposes is
addressed, especially in view of the availability of appropriate DW
software in universities with, for example, Oracle site licenses.
References Bradley, J. 1994 'Relational Database Design',
History and Computing, 6.2, 71-84 Breure, L. 1995 'Interactive data
Entry', History and Computing, 7.1, 30-49 Burt (Delve), J. and James, T.B.
1996 'Source-Oriented Data Processing. The triumph of the micro over the
macro?' History and Computing 8.3 160-9 Harvey, C. and Press, J. 1996
Databases in Historical Research, Basingstoke Hudson, P. 2000 History by
numbers. An introduction to quantitative approaches, London
Biographical details Janet Delve is a senior lecturer in
Information Systems at Portsmouth University, England. She received her
doctorate, 'The development of the mathematical department of the
Educational Times 1847-62' in the History of Mathematics and Historical
Computing, at Middlesex University in 1999 under Ivor Grattan-Guinness.
Her research interests include nineteenth-century mathematics, early
British computing and data warehousing. She contributed articles on
William John Clarke Miller and Thomas Turner Wilkinson to the forthcoming
New Dictionary of National Biography [2004]. Janet is a Co-director of the
Winchester Project, King Alfred's College, and a committee member of the
UK Branch of the AHC.
Mark Allen is a lecturer in history at King Alfred's College in
Winchester, England. He works on the development of Winchester in the
nineteenth century and the uses of the census as an historical source.
Mark is also a Co-director of the Winchester Project, King Alfred's
College.
Alexander Kobrinsky
Speeches of the Leaders of Factions in the State Duma of the
Russian Federation (computerised analysis: applying TACT)
Making a close study of the activity of the State Dumas of the Russian
Federation and analysing stenographs we can get answers to many questions
which we are interested in concerning the a ctivity of the Dumas as a
single whole legislative body, the activity of factions and deputies
groups on the positions and views which they upheld as well as the work of
separate deputies in the supreme legislative body and so on. Studying
speeches of the leaders of deputy unions gives us the possibility to get
more complete and distinct idea of the position of a faction or a group,
of its participation in discussion of this or that question. The analysis
allows to reveal factions priorities in the legislative activity.
Very often an appraisal of the activity of a deputy union in the State
Duma is made on the basis of journalistic articles based, in their turn,
on separate utterances of its leader given by mass media which are, as a
rule, fragmentary, systemless and, besides, not always correct enough
chosen and without proper scientific analysis. Argumentative questions
presented by the press and television are sometimes not quite exact and
make their own contribution to non-objective perception of this or that
leader for the part of users of information and this is directly connected
with the appraisal of work of his union in the State Duma.
The similar approach favours the appearance of definite stereotypes of
perception of different politicians as well as of different parties, the
Duma factions and groups in a society. The existing stereotypes prevent an
objective appraisal of the activity of the elected representatives of the
people.
The subject of the research constitute stenographic reports of meetings
of the SD FM RF (the State Duma of the Federal Meeting of the Russian
Federation) of 1993 - 1995 in their electronic version. The original
source basis was composed, in fact, by stenographs of 1993 - 1995 plenary
meetings of the SD FM RF (which are at our disposal in their electronic
version). The present investigation proceeds the work on studying
stenographic reports of the plenary meetings of the State Duma of the
Federal Meeting of the Russian Federation. The results of this work were
published in different scientific articles and monographs. 1
The aim of our research is to give a scientific analysis of speeches of
the leaders of the main factions for revealing their positions as well as
the dynamics and evolution of their views on actual problems of home and
foreign policy of Russia. To reach the goal of the research work it is
necessary: (1) to define the main problems touched upon by the leaders of
factions in the course of their speeches, (2) to reveal their position on
a number of questions discussed in the SD RF, (3) to find out the degree
of their activity and priorities in the legislative activity.
Of special interest are speeches of the leaders of the CPRF, LDPR and
YBL (now known to us as Yabloko). On December 19, 1999 the elections to
the State Duma of the third convocation took place and today we can state
that these three factions managed, for the third time (with different
success), to overcome the 5 per cent barrier. Taking into consideration
that the leaders of these factions remained to be the same the present
research work creates the basis for the following comparative analysis: to
what degree priorities of factions stated by their leaders appeared to be
unchangeable and, as well, what changes took place in positions of the
leader themselves and whether there were such.
Of the whole mass of the stenographs reports (in the electronic version
it is about 25 mg bites that approximately makes 40 full value 600 pages
books in the printed form containing materials of 140 meetings of the
State Duma of the first convocation) we selected the parts which directly
concern speeches of the characters we are interested in. We carried out
the work on defining the main blocks of interest of each of the factions
leaders in the course of his activity in the State Duma of the first
convocation. We formed the list of categories reflecting the main
directions in the speeches of leaders of the Duma factions. And, having
selected a set of indicators (semantic units of a text ) making concrete a
semantic meaning of each category, we made the analysis of speeches
applying the computerised TACT programme. It was very important for us to
use one and the same methodological set of instruments based on computer
technologies applied to historical researches that gave us the possibility
to get objective data for our analysis and made it real to make
conclusions "obvious" to the maximum possible degree.
In the course of work we determined the frequency and intensity of
speeches of the leaders of factions and the main accents put by them in
the process of discussion of various problems. There was shown the
dynamics of semantic categories appearance in their speeches. We studied
the context of separate utterances and gave the estimation of it. We also
showed the frequency of appearance of different terms having an important
meaning for the legislative activity in speeches of the leaders of the
deputies unions of the first convocation and we made their comparative
analysis. ____________
- Kobrinsky, A. L. Problems of the State Building in the Russian
Federation (on the materials of stenographs of the State Duma plenary
meetings in 1993 - 1995), M.: MGU, 2001.
Kobrinsky, A. L. Speeches of
Zshirinovsky, Zjuganov, Yavlinsky in the State Duma (1993 - 1995), M.,
2003.
Batyrbaeva Sh. D. Bishkek
Application of Information Technology in Database Creation at the
Research of Demographic Processes in Kyrgyzstan of XX Century.
The demographic history of Kyrgyzstan in Soviet and post-Soviet period
is not formed yet as the independent scientific trend. So, our
investigation is devoted to the problems of database formation on the
history of Kyrgyz population in retrospective view, on the basis of the
censuses taken in the XX century with attraction of data related to
genealogical tree on the level of separately taken settlements from 1999
to 1926.
Such methodology is conditioned by the fact, that the programs of
conducted censuses differed from each other and groups of data had not
been coordinated with previous data at their preparing for publication.
Besides, the territory of Kyrgyz Republic, regardless of its small size,
was divided into some typological regions differing greatly in their
natural-climatic conditions, formed ancient traditions and customs,
conditioning the differences in all areas of Kyrgyz population
functioning, which also made difficulties in comparing of the available
data. Hence, the selection of settlements for database formation was
carried out depending on the undamaged condition of initial regional
summary data.
At the formation of the database on census materials we faced with the
problem of insufficient registration and inaccuracy of collected
information. In order to settle this problem, we conducted the comparative
analysis of initial censuses records and data from Official Registries of
Village Soviets, but not always we got the desired results.
For this purpose, we conduct the additional research on the selected
settlement with application of Computer technology, based on the
informational logic modeling of Kyrgyz families genealogies, because
according to traditions, the male representatives were to know their
genealogical tree seven times removed.
The results of modeling of 3-4 generations show rather adequately the
dynamics of number variations and migration mobility of men, starting from
13 years old in the separately taken settlement, because according to
tradition, the male population was considered to be adult since that age
and in case of their death, they were taken into account as the separate
generations of definite ancestors.
For this purpose, while analyzing the clan-tribe settling of Kyrgyz
people and using the list of informative objects, we shall make-up the
diagram of data flows in the form of a scheme (see below).
At the making of diagram, here, the attributes of each object are being
substantiated and the relations between them are being established. On the
base of the collected materials for informational reflection of the
clan-tribe settlement of Kyrgyz people, we created the database MS ACCESS
in SQL.
This database allows to explain the number deviations in male
population of some clans at the selected settlements, to make clear on
their base the natural and mechanical motion for each inter-census period
and to define hypothetically the dynamics of women's number variations in
the definite age groups and therefore, to fill in the data, not-registered
at database formation.
In a whole, the work in this direction is being continued. The
determination of dynamics of the male portion of Kyrgyz population in
retrospective towards the XV century will be the result of this research,
because at present time we have already some published works of some
connoisseurs of genealogical trees - "sanzharachy", revealing the
genealogies of the Kyrgyz people.
Scheme of the
genealogical tree of the Kyrgyz
Kees Mandemakers and Lisa Dillon
Best practices with large databases on historical
populations
Program round table AHC 2003 Friday 8th of August 11.15 hours
- 11.15 Introduction by Kees Mandemakers, International Institute of
Social History Amsterdam and Lisa Dillon, Université de Montréal
- 11.45 Comments by Gunnar Thorvaldsen, University Tromsø; Ólöf
Garðarsdóttir, Statistics Iceland; and Marco van Leeuwen, International
Institute of Social History, Amsterdam
- 12.15 Short reaction by authors
- 12.30 Discussion with the public
Abstract Since the
late 1960s researchers who transform routinely-generated primary sources
into machine-readable data have produced numerous methodological articles
and book chapters detailing the process of data creation and describing
how the peculiarities of primary sources can affect interpretation of the
data. However, in such articles, the best practices for creating large
databases have usually been implicit rather than explicit. Here a
comprehensive list of best practices for the creation of large databases
on historical populations is introduced, drawing upon the experiences of
the Historical Sample of the Netherlands, the Canadian Families Project,
the IPUMS and other projects.
The following guidelines represent a revised version of the protocol
formulated on the occasion of the workshop of the Historical Sample of the
Netherlands (HSN) which took place under the title 'HSN-workshop on large
databases: Results and best practices' at Amsterdam on 17-18 May 2001. All
the participants are to be thanked for their comments on this earlier
version which have been incorporated as much as possible into these
guidelines.
This protocol is not exhaustive, but it does address those principles
which should be considered by all projects. We use the phrase 'should be
considered' because we are well aware that other factors such as
cost-benefit considerations may make it useful or necessary to make
alternate choices than those advocated by this protocol. In general,
diverging from one or more of the recommendations described below is not
problematic as long as the argument for these choices is made clear to the
users and future owners of the database. Thus, adherence to this protocol
does not mean that every principle must be applied, yet the reasons for
not following a specific rule or recommendation should be explained.
We consider here only those large databases which have the goal to be
open for secondary analysis and which are essentially open systems, to
which data may be periodically added.
The purpose of maintaining a set of best practices for creating large
databases on historical populations is to:
- articulate the standards necessary to create and maintain
high-quality databases and database documentation. Appropriate funding
must be reserved to achieve these standards.
- ensure that the historical community can trust the results of
research based on these data.
- benefit from previous experience in creating large databases. Thus,
these rules can also be applied to data created by non-academic
organizations such as genealogical societies or the Church of Jesus
Christ of Latter-day Saints.
- ensure databases are of sufficient quality for use by secondary
researchers (researchers outside the main database creation team).
For the sake of clarity and argument and to stimulate further
discussion, this list of best practices is written in the form of a
specific set of rules. However, since every organization bears ultimate
responsibility for its own work, this protocol should be read as a
specific set of recommendations. Practical examples have been supplied in
those instances in which we felt they would help illuminate a particular
recommendation; however to keep this set of rules lean, we added examples
with some prudence.
Finally, we wish to stress that we are conscious that these
recommendations are based on technology and expertise which will evolve
with time. Rather than attempt to forecast future developments, we
recommend that this protocol be revisited and periodically revised in
keeping with new developments and insights. We invite readers to
contribute to periodic revisions of this protocol by forwarding comments
directly to the authors (Kees Mandemakers, kma@iisg.nl and Lisa Dillon, ly.dillon@umontreal.ca) or by
addressing public comments to the H-DEMOG listserv.
Victoria Dmytrieva
Program Support for Content-Analysis: Ukraine project
TACT and TextQuest are wellknown as software for content-analysis. Both
oriented mainly english texts (or in latin) to solve frequencies of words,
form content categories and ets. But working is problemly with texts in
cyrilic of the period before beginning of XX centure (for example,
Lithuanian metrica, even texts of the end of XIX centure). The maine
reason is to adapt program support for text speciffic particularly letters
spelling and writing, the essence of words and phrases that went long ago
out of everyday. The methodic, contents, alhorithm and other questions of
software construction for content-analyse are developed in Dniepropetrovsk
National University at the Department of Historiography and Source Science
(Yuriy Svyatets and Victoria Dmytrieva are project authors).
Besides that the technic realize of program support calls the such
question as choice of program environment for the product. Contemporary
user is habit working with Windows and is disposed to interface and
alhorithm of this software. Therefore we come to Visual Basic as more
appropriate language to solve this task.
The scheme of program working is standard. It includes loading of main
menu, operations with text file (load, edit, save and print), text analyse
(structure of word vocabulary, frequencies and entropies of words, letters
and phrases, form of cathegories, rezults preserve in files and ets.),
help with hypertext links. We foresee connection of program with Microsoft
Word application. The standard schemes of dialogs (when files open, save,
print and ets.) from dinamic libraries of Window\System cataloges are
adopted for the program needs.
This content-analysis program is Windows application. The start up of
it realises from Worktable or Windows Explorer. Program has multi-window
interface that allows to work in each of opened forms in autonomous
regime.
Ian N. Gregory
Integrating space and time into analysis - the Portsmouth
approach
The Portsmouth Geography Department acts as home base to one of the
world's leading centres of excellence in the use of Geographical
Information Systems (GIS) and spatial approaches to analyse of historical
and other cultural data. The department employs eight full time academic
staff with research interests in these areas backed up by dedicated
support staff and a number of research assistants. Areas of technical and
methodological expertise include the development of large spatial and
spatio-temporal databases, surveying, photogrammetry, image processing,
quantitative spatial and spatio-temporal analysis, qualitative analysis
and cartography. These techniques have been applied to research areas as
diverse as archaeology, historical demography, historical geography, local
history and regional development. The department has an excellent track
record in attracting funding for small, medium and large-scale research
projects in these areas. It has links with the key individuals and
organisations involved in GIS and the arts and humanities in the UK,
Europe, North America, Asia and Australia.
This paper outlines work underway in the Department and provides
guidance to scholars interested in integrating and analysing spatial and
temporal aspects of their data.
David J. Bodenhamer
Online data resources and analytical tools
The Polis Center at Indiana University Purdue University Indianapolis
is widely recognized for its expertise in historical and Internet GIS. It
also has an ESRI-certified professional training program with clients
across the United States. Polis has a full-time professional staff of 35,
including 10 staff with GIS expertise. Its strengths are in needs
assessment and requirements specification, system and application design
and implementation, information architecture and database modelling,
Internet data delivery and dynamic mapping, and computer cartography.
Research interests are in distributed systems, semantic and technical
interoperability, and adaptive systems. Polis also has a strong project
management system, with over 200 successful GIS projects at local, state,
national, and international levels. Its projects in humanities and social
science GIS include several large-scale community information systems and
major historical GIS databases served over the Internet.
The paper describes best practice making spatially referenced data
available over the Internet.
Sheila Anderson
Lifelong preservation of digital data - a structured
approach
The Arts and Humanities Data Service (AHDS) is a UK-wide national
service to contribute to the UK arts and humanities digital information
education environment by:
- Preserving arts and humanities digital resources created by higher
education
- Providing rich, deep, access to arts and humanities digital
resources created by and for Higher Education
- Supplying advice and guidance in the creation of digital resources
to quality standards that ensure their suitability for informed use in
research and teaching, and their long-term viability
The AHDS is
a distributed service with five subject centres providing services for
archaeology; history; literature, language, linguistics and textual
studies; performing arts; and visual arts, and a managing Executive at
King's College, London. The AHDS subject centres identify and accession a
wide range of digital resources from across the broad spectrum of the arts
and humanities. These include collections of images and texts, databases,
archaeological field archives, GIS, CAD, VRM, multi-media, and potentially
sound and moving image collections. A range of contextual documentation
and descriptive metadata accompanies these collections. The subject
centres evaluate, validate, structure existing and add additional
metadata, and incorporate the collections into AHDS resource discovery,
delivery and preservation systems. The AHDS Executive provides overall
strategic direction and policy planning.
This paper describes AHDS's approaches to data preservation,
repackaging and reuse and its leading efforts to develop and promote best
practice in these fields.
Paul S. Ell
Strategic datasets for the Broadband age
The Centre for Data Digitisation and Analysis at the Queen's University
of Belfast is a provider of key data resources to the humanities, arts and
social sciences. In essence it converts analogue research materials which
are often difficult for scholars to obtain, or are analytically
challenging and converts the material into digital format using a range of
specialist equipment
Much of the Centre's work has involved the collection of historical
numeric census data for a number of counties converting printed reports
into machine-readable files. The unit also has considerable experience in
the conversion to digital of alpha data and the capturing of images of
objects. The Centre is working with bodies such as the Electronic Cultural
Atlas Initiative and the Arts and Humanities Data Service to digitise
contextual data from around the world.
This paper discusses best practice in data capture, outlines the
centre's work, and reports on research agendas both in terms of
methodological advances in digitisation and the use of large and complex
resources in scholarship.
Adrianne Fielding
Late 19th Century Mortality in New England Mill Towns: Analyzing
Linked Death, Census, and Tax Records
The construction of multiple-source databases through record-linkage is
central to historical studies ranging from European family reconstitutions
to cross-census studies of American occupational mobility. The
construction of linked longitudinal data resources from panel data, e.g.
censuses, in delimited geographic areas confronts methodological concerns
that are profound yet seldom addressed. This paper details both the
methods and findings to date of the Connecticut Valley Historical
Demography Project (CVHDP), which has utilized longitudinal area studies
and interactive, computer-assisted record linkage to examine various
aspects of urban and industrial change that affect the continuing high
levels of mortality and the eventual mortality decline in
nineteenth-century New England.
The CVHDP has examined mortality and social change in the neighboring
and rapidly growing towns of Northampton and Holyoke, Massachusetts, which
differ in their economic and demographic characteristics. Holyoke was a
rapidly growing and industrializing company mill town, while Northampton
experienced more gradual growth and the emergence of a mixed commercial
market economy. To analyze mortality in the two towns, a database for
individual-level mortality analysis was constructed through
computer-assisted record linkage of geographically sampled observations
from decennial censuses from 1850 to 1910 and the deaths recorded within
two years of each census. The CVHDP research design combines the strengths
of a longitudinal panel study with the value of the continuous observation
of mortality. The quasi-experimental research design also supports
comparative studies of mortality in the two differently developing urban
communities across time (Hautaniemi, Swedlund, and Anderton 1999). This
database has facilitated cause of death analyses and GIS analyses of the
mortality impacts of public health interventions in the two emerging urban
centers.
The initial results from these analyses also suggested that wealth
influenced cause-specific mortality, which led us to further expand the
linked database. Until recently, the absence of wealth data in the late
nineteenth-century U.S. federal censuses constrained analyses of wealth
distribution and inequality in the U.S. and confounded longitudinal
analysis. Historical demographers like Steckel (1994), Hautaniemi et al.
(2001), and Ferrie (2001) have turned to tax valuation records as an
alternate and supplemental source of wealth data for this critical period
in U.S. history. Similarly, the CVHDP has also linked tax valuation
records to the database of census and death records. This linked wealth
data allows us to analyze the changing types and inequalities of wealth in
the two communities over time and the effect of wealth on mortality, and
the longitudinal comparison of wealth and inequality allows us to contrast
the effects of the rapid industrial growth in Holyoke with the
Northampton's emergence as a mixed economy market town. The findings
support the influence of wealth on mortality and Steckel's (1994)
suggestion that wealth inequality was uniformly high in nineteenth-century
Massachusetts, and contribute to the emerging hypotheses on changing
inequality in the late nineteenth-century United States.
Anastasia Leonova
Content-analysis of the data collected by NKVD among the German
POWs in 1945-1949
Investigation of the attitudes of the former German servicemen who were
kept in the Soviet POWs camps after the WW2 towards the different aspects
of contemporary political situation was one of the important tasks of the
Soviet secret services agents which were involved into the work with the
POWs.
One of the key problems of the post-war German historiography was the
responsibility for the beginning of the WW2 and the war crimes which
caused the loss of many millions people lives.
The documents discovered in the secret services archives reveal the
discussions held between the top German officers kept in the POWs camps on
this topic. The former officers were divided into those who put
responsibility on the whole German nation which let the national-socialist
party come to power and those who reserved the opinion of the personal
responsibility for the war crimes.
The report describes the contents of the discussions between the POWs
on the problem of the responsibility and the dynamics of the different
opinions in this case.
Boris Grekov
German Politician Walther Rathenau and his Ideas about Russia
1914-1922. (Content analysis of his political thinking.)
The paper focuses on W.Rathenau, President of the AEG, Head of the
supply in raw materials in German industry during World War 1, owner of
AEG factories in Russia, in 1921-1922 minister of economic rehabilitation
and then foreign minister in Wirth's cabinet. An attempt is made to
analyze the evolution of his attitude to Russia from the start of the
First World War up to the Rapallo Treaty (1922). A database of Rathenau's
works was made on the bases his private papers, which were found in the
Moscow "Sonderarchive". Then, with the help of content analysis, a model
was made of his political image of Russia. It was shown, that the policy
towards
Anna Dzhaleva-Chonkova and Vladimir Doulov
Computer-aided possibilities of teaching history and
humanities
Higher School of Transportation, Sofia, Bulgaria The main advantage of
computers especially when they are connected in a network is that they may
display electronic copies of documents, photos, old papers, etc. which is
difficult to have an access to (they are kept in libraries and archives
some of which have a special order for using). Teachers can show
documentary films as well and that will give them a possibility to stop
and look longer at some photos, to repeat some parts, etc.
Using computers in teaching history can present some museums in
electronic version so it will be a great chance as sometimes it is
difficult to visit them (if they are outside the university campus and
even abroad). The virtual museum is a great idea but to develop it
teachers need a collection of brochures, leaflets, books to be scanned. It
is a hard work but the efforts will result in great effect.
Distance teaching and using ??? and H??L technologies makes possible to
turn the teaching process from "communication" in the traditional meaning
of the world into "imprinting" in the sense John Lock put in this word in
preparing knowledge of preliminarily determined parameters.
The hypertext technology creates conditions to overcome one of the
oldest defects of writing and teaching history - "Orwell paradox" - the
effect of rewriting history according to the particular
socio-communication conjuncture. Including historical issues in the
multi-dimentional semantic space that authors interprete by the complex
notion of perceptional and semantic continuum gives a possibility to make
the history knowledge "free" from context overloading and phone
expectations.
Unlike the traditional learning in history when all possible versions
are interpreted through only one dominating structure of saying, the
hypertext distance teaching makes possible the free perception of almost
all versions of the history process according to the computer abilities
and the communicator's competence.
This new communication position overcomes the alienation of knowledge
characteristic for traditional education and sets the trainee in a
completely different condition: he is directly involved in continuum
outlined by the historical issues and he/she is made to "experience" them.
The paper presents the conceptional base of a new "virtual
hermeneutics" as methods background for achieving authentical
understanding of history and other humanitarian subjects and adequate
teaching. Thus it is possible for history knowledge to function as "pure"
history knowledge to a greater degree. The paper is based on the
experience of using computer-aided methods in teaching humanities at the
Higher School of Transport in Sofia, Bulgaria.
Tamara Izmestieva
Syndicate and its Impact on the Development of Russian Coal
Mining Industry. The Late 19th - Early 20th Century. Methods of Analysis
and Results
Russia was provided with great reserves of the two basic materials of
the industrial revolution - coal and iron. Both railway construction and
exploitation of very rich iron deposits in the Krivoi Rog region gave
substantial impetus to the establishment of sizeable coal industry in
Russia. The industry grew rapidly from the 1890s at the expense of the
emergence of some large-scale enterprises.
When crises struck in the early twenties century, many of them were
faced with difficulties. The current selling price of coal barely covered
the costs of production. Many firms produced losses and only a handful of
firms offered dividends to shareholders. In this condition, competition
became destructive. What was way out?
Coal producers tried to solve coal' problems by formation of a coal
syndicate. The syndicate (in Russia it was known by the name of Produgol')
began to operate in February 1906. How did syndicate affect the
development of coal industry?
Produgol' acted as a sales agent for its members. Because the syndicate
did have the right to assign sales quotas and prices to its contracting
firms, it was able to reduce market fluctuations and affect prices on coal
market. So, the syndicate created favorable conditions for both survival
and development of coal mining enterprises. How did members of Produgol'
exploit these advantages?
Naturally, all firms were similar in its attempts to get good profit,
but approach to an allocation of profits was different. Some firms used
profits to pay dividends while others plowed back profit to improve their
production. Some of the latter firms were able to reduce average costs at
larger levels of output.
In this paper we consider a special group of those enterprises, which
potentially may use increasing economies of scale. The group to be
analysed is notable for many reasons, including a good correlation between
interests of each of its members and interests of the coal industry as a
whole.
Our analysis is based on decomposition of total cost into fixed cost
and variable cost. It is not easy to determine what is a fixed cost and
what is a variable cost, because of problems in accounting practice. One
sensible approach is to apply an approximate method which makes possible
estimating two components of total costs. Decomposition of total cost
helps us to formulate a cost equation which reflects potential production
capability of the enterprise.
When investigating Produgol' and its impact on the development of
Russian coal mining industry, we introduce in our analysis two external
factors - prices and production limits, which were under control of the
syndicate. This method of attack serves to clarify the subject under
consideration.
Framework of our research includes some steps (calculations of
elasticity of cost with respect to output, estimation of costs' structure,
formulation of equations, and so on). Although each step of analysis is
rather simple, this procedure as a whole is cumbrous and tedious. So, we
created in Excel special template, which make possible to perform these
operations automatically.
Methods of our analysis in detail, some results of computation and
comments will be presented on the conference.
Michael Shovman
A Seminar on Record Linkage Solution Framework
In the past couple of years our team has been developing several Record
Linkage projects. As a result, we had created a RL solution framework that
could be adjusted and applied to a wide range of RL processes. We think
that as such, our framework may be of interest to the developers of fine
RL tools, as well as to the historians doing RL work. Therefore, we offer
to organize at the upcoming XVth AHC conference a Seminar on Cooperation
in RL Systems Creation. In this seminar, we would discuss the following
topics:
- General RL process classification:
- Desirable multiplicity vs. undesirable one.
- "Historian", "statistician" and compound cases.
- One-time vs. continuous operations.
- RL Lab and RL Factory cases, etc.
- Basic construction units of RL applications
- Human-Machine Interface.
- DB structure and thesauri.
- Decision-making and saving algorithms.
- Specific RL tools.
- Calibration tools.
We are ready to make a presentation covering our vision of the first
two topics, followed by a discussion on the third and fourth ones.
Garry Keyes & Jens Toftgaard Jensen
Mapping Urban History GIS in Spatial Analyses of a
Multi-Source Database
The paper presents a method of reconstructing a historical city map and
examines its usefulness in data representation, analysis and
visualisation. The core data-set for the GIS-model is a nominally linked
multi-source database, which describes people, place and property in early
19th century Aarhus, Denmark. As no historical land register map of the
city existed for this time period the usage of GIS required the
construction of a credible map, based on the geographical information and
structure inherent to the textual sources. The resulting digital map
enables spatial analysis to uncover patterns in urban social and economic
structures.
Lars Göran Carlsson
Record Linkage on Swedish Parish Registers - Some Methodological
Aspects
The Demographic Data Base in Umeå, Sweden provides historical data from
parish registers and parish statistics for researchers from both Sweden
and other countries. The primary materials computerized by the Demographic
Data Base consist of entries on individuals in 19th-century parish
registers. A distinguishing characteristic for the material is its wealth
of detail and high quality.
In order to carry out demographic analyses at individual and group
levels, a method of linking individual records from parish registers has
been developed. This paper discusses the DDB method for record linkage.
Main outlines of the method is the combination of computerized record
linkage and computer aided record linkage where the majority of the
linking is performed by a computer program while remaining record are
linked manually with computer aid.
Some general rules for computerized record linkage used by DDB are;
- Well defined algorithms and rules are used to decide if a link will
be established.
- All individuals is given a unique identity number.
- For records impossible to link to any other record a new individual
will be "created" with only one record.
- Matching is done between pairs of records or between groups of
records, i.e. pages in the source.
In the computerized record linkage exact matches and clear mis-matches
are processed automatically using well-tried algorithms for searching and
matching , the program is able to match and link records without having a
unique identifier for the individuals.
Timur Valetov, Moscow State University
Wage Inequality Simulation (on the materials of Russian
industries, 1890-1910s)
The problem of income distributions and wage inequalities during
different historical periods is one of interest and important aspects of
social history. Wage inequality is the theme of researches by a lot of
historians. The beginning of these researches is related to S.Kuznets, who
suggested that income inequality dynamics for industrialization period is
inverted-U curve ("Kuznets curve"). J.Williamson and P.Lindert did also
research this problem on the base of a lot of materials on the USA and
Great Britain and they actually confirmed the Kuznets's hypothesis, but
the latest researchers of social history of different countries sometimes
made another conclusions on the basis of their data. It is necessary to
tell, that the question is generally not easy to study because of
difficulties in collecting the authentic sources. For example, for Russia
during the industrialization (1890s - 1910s) dynamics of workers' wage
inequality remains completely unknown. Besides the dynamic of wage
inequality itself, the problem of explanation of inequality growth or
reduction is possibly even more important. This task is also still
insufficiently investigated, mostly because of lack of available data.
Maybe a mathematical stimulation modeling of wage inequality could help
to understand the mechanisms inequality dynamics. In the current report I
attempt to make such computerized modeling at a microlevel, without taking
into account the changes having place in the national economy during
industrial development.
After researching on wages of workers on the basis of contemporary
records from one of the textile factories of Moscow province I managed to
collect the database where annual rates of all the workers-engravers'
wages of the factory during 20 years (1890s-1910s, about 400 people) are
recorded. This database allows to trace some factors which defined the
wage rates of the workers. The most important factors are workers' age,
experience and qualification levels. The wage rates were usually increased
from time to time both for skilled an unskilled workers, but the wages for
unskilled were limited by a low wage maximum and the unskilled workers
often left the factory, so their wage could not reach even this maximum.
This is why the average wage of unskilled workers was lower than of the
skilled while the annual wage increases were almost the similar.
This idea is the basis of some modifications of the simulation. The
model consists of 10 differential equations. At the first step of the
model the simulated-factory hires the workers of different qualification
levels and each of the levels is characterized by its own parameters,
similar to those got from the historic source. These are the wage minimum
and the possible wage maximum, the workers' possibilities to leave the
factory or to increase the wage rate, possible size of the increase etc.
The model calculates the main wage inequality coefficients dynamics (i.e.
Gini coefficient, the percentage of common wage got by 10% of well-paid
workers etc.)
The main result of the model is the following. During the first steps
all the dynamics increase (because the first step simulates impossible
conditions of the perfect equality), then become stable. It is interesting
and important that these stable values of the coefficients and the time
the system becomes stable are almost independent on probability
fluctuations included into the model and they are little dependent on the
values of the model constants. I guess this shows that it is impossible to
explain serious changes in wage inequality just on the basis of natural
processes of wage growth or reduction important for all the workers taken
separately.
George M. Welling
New points of view on Price History, based on analysis of the
Paalgeld Portbooks, 1771-1778
In the early thirties of the last century an international group of
economic historians gathered to set standards for the writing of national
price-histories of the participating countries. The Second World War made
contacts between the historians involved very difficult and most of them
decided to continue their work based on the agreements made earlier. Since
then these price histories have been the solid foundations for various
discussions, a.o. for the standard of living debate. A crucial aspect of
the price-histories is the choice of goods to create a price index.
Because very little large scale research had been done about the
composition of the total of goods going around, the choice was mainly
based on pre-conceptions. In an earlier publication I have shown that this
choice was especially unfortunate for the Dutch historian Posthumus,
because his sources were far from perfect. But based on my recent research
on the imports of Amsterdam at the end of the 18th century, I will argue
that the composition of the group of goods for the indexes does not
reflect the realities of the trade. My statistical analysis will show that
a different set of goods as a basis for indexes will allow a much more
reliable view of the economical fluctuations.
Jane E. Dabel
Census Database of the African-American Population in
Nineteenth-Century New York City
My research focuses on the unique role that African-American women
played in nineteenth-century New York City. One of the most significant
tools for my project is a database that contains profiles of over 50,000
African Americans residing in Manhattan. I have collected this information
from the original United States Manuscript Census from New York City for
1850, 1860, 1870, and 1880. This database provides a breadth of
information about free blacks in general and black women in particular
with specific information about women's employment cycles, the role of
women in black family structure, female migration patterns, and the
finances of black women. This is such an important demographic source
because it furnishes an unprecedented amount of data about black women who
often did not leave written records. After examining my database for
information integral to my manuscript, I intend to publish my database on
a searchable CD-ROM for use by historians, independent scholars,
genealogists, and teachers.
A presentation at the XVth Conference on the Association for History
and Computing at the Norwegian Historical Data Centre will provide me with
the opportunity to share the information in my valuable database. It will
also allow me to discuss strategies for examining such sizable amounts of
information.
Olga Porshneva
The Mentality of the Russian People: The 1917 Revolution in the
Focus of Mass Consciousness: a Multi-Dimensional Statistical Analysis of
the Letters Posted by Workers, Peasants and Soldiers to the Central Organs
of the Soviets of Workers' and Soldiers' Deputies
The goal of the paper is to reconstruct the content and structure of
the mass consciousness of workers, peasants and soldiers during the 1917
Revolution, their major economic, social and political preferences, ideals
and values, and also the mechanism of the transformation of their mental
orientations.
The paper under consideration presents an original source based on a
great number of letters posted by workers, soldiers and peasants to the
central organs of the Soviets of Workers' and Soldiers' Deputies and gives
an opportunity to study the problems posed by the author. The
content-analysis of all the letters taken from archive (GARF) files can be
regarded as a quantitative analysis done by the method of random
selection. I made a content-analysis of the correspondence by singling out
the meaningful units - indicators, expressed by the authors of the letters
in the form of statements (views) on social, political, economic and
cultural problems. 238 meaningful units-indicators - have been singled
out. I have also elucidated the interconnections of the indicators in each
source of information and their total number, the system of relationships
of each indicator with the others. The number of categories - combined
indicators, greater meaningful units, was 64. The system of their
interrelationships was also investigated. As a result, I have created the
structure of four relatively stable layers, "blocks" of social
consciousness (traditionalistic, revolutionary-defensive, radically
anti-war, socialist). I have studied the nature and the level of
overlapping of these "blocks".
Chronologically the correspondence was divided into three groups:
spring, summer and autumn, which reflects the presence of qualitative
peculiarities in the process of the revolution at each of this stages. The
distribution of the meaningful categories in the aforementioned periods
visually demonstrates the evolution of the priorities and the content of
mass consciousness throughout 1917.
The method used in the quantitative analysis of the interconnections of
the meaningful categories is based on calculating the coefficient of
correlation. An analysis of the distribution of the totality of meaningful
categories was made on the X2 criterion using STATGRAPHICS. The
sum total of meaningful categories was distributed according to a normal
law which accounted for the use of the coefficient of linear correlation
as a quantitative characteristic of the interrelationships of meaningful
categories.
I have also investigated the mass consciousness and system of mental
priorities of each analysed social group. I have defined the character of
typical verbal reactions in mass consciousness and social psychology
pertaining to different social groups of people and dominant models of
solving current social issues of that time.
The spring-autumn period witnessed the evolution of mass consciousness
of the "lower" classes, the modification of its content in the direction
of the growing radicalisation, mobilisation of communally-equal in origin
moral orientations of social justice and equality. Mass consciousness of
the people was fairly contradictory and multi-layered. The obvious
prevalence in mass consciousness of the two "blocks" -
revolutionary-defensive and radically anti-war - testified to the
dominance at that period of one of the basic archetypes of people's
mentality - the rebellious one. Over a third of the correspondents
(intentionally or unintentionally) expressed their support for the
traditional values and championed them. Part of them adhered to the old
monarchic rule; others, while remaining loyal to firm power, strove to
establish it in a new, revolutionary aspect. The choice of the priority
themes for their confessions, demands, complaints which included the
problems of war and peace, agrarian reformations, establishment of
people's power, social justice, and the stereotyped variants of how to
solve them, - demonstrate the lower classes' basic concepts of justice in
an orderly and just social system.
Vladimir Pereverten
An approach of presentation of interdisciplinary lecture
simultaneously for two categories of students.
The author has been developing the special course "Computer information
technologies for historical research".
In this paper an approach of presentation of content of the
didactically complex interdisciplinary lecture of this course
simultaneously for two different categories of students: historians and
"computer scientists" is discussed. The subject of the lecture is
"Organization of information for historical research".
The approach is based on a suggestion that historians are more
predisposed to verbal-imaginative perception of information but "computer
scientists" - to symbol- imaginative and uses the new educational
technologies. Our approach consists in making the lecture-presentation on
given subject with the use of such possibilities of new educational
technologies as multimedia and multiscreen demonstration.
The most important advantages of multimedia for our lecture are the
hyperlinks which allow to change the scenario of reading the lecture in
accordance with reaction of students and animation with sound
accompaniment allowing graphically present the dynamic aspects of concepts
and definitions.
Using multiscreen demonstration, we can present the same information
object in several forms simultaneously. In our lecture-presentation these
are ordinary text, graphics with animation and sound effects, mathematical
text, which we will accordingly conditionally name: "word", "image",
"symbol".
"Word" is a text without special symbolic designations. "Symbol" is a
text with special symbolic designations and expressions, which requires
for understanding special (in our case, mathematical) preliminary
education. The destination of "image" consists in providing the clarity in
presentation of information both for "word", and for "symbol". In our
lecture graphs are used as "images".
The presentation of information in the form of "word"+"image" is
intended for students-historians and in the form of "symbol"+"image" - for
students-"computer scientists".
It is obvious that realization of considered educational approach
requires the special facilities providing multimedia and multiscreen
presentation of information.
The lecture-presentation has been made in two versions: for traditional
presentation directly before listeners and for remote presentation by
Internet.
Nina V. Piotukh, A. Frolov
Electronic Historical Atlas of Derevskaya Pyatina, Great Novgorod
Region, XV-XVIIIth c.
The aim of the project is to create an electronic historical atlas of
Derevskaya pyatina 1) since the end of XVth till the
second half of the XVIIIth c. The main task is to put on the
electronic map all the settlements, which are possible to localize using
cadastral surveys. This task is conditioned by the fact that all the
XVth-XVIIIth c. survey's demographic, social and
economic characteristics describe separate settlement, not administrative
unit. Derevskaya Pyatina's GIS is regarded as a basement for the further
spatial-statistical analysis of the North-Western region of Russia to
understand its socio-economic development between XV-XVIIIth
c., for clarifying historical geography and for stimulation of
archaeological investigations of medieval Novgorod's region. The developed
technique of settlements localization may be regarded as a basement for
the further creation of electronic historical maps for other regions of
Russia.
Databases modelling information about settlements and population in
1770th and 1495/96 were created by now. Settlements of the southern part
of Derevskaya Pyatina were localised which are more than 4,5 thousands for
1770th and about 1200 for 1495/96. Series of maps presenting
distribution of settlement's types and population were made using
databases.
_______________________ 1) "Pyatina" is the
term for one of the five districts which Great Novgorod region consisted
of in XVth c.
Norbert Winnige
TASC B (Trans-national Database and Atlas of Saints' Cults.) A
GIS for Dedications of Saints in Europe
The TASC-project (Trans-national Database and Atlas of Saints' Cults)
aims at establishing an inventory of religious devotion in Europe. Though
evidence of cults is produced from all kinds of sources, the project
focuses on dedications of churches and chapels. TASC is a joint,
interdisciplinary enterprise of researchers from all over Europe and
beyond, initiated by Graham Jones (University of Leicester). Designed to
permanently attract more collaborators it will always be work-in-progress.
Collaborators are free to choose any software to build up local
datasets according to their needs. In a model database run at the
Max-Planck-Institute for History, Göttingen examples of data modelling are
developed. This presupposes establishing a set of core data, which will
allow for a standardized presentation of data, and developing common tools
for identification and encoding of data. Moreover, an encoded database of
saints is set up based on the Acta Sanctorum (AS), saints canonized and
venerated after the 17th century are added to it according to the
standards of the AS. Further tools needed are gazetteers of place-names.
Starting with lists of contemporary names based on data from ordnance
surveys this collection will grow constantly. Up to now data models are
more or less relational, in future more sophisticated data structures will
be included.
The data on saints and veneration are presented and analyzed using
geographical information systems (GISs). A GIS will be the main user
interface to the data. In order to set up this interface commercial
software (ESRI's ArcIMS 4.0) is used for reasons of availability. Yet,
offering alternative solutions, which consist in open source software GISs
free of charge, is considered.
Ingo H. Kropac
'Historical Informatics and Documentation': Positioning, Contents
and Integration in Academic Course Schemes"
Abstract: The area of "History and Computing" is treated by
several disciplines, which can be found under different denominations and
which contains different topics and approaches. What are the central
contents of these subjects, where are the delimitations and the
coherencies to other disciplines? After more than fifteen years a
repositioning of this subject - at Graz university it is called
'Historical Informatics and Documentation' - seems to be needful. This
paper will reflect upon possible definitions of the subject, its
scientific basics, its scope of duties and the role of people concerning
themselves about formal procedures and IT-application within history. The
institutional setup, especially in academic curricula, should complete the
review.
To this effect the paper intends to resume the discussion on the
positioning of a subject located between applied computer science,
information science and the various historically oriented studies
themselves. For this purpose the way we followed at Graz to establish the
discipline institutionally and to build curricula for several course
schemes will also be presented.
________________________________
Workshop: History & Computing - quo vadis?
Proposal: What does "History and Computing" really mean at
the beginning of the 21st century? Is it just "using IT in
historical research and education, and the related methodology"
(conference announcement) or has H & C been developed to a new
discipline within the humanities - either in the group of the historical
studies or in the more formal and younger subjects like computational
linguistics or computational philology?
To answer these questions the Austrian branch of the AHC proposes a
substantial and international discussion about the notion, the contents
and the institutional classification of this subject. The AHC seems to be
the best and only frame to provide for the potential success of this
attempt. This workshop should initiate the discussion and should give an
overview about the different points of view, represented by the members of
the AHC. For that reason we propose a forum, where the representatives of
the national branches - or delegates of them - meet to compare notes and
to discuss the further procedures.
Valeriy Kanishev and Roman B Konchakov
The Databases for study the Social-Demographic History of
Russia(in XIX B beg. XX c.) on microlevel: formation, record linkage, make
GIS, problem analysis.
The given statement is devoted to some results of long-term work on
creation and processing of the electronic social - demographic databases
based on initial sources of the account of the population. One of the
first problems in the researches connected to definition of changes of
demographic shape of separate persons and families, appeared
variographical (a different spelling of personal data same people).
Algorithms of the phonetic analysis of Russian nominative data (people
names , names of settlements etc.) are applied to overcoming this problem
and linkage of records about one person.
It is known, that the basic problem at work with sources with a low
degree of structure is construction of model of a database which optimum
would reproduce the data of a source.
At a difficult and non-formalized kind of representation of the
information in a source usually determine the general attributes and
standardize the data on a course of work.
For example, breaking a line " In 1881 has graduated a faculty of law
of the Moscow university" on fields "education", "years of graduation ",
"place of studying", "Speciality" etc. This problem has certain limit of
complexity behind which reduction of the data of a source standard
variants results in loss of the information because the form of the
message of those or other data may be the helpful information and may not
be described in model of a database.
In our work we came to a conclusion about high efficiency of creation
focused on a source databases the model which reflects not formal, but
logic structure of a source.
Separate question is research of processes (for example, social
mobility) the dynamics which is reflected in the mass-sources different by
the form and contents .
These sources are difficult for correlating in view of the different
form of representation of the information
For example, the property of the person may be appreciated in one case
as the sum of money, and in the other case the real estate is simply
listed. In the form, thus, sources are difficultly combined, but on sense
contain the same data.
The model of the data should be focused not only on the description of
a classical parity object - attribute, but also on the description of
properties of attributes. For example, in a database on studying social
mobility in Tambov (XIX-the beginning XX cnt.) "property" is not only
attribute of object "owner", but also itself has attribute "type"
accepting values: "many", "real estate" etc. "Property" may accept
different values on the attribute and be described accordingly by the sum
of roubles, the description and quantity of constructions etc.
Using "sub-attributes" the model of a database provides not only ample
opportunities of scaling but also to provide effective mechanisms of
linkage of records.
For visual representation of historical-demographic,
historical-geographic, historical-ecological processes use connection
databases with geographical data (electronic maps) by GIS.
In a result created by us GIS, containing various demographical and
ecological information on the Tambov's region for the period of 17-20
cnt., which allows to present development of region in the variable time
(historical) and spatial kind.
Sofia Salomatina
Bargain or Reputation: The Cluster Analysis of Discount in
Russian Banks, 1864-1894
The subject of the research is the practice of crediting of business
deals in Russian joint-stock banks during their early development from
1864 to 1894. It was a period of severe crisis in the second half of the
1870s and deep depression until the 1890s.
Joint-stock banks were credit institutions designed mainly for
short-term crediting of businessmen. In the second half of 19th century
the main credit operation of these Russian banks were discount of the bill
of exchange. In theory it was a credit without any collateral basing on
debtor's business reputation and, importantly, on quality of the bargain
caused the bill. Inasmuch as banking theory consider the discount as a
tool just for bargain on credit, bank's discount customers must be mainly
active businessmen.
But in 1885 economist Petr Gambarov published a paper asserting that
any wealthy per-son, even not a businessman, could discount a bill of
exchange in Russian joint-stock bank if he had guarantors reliable to
bank's manager. If this point of view was true, Russian banks were the
institutions crediting local elite without sufficient guarantee, because
if the subject of crediting was not a short-term bargain, the customer had
to pawn valuable. The successful short-term bargain was one of the
conclusive factor of repayment of a short-term credit.
Petr Gambarov doubted that these short-term bargains were an essential
part of Russian discount. His appraisal must be verified using
quantitative approach. For that it's necessary to reveal some quantitative
traces of bargains underling discount.
This task requires non-standard approaches, because the bill of
exchange was a formal and abstract promissory note containing no
information on underling bargain. Moreover, there was no immediate data
containing information on the subject of our interest. There is only a set
of quantitative materials on Russian discount in 1864-1894: total sum of
monthly discount in every joint-stock bank, derived from balance sheets
that banks had to provide Ministry of Finance and lay open to public
monthly.
Possible information whether banks discounted the bills of businessmen
have to be ex-tract from disposable data using multivariate analysis. In
that case the general indicator of bargains' crediting might be the
seasonal factor, which would be substantial if demand for credit conformed
to any yearly business cycle. But only third of about 50 of Russian
joint-stock banks have evident seasonal discount. The rest of the banks
needs the other characteris-tics estimating "the quality" of discount, and
especially confirming or, on the contrary, deny-ing that banks' customers
were diversified. But all quantitative indicators of discount might be
derived from two disposal time series from 1864 to 1894: 1) monthly total
discount and 2) monthly total asserts. In additional to seasonal factor
these indicators are available for each of the banks:
- mean of total asserts for the 1880s - 1894;
- number of branches in 1881 (no other data is available);
- discount to asserts ratio for 3 major time period (initial -
1864-1875; crisis - 1876-1880; depressed - 1881-1894);
- average annual increase of discount (mean for 3 above-mentioned
periods).
As the next step banks were ranged on these indicators using cluster
analysis (distance measure by method '1-Pearson r', linkage rule -
weighted pair-group average). The banks in revealed clusters differ by
discount characteristics.
This analysis partly disproves Gambarov's appraisal. It reveals that
considerable part of Russian banks were crediting short-term bargains
during a very long period of negative eco-nomic situation. Let's review
these results.
There were 47 joint-stock banks in Russia from 1864 to 1894. 18 from
them had evident seasonal discount and therefore most likely credited
bargains. 7 banks bankrupted until the 1880s and were excluded from the
research because they operated approximately 5 years on average and had no
time to develop individual business strategy.
Discount operations in 6 banks were 'credits on reputation' as it was
described by Petr Gambarov. It can be concluded from the following
factors: these banks were very small and had almost no other credit
operations except for the discount, hence they had no diversified
customers. Moreover, time series of discount operations of these banks
contain no traces of either seasonal factor or any other economic cycles
(discount trend does not correspond to the general discount market
conjuncture).
16 largest Russian banks had off-seasonal diversified discount
portfolios; 9 of them were gradually taking their funds out of discount
operations, 7 of them had obvious problems with discount management. But
nobody was able to get stable growth of discount operations and all large
banks were strongly depending on extremely negative general economic
situa-tion. Scanty market conditions and elitist and closed character of
Russian banking didn't im-ply the loss of economic basis of discount.
Svetlana Ashmarina
Dynamics of social insurance and traumatism indicators at Russian
industry in 1892-1907.
The issue of social insurance for workers in Russia in late XIX - early
XX centuries has not been adequately explored. This issue comprises two
main components, namely institutional aspect - organization of industrial
worker's insurance in tsarist's Russia to include development of
legislation (and adoption of fundamental insurance laws), as well as
practical aspect - implementation of legislative regulations. The practice
of social insurance, reflected in statistical sources, remains poorly
studied. Social insurance was applied to workers, employed in industries,
posing most risks of personal injury, e.g. in metallurgy.
On the brink of centuries (late XIX - early XX) the Ural region was the
centre of metallurgical industry, but it bit by bit yielded the palm to
the young Southern region of the Russian Empire. Metal mining and
processing enterprises and over 50% of workers were concentrated there.
The Ural was one of the first region, where social legislation was applied
to both state and privately-owned workers. Due to it this area is of
special interest in relation to studying the development of insurance
practices. Inadequately researched social insurance at privately-owned
enterprises of the Ural region is also worth attention. Published data and
statistical materiel of the Ural archives on personal injury levels and
composition as well as data on insurance payments for accidents in
metallurgical enterprises of the Urals and other Russian regions in 1892 -
1907 allowed one to trace the dynamics of trauma and social support in
case of accidents. The report provides analysis results, obtained by means
of statistical soft ware «Statistica».
George Vascik
Sophisticates or Neophytes: Charting the Campaign Behavior of
Candidates in Imperial Germany Using Historical GIS
In her recent monograph on Imperial German elections, Margaret Anderson
observed that German voters became more active and discerning participants
across 50 years of electoral experience. In this paper, I will demonstrate
that politicians became more sophisticated campaigners. Using a digital
electoral atlas that I have created for the Prussian province of Hanover,
I will examine candidate behavior in one constituency (the 19th
Hanoverian electoral district) across a series of five elections between
1893 and 1912. I can use the GIS to chart the timing and coordination of
rallies, attempts by party activists to disrupt other party's rallies,
leafleting and petition campaigns. Analyzing these factors in conjunction
with relevant income, occupational and confessional data, I can
demonstrate that parties and candidates became more sophisticated and
nuanced in their appeals, while developing complex campaign strategies to
achieve their ends.
Yuriy Svyatets
Fluctus numerare: Difficulties in agreement of aggregate
statistics with changeable programme
Research of social and economical processes in Ukraine agriculture
during the new economical policy (1921 - 1927) needs to analyse
statistical information. The Central Statistical Department of Ukraine
organized and headed every year sample (10% of total) spring and autumn
censuses of farms. Unfortunately primary sources (questionnaires) are not
preserved. Materials of the censuses are published as aggregated data.
GIS is created with the research object. Changeable of the program with
published data is the main problem of creation. Territorial and
administrative reform continued during the period of new economical
policy. Therefore quantity and borders of administrative units had been
changed a few times. The Ukraine territory included 53 districts in 1923
year, 49 districts and Moldavan Autonomy in the spring of 1925 year, 41
districts in the spring of 1926 year. Farms of each of the districts are
groupped according to size of sowing squares. Quantity and sizes of sowing
squares intervals are already changeable in published materials. Quantity
and list of variables that describe farm groups are not stable.
The aim of paper is to discuss main principles of GIS constructuring
and analysis of aggregated censuses data with changeable program.
Lotta Vikström
Mastering and Measuring Historical Data: The Possibilities and
Obstacles to Explore Individuals' Life-Courses Using Statistical Analyses
on Swedish Parish Registers in the Case of Migrants to Late
Nineteenth-Century Sundsvall, Sweden
Using advanced statistical analyses has enabled scholars to better
grasp demographic change and its impact on both the individual and
societal scale. The last few decades, historical demographers have
increasingly turned to issues concerning how to statistically deal with
demographic changes over lifetime. However, applying statistical methods
on historical data is a complicated process especially for historians who
often lack experience of such methodology.
This presentation addresses the possibilities and obstacles to explore
individual migrants' dynamic life-courses by using multiple regression
models. It shows how longitudinal data formed by Swedish parish registers
meet the demand for applying such models. However, the presentation also
highlights some difficulties linked to the application of multiple
regression models on historical data and suggests methods to help overcome
them.
Two types of regression models illustrate the issues addressed above.
The first model measures the duration of residence of migrants in the town
of Sundsvall and the other one concerns social mobility in times
characterized by rapid industrialization. Demographic characteristics such
as gender, age, geographical background and social and marital status are
analyzed to examine the impact of these covariates on the two events of
out-migration and upward social mobility. The data is drawn from
computerized parish registers stored at the Demographic Data Base (DDB) at
Umeå University, Sweden.
Valeriy V. Kanishev
The project "Micromigrations processes in the Russian Eurasia of
the second half XIX - the beginnings XX cnt. (the Tambov region - the
Altay region)" : The preliminary results.
Among researches of migrations from the European Russia to Siberia in
second half XIX - the beginning XX ct. there are no the works studying
this process at a microlevel on the basis of the initial data and
connection with the help of the computer of the information from different
sources. Such approach allows to find out motives of resettlement and its
consequence for concrete families and people.
The majority of initial documents on a theme was postponed in the Altay
archive - hundreds archival documents. In the Tambov's archive there are
tens documents about families of immigrants, the most part from which is
secondary, which arose in connection inquiries of the Altay's authorities
to Tambov about necessity of legal fastening the resettlements. It speaks
about prevalence of autocratic resettlements. It were roundabout means of
often negative decisions of the Tambov authorities under requests for
resettlement. But in sometimes in Tambov's documents the interdictions
were substantiated by authorities of a financial inconsistency of
immigrants. Following by Altay's data's 1888-1892, about half of migrants
from Tambov's region could not get an independent facilities and send in
farm laborers. But results of resettlements (in 1880th came back ~ 3 % of
the Tambov's peasants) speak that the poor peasants chose difficult, but
real perspective of improvement of the position on Altai instead of gloomy
poverty on the native land where the accrued agrarian overpopulation.
Demographic and economic positions tambov's people on Altai region it
is at present investigated on census sheets by agricultural census(1917
y.) of villages Podojnikovo and Krutiha of Barnaul district., where lived
~1200 immigrants. On a parameter of birth rate (26 o/oo) the Altay's
village is near to Tambov (in 1916 birth rate around 30 o/oo). Death rate
of the Tambov's peasants on Altai was much lower than their fellow
countrymen - 15 o/oo against 25-27 o/oo. Very low on Altai in comparison
with ??????????? a parameter of infants mortality rate (less than 2
o/oo against 20o/oo and more). The natural increase on new places at the
Tambov's peasants appeared the high - 10 o/oo against 5 o/oo in the Tambov
villages.. The data for one year on 2-nd Altay's villages in comparison
with 3-4 Tambov's villages do not give the bases for wide conclusions. .
We assumption , that the high natural increase on Altai is connected to
absence "race" rates of mortality behind birth rate, characteristic for
the overpopulated territories.
Comparisons of economic parameters of 1917 on villages Podojnikovo and
Krutiha with average indices on Tambov's region are interesting. A new
peasant sufficed quantity the plot of land on Altai's region.
In Tambov's reg. in 1910-years took in rent the ground of 27 % of
peasants. In using immigrants was on the average on 8 desjatines (1
desjatina = around 1 hectare.) on one household. In Tambov provinces - 6.3
desjatines. As a whole on Altai the Tambov's peasants have received the
grounds not more, but means for rent here were not spent. Census of 1917
year has fixed in household of immigrants with Podojnikovo and Krutiha it
is significant the big quantity cattle animals, than in Tambov's
household. It speaks presence Altai of the big fodder resources, than in
the Tambov's province. There to beginning XX.cnt. in view of wide
ploughing the meadows and haymakings area was sharply reduced.
By calculations of Tambov's historian V.L.D'jachkov, in 1897 in the
provinces like Tambov, the males dominated only at senile age, younger
were in a zone "women's kingdom". In such situation the need(requirement)
for resettlement from old agrarian regions grew in connection with a
problem of distribution in marriage superfluous brides. Feature of sexual
structure of villages population of Siberia was the numerical superiority
of the male's population. By the end of 1890th on the each 100 males were
98 females. It is difficult to explain, how peasants of the European's
Russia have guessed this situation, but at the end of XIX - the beginning
XX cnt. the part of women among immigrants to Altai began to grow. During
1889-1896 among immigrants in the age of 10-20 years the girls were 62 %.
Among immigrants, begin to live in "old based" settlements, at age till 14
years the percent of boys was much lower, than girls. Probably, in the new
environment, where males were enough, the chance of brides to marriage was
more. Thus, peasants of agrarian regions of the central Russia moved to
Siberia not only behind the ground, but also for grooms for the brides.
However, it was normal for a traditional society with are practically
overall the marital of population.
So already the first results of the Tambov's and Barnaul's project have
particularly and obviously shown efficiency of resettlement to Altai
region as to a way of overcoming of system crisis of an agrarian society
of overpopulation provinces of the European Russia. About hundred families
of the Tambov peasants could have more prosperous household than the
staying peasants in native territories, than at the staying in native
territories and also to solve the demographic problems.
Inez Egerbladh / Sören Edvinsson
The Swedish Parish Statistics "Tabellverket" 1749-1859.
Advantages and disadvantages for research of computerized long-term
statistics.
Tabellverket is the designation of the earliest systematized population
censuses in Sweden for the period 1749-1859. It consists of two
statistical serials focussing different demographical aspects, the size
and composition of the population with certain time intervals - mostly
five years - in Population forms respectively demographical events yearly
in Mortality forms. For the first time all this unique statistics is
accessible at one place by the Demographic Data Base.
This source enables studies at the lowest administrative level (the
parish) on population development and social structure, fertility,
illegitimacy, mortality, nuptiality and migration, all from several
aspects. Digitisation of this source strengthens the research potential.
It allows for creation of new variables in order both to detect
miscalculations by the clergyman and to facilitate comparisons within and
between different form editions of statistics. It enables validation of
data, substitution and estimation of missing data.
The general problems in analysing long term statistics refer to changes
in adminstrative areas over time, loss of data certain years or time
periods, damages in the sources as well as changes in the character of
information about events and the population. Digitisation involve
additional complications as for interpretation and analysis of data,
especially when made with the DDB demands for a literal reproduction. For
instance, the changing structure and meaning of information in the sources
(e.g. occupations and causes of deaths) complicate the database structure
and coding. Usage of fixed prepared inregistration formats adapted to the
tables in the sources restricted the possibilities to consider all kinds
of additional or deviate information by the clergyman.
Nanna Floor Claussen
Widowhood in Denmark 1801 An analysis of the living
conditions of a specific social group - based on the complete 1801
census.
The complete 1801 census database presents new demographic research
possibilities. This paper demonstrates the use of this large database in
working with a complete social group. The use of databases and computers
is essential for analysing the data. Especially when working with specific
social groups the availability of a complete digitised census is
essential.
The paper analyses the living conditions of widows and widowers. This
social group is easily identified uniquely in the census records and
consists of persons that often had difficult living conditions. The main
areas covered will be analyses of the size of the households, and whether
widowhood meant that a person was living alone or within a family. The
investigation will also include an analysis of the situation in different
regions of Denmark: Copenhagen, the cities and the rural areas.
Furthermore, it will be studied if and how differences exist between the
two sexes. It is only possible to make this analysis due to enrichment
made on the computerised data, as this information is not in the original
nominal lists. The paper will analyse the age of the widowed persons and
try to establish whether and how age influences the size of the
households.
During the last year great effort has been done to add HISCO codes to
the 1801 census and this work will be used to analyse the occupations for
this group.
It is the intention that this paper will demonstrate how it is possible
to make both a detailed description and to make general statistics in
order to describe the living conditions of a selected group. A great deal
of these analyses is only possible due to the enrichment of the material
and the completeness of the data.
The paper will also discuss the methods used for producing the
statistics and analyses.
Zlobine Evgueni
Moscow secret police department (Okhranka) undercover agents
records - the problems of database creation.
The Department for the Defence of Public Security and Order in Moscow
(the Moscow Okhrana or Okhranka) was the most important body of political
investigation of Russia at the end of XIXth - the beginning of
XXth century. It was created on November, 1, 1880 by the order
of Minister of Internal Affairs count M.T.Loris-Melikov. The Okhrana
submitted to the Department of police. The direct control of the Okhrana
was carried out by the governor of Moscow. The department search activity
exceeded the limits of Moscow and province. The Okhrana carried out a role
of all-Russian center of political detection. The structure of the Okhrana
included five departments. The Special office department of the Okhrana in
Moscow was established in 1902. It supervised (was responsible) the
organization of external supervision (correspondence on external
supervision, development of addresses of persons behind which it was
necessary to establish supervision, daily data on external supervision and
drawing up of reports and circuits of external supervision). The
Secret-service department was founded in 1910. The Department managed of
the an internal agents: development of the data received from confidential
collaborates. The documents of fund contain data about politicians, public
figures, scientists of the end XIXth - the beginning of
XXth centuries (writers, journalists, lawyers: publishers,
editors of newspapers and magazines, etc.), members of political parties
(Russian Social-Democratic Labour Party, parties of Socialists -
revolutionaries, Cadets, Octyabrists, Anarchists, Labours, etc.), national
parties ("Bund", The Socialist Party of Poland, etc.), participants of
mass actions, political demonstrations, strikes, the peasants' revolts,
the armed actions against imperial autocracy, political exiled and
convicted, etc.
Documents (more than 30000 cards) include all first and seconds names,
a rank, a home address, a revolutionary nickname, a nickname of external
supervision by undercover agents, an occupation, the name of the
organization (party) to which the suspect belonged to. On cards there are
photos (not everywhere), the list of measures (searches, arrests), a
nickname of the agent, delivered data on the person, the contents of the
report.
The main mission of our project is to create full collection of the
documents images and database on one DVD disk probably. The given project
is the joint one sold by archive and historical Faculty of Moscow State
University. Together with workers of archive in the project students of
historical faculty who work with archival documents during passage of
educational archival practice are borrowed. Such practice is a component
of educational process. As taking out of the true archival cards of fund
from archive is impossible, students entered materials into a database,
using the scanned images of the cards which have been written down on CD
disks. According to results of input of the first file of cards (more then
10 000), pilot research of database had been led.
This project was supported by Soros Open Society Institute (Moscow).
Svetlana Mintz
Cancelled !
The Place and Function of University Course in Quantitative
Methods for Historians
This paper put an actual topic for Russian universities as they are
ready to take part in the Bologna reform.
The paper compares two periods in teaching University course in
Quantitative methods for historians in Russia (1969-1988 and 1989-2003).
It describes a paradoxical result of intensive training of historians
in statistics and computer science. Historians became familiar with the
computer and the internet, but ten years of crash courses has been to
return many historians to the "pre-information era". Many historians
reverted to conceptualizing events in the manner of the eighteenth and
nineteenth centuries. They began to utilize numbers as a metaphor. A
paradoxical situation occurred - the widespread mastery of the results of
advanced information technology caused scholarship to regress and deprived
it of basic methodological achievements.
Main thesis of this paper is that in the history classroom, the
mathematician or specialist in information science becomes the bearer of a
historical-philosophic paradigm. This aids future historians to inscribe
knowledge of history, and, in a broader sense, of the humanities and
social sciences, in an existing hierarchy of scholarly knowledge.
The paper makes several assumptions about the role and place of courses
of quantitative methods and computer science in contemporary humanities
and social science higher education.
Ivan Sintchouk
The onomastic cross-sectional study of "new belorussians": the
modern databases in the special historical research.
The purpose of this work is the reception of the quantitative
characteristics for an estimation of economic activity of the different
ethnoses representatives and ethnic groups living in Belarus during
1993-1997 years. As there are no statistical data on this period the
method of the reception of the required information to indirect
attributes, which contains in surname's endings, was used. As a source for
creation of databases focused on the decision of research tasks, the
modern databases intended for needs of the machinery of management and
information and publicity services (for example, lists of the telephone
subscribers) were used.
The common number of Belarussian surnames in graphic variants is
roughly close to 100 thousand and is reduced to 20-30 thousand
phonetically close variants.
The complication of the analysis of surnames consists in their last
graphic mobility, that has resulted in variety of the close forms, which
are indistinguishable during the use of standard means of sorting and
indexing.
In the given research the revealing, except for direct comparison of
surnames, conformity and discrepancy of a share of surnames of elite and
usual for region of a share of surnames of the population was most
important. The applied toolkit well fixes within republic migrations of
the Soviet period, first of all among the persons with high education, to
which "new Byelorussians" mainly concern.
As far as Minsk, the capital of Belarus, grew up at the account of
newcomers from different regions the distribution of formants for Minsk is
close to the average for the country which is vivid from comparison of the
telephone subscribers of Minsk and private businessmen of republic.
The share of participation of Jewish ethnos representatives in economic
life is found out in some times less expected. Small share of Jews can
have 2 main reasons: traditional from the Soviet Union times disguising of
nationality by changing surnames or uneven migration of Jews according to
age structure (young and middle-aged educated people emigrated first of
all). Such a disproportion makes us pay more attention to the
identification of social and economic activity of Belarussian Jews. There
is ground to believe that traditional activity of Jews in this sphere has
changed and social stereotype of a Jew does not correspond to the reality
any longer.
It is rather essentially the small increase of a share of ethnic
Russian in the sphere of economic activity of the country in comparison
with their share among the population of republic, as the general share of
Russian people in Belarussian population is rather significant. The
phenomena of disproportionate representation in "postperestroika" business
ethnic Russian is not an exclusive feature of Belarus - for example, the
similar processes are marked in the neighbouring countries: Lithuania,
Latvia, Estonia.
On the whole it is possible to speak about the smaller economic
activity of the representatives of the title ethnos in comparison with the
representatives of others ethnoses living in Belarus.
Andy Carver and Eve A Anderson
Data Integration and Modeling in the Social History Domain: An
Open Source Project
Data exchange among genealogical applications and/or web-accessible
data repositories involves expectations about the sorts of data that will
be exchanged and what the data means. To be more precise: data exchange
deals not with mere un-interpreted data (text strings, bit strings etc.)
but information, propositions that are known, supposed, or hypothesized
(by one or more persons) to be true. Primarily then, we must define the
types of assertions that are to be stored, along with the relevant
business rules (constraints and derivation rules) that limit the allowed
fact populations and possible transitions between database states. Such
expectations, collectively, define a data model. The structure of such a
model is a conceptual schema.
In the analysis phase of a genealogy system development, a conceptual
schema may be used to describe the information structure of the
application domain in a way that is easily understood and validated by the
domain expert. Once validated, the conceptual schema can then be mapped to
logical/physical/external schemas using automated and/or manual processes.
Neither XML DTDs nor XML schemas afford an easily understandable and
semantically adequate way to model an application domain, as they lack
mechanisms for representing several common constraints and relationships
(e.g., exclusion constraints or many-to-many relationships). While the
high level data modeling may be performed using attribute-based approaches
such as Entity Relationship (ER) modeling [2] or Unified Modeling Language
(UML) class diagrams [11], there are many benefits to be gained by using
an attribute-free approach to conceptual information modeling, as found in
the Object-Role Modeling (ORM) [6, 7] and Object-oriented Systems Model
(OSM) [3, 4] approaches, which offer greater semantic stability, as well
as graphical and textual languages for capturing a wider range of business
rules. These approaches may also be fruitfully used to reengineer
databases.
As part of a new project at Northface University to create an
open-source genealogy system for pedagogic use, the ORM methodology was
used to reverse engineer some well known genealogical schemas into ORM
schemas. This process exposed various problems with these schemas. After
briefly reviewing these genealogical models, we discuss a number of their
problems, and identify some relevant design trade-offs. We then propose a
possible solution, using ORM to visually specify business rules that
typically cannot be displayed using attribute-based approaches.
Andrey Andreev and Leonid Borodkin
Nonlinear Modelling of Historical Social Conflicts:
Self-Organization and Chaotic Behavior
As it was said in the discussion on applicability of chaos theory in
historical research, "accurate dynamical models of history, were we able
to produce them, would be nonlinear". In this paper we discuss nonlinear
nature of historical social conflicts dynamics. We propose nonlinear model
of strikes dynamics which includes three differential equations. Three
variables of the model are: level of strikers activism, level of agitation
and level of authorities pressure. Workers self-organization plays an
important role in nonlinear effects of model behavior.
Computerized research of the model dynamics reveals chaotic behavior in
some areas of parameters. The paper contains classification of chaotic
regimes generated by the model.
Leonid Borodkin and Anna Konovalova
Chaotic Dynamics of Share Prices at St. Petersburg Stock Market
in the First Decade of the 20 th century: Metal and Oil
Analysing historical time series one should realise that sometimes
unexpeted behavior of the system under consideration could happen without
any substantial external cause, so the metaphor "Big-Big" ("big causes
generate big changes") is not valid generally speaking. Chaos theory gives
an appropriate tool to study unstable social processes which can be
characterised by sensitive dependence on initial conditions. With the
development of specialised software the "new" sciences of chaos and
complexity have arisen largely.
Stock market dynamics is one of "classical" examples of unstable
behavior. The standard question is: what accounts for stock market booms
and busts? Analysis of share price fluctuations is a complicated task -
with too many contradictory factors (both internal and external) which
must be taken into consideration. In our paper we analyse the factors
influenced on fluctuating share prices of large Russian joint-stock
companies, the securities of which were quoted in the St. Petersburg stock
market between 1900 and 1909 (during the "decade of industrial stagnation"
in Russia). Out of the two-hundred companies whose share prices were
quoted at the beginning of the period, we have selected six of the most
well-known joint-stock Russian companies: three of them are
machine-building companies and three others are oil companies. The primary
source for our research was a newspaper Birzhevyie Vedomosti ("The Stock
Market News") of St. Petersburg, information from which was used to
construct six time series - daily values of share prices of the six
companies within the decade. As a result we have about two thousand points
for each time series so they are rather long to be analysed by chaos
theory methods to detect chaotic behavior.
The main aim of our research is to define the relative role of
"internal" factors of the market dynamics, those related to the
interrelationships of market players. The copies of Birzhevyie Vedomosti
(among other periodicals from the beginning of the 20th century) contain
quite a lot of descriptions of the stock market events and their
interpretations which give an evidence of the noticeable influence of
external factors on the stock market behavior and share prices changes
(reports from foreign stock markets, political and economic news, etc.)
However, we cannot avoid noticing the certain "insensitivity" of share
prices as they relate to external events.
To analyse the dynamics of the six time series we used special software
"Chaos Data Analyzer: The Professional Version". It should be noted that
we detected chaotic behavior in all time series, however it was expressed
more clearly for three machine-building companies. The results permit us
to generate the hypothesis concerning the greater role of "internal"
(homogeneous) factors of the functioning of the St. Petersburg stock
market in the long-run consideration and substantial role of external
(heterogeneous) factors in short run at the beginning of the 20th century.
Lars Nygaard
Methods and tools for testing, preserving and utilising
electronic archives
Since the start of the era of preserving electronic archives (about
1980) the National Archives of Norway have received and preserved more
databases from governmental information systems than "real" records from
registry systems with electronic documents. According to the Norwegian
tradition in this field information value is regarded as an equally
important criterion for preserving records than evidential value. These
information systems typically contain data about the population, real
property, buildings, cars, ships etc. Fore more than 30 years we have also
digitised (by transcription) paper based material of this type (e.g.
census material and parish registers) to make it more available and
utilisable to researchers and to the public, for the recent years through
"The Digital Archive" on the Internet.
The information systems or their databases are not preserved in their
original form. Instead, the data (complete or subsets) are exported from
the database to hardware and software independent text files, which are
stored for the future. Through the years it will probably be necessary to
convert the data files to new formats (character sets etc.). This is
called the migration strategy. In addition to the data files themselves,
different kinds of metadata about the original system and the extracted
data are collected and stored. Metadata about the content and the context
of the system and its data will be kept in our archival description system
Asta. Metadata about the structure of the data (often called technical
metadata) have to be very detailed and standardised for the data to be
effectively re-used by the computer. To achieve that the National Archives
have developed an XML based "language" called ADDMML for describing the
outer and inner structure of the data files. The structure is described on
four levels; data set, data file, data record type and data item (field)
type.
Arkadukt is an interactive program to be used by the records creators
to describe the archival extracts of their databases in ADDMML before
transfer to an archival depository. Arkadukt is based on Microsoft Access.
Arkade is a large system to be used by archival institutions to
"prepare" received data files for long term storage, i.e. by analysing
further the structure and the contents (e.g. the range of values in
particular data fields) of the files, verifying the metadata in the
accompanying ADDMML file, or converting the data files to another text
file format (e.g. XML). Arkade will also be used to describe and manage
archival material digitised by transcription. Arkade is developed on top
of The SAS System.
Arkade also has a module which creates and fills a relational database
in SAS with the data from the read text files. One of our plans is to let
researchers and other skilled users (mastering SQL or the SAS language)
access and utilise such archival data sets by means of SAS (e.g. for
statistical analysis). Another plan is to load open electronic archives
into MySQL databases and offer the users (even the public) access to them
on the Internet through a general web interface.
Eila Williamson
Digitisation of Archive Material
The outcomes of projects which have digitised archive material are
increasingly available over the WWW to the history researcher. This paper
will survey what Scottish archival and library material has been
digitised, and is available over the WWW, and will explore the subsequent
implications which this has for research. Topics to be discussed will
include: reasons why digitisation has taken place; how these projects have
been funded; ease of access for users; whether or not there is any
commercial gain for the archive or library; and what users are being
catered for. Particular consideration will be given to the needs of
different user groups, such as family historians and academics, and the
possible dangers that prioritising one user group over others may have. Do
projects which have digitised material with one particular category of
researcher in mind actually inhibit research for another category of user?
Ole Martin Sørumgård and Arnfinn
Kjelland
Putting a main part of Norwegian Local History - the old farm-
and genealogical history genre - into the Computer
Norway has an old but still flourishing genre of farm- and
genea-logical histories. Such histories describe the history of all farms
and other dwellings in the selected local community, combined with
genealogical data of past and present residents. The information is
usually presented in a narrative form with dwellings ordered
geographically, residents arranged in «families» or other groups and
listed below dwellings they belonged to. Further, each individual has
cross references to other places where he or she has stayed, place of
birth etc. Because of this it is possible to trace individuals as they
moved from place to place, map the pedigree of a selected person and so
on. A large number of such books are still published each year in Norway.
The area of a project is usually a parish, and the result is usually
several large volumes.
Traditionally, manipulation and analysis of the huge amounts of
historical data needed as background material are based on more or less
manual, highly time consuming methods. In recent years, though, there have
been some improvements, especially through faster and cheaper computers
available. But each historian has still to rely heavily on his or her own
skills and ingenuity in developing the appropriate tools.
It should be noted that historians working in this genre are forced to
use somewhat different methods in their analysis, compared to both
historical demographers and traditional genealogists. The demographers'
approach yields valid results on the statistical level, but not
necessarily about individuals in the material which historians and
genealogists have to survey. On the other hand, genealogists often map
specific families, leaving out the considerable number of individuals with
no links to any known family, or no descendants. Clearly, historians in
the «farm and genealogy» field have a point of view somewhere between the
two others, and must in addition cover the general history of the farms
and other dwellings in the researched area.
It should also be noted that there are not two such projects that are
identical; they vary considerably as far as the settlement structure, size
of population, amount and quality of source material etc. of the
investigated areas (parishes) are concerned. Also the willingness of the
project organisers (usually the municipality) to fund accurate collection,
treatment and analysis of sources and the qualifications of the reseacher
(often a local historian without formal training in history) can vary to a
great deal. The results are books of very variable quality. However, good
books in this genre can generate considerable income and contribute to the
funding of the project. Such projects - at least 114 are ongoing in 2002 -
were funded with a minumum of NOK 21,5 mill. (EURO 2,7 mill.) in Norway
this year.
Despite several commercial software packages available, serving both
demographers (statistical analysis tools) and genealogists (family record
programs), none of them quite fullfills the needs of historians in the
field. This is primarily due to their lack of capacity to handle farms and
other dwellings, and the fact that they rarely can produce references
between persons and places.
The purpose of this project is to develop a useful tool for historians
working in this field, a tool to assist researchers in all stages of work,
including final presentation of the achieved result as a book manuscript.
The title of the system is «Busetnadssoge» («Settlement history»),
abbreviated BSS, or BSS-application (under Windows). This application will
be able to handle all such projects, regardless of settlement structure,
sources etc. in the area investigated; the only limit is population size.
BSS is now tested in a bygdebok project of Volda, with a present
population of 8.500 (4.556 in 1900, 3.939 in 1801).
Some documentation in Norwegian is found at http://tilsett.hivolda.no/ak/BSS/Busetnadssoge.html
Aada Must
"The Estonians'History in the Databases of the Russian Memorial:
an Analysis of the Results of Linking the Databases
At the 1998 International Conference of AHC, I introduced the databases
I had created about the Estonians’ surnames and their ancient homes (by
1835 there were 41,000 different surnames, among them 29,000 unique ones
that only occurred in one concrete parish), the shift of surnames (in the
years 1935-1940 about 20% of the Estonians’ foreign-sounding surnames were
Estonianized) as well as the preliminary results of the use of the
database, gained in the research on internal-Estonian migration.
In the years 2001-2003, we have applied the mentioned database and new
search engines to the research on the history of the Estonians living in
Russia (in 1918 c. 130,000 persons). Luckily, I found an important source
of individual biographical data of the communist regime victims (incl.
their place of birth and that of residence when they were arrested, etc.)
in the databases of the Russian Memorial, published on the CD ROM (partly
supplemented on the Internet).
The Memorial’s databases, written in Russian (in the Slavic alphabet)
cannot directly be linked with the database of the Estonians’ surnames in
the Roman type, there are curious examples in the practice of
transliteration of place names, cf. “St Isaac’s Parish” = “439 kg”). Based
on an analysis of practical data we have created a respective search
engine. We had to eliminate German-origin surnames from our research
(denoting mainly Germans, seldom Estonians, in the Memorial’s database).
This elimination was supported by the search for specific Estonian first
names (Endel, Virve, etc.). However, in most cases the Estonian first
names became Russianized in Russian-language documents (cf. Andres
?Andrei, Tõnis ? Anton, the end of the typical Russian marker of gender
“a” of Russian female first names was added to Estonian names, etc.).
The internal source-criticism of the Memorial’s databases revealed
essential errors. E.g., 100% of entries in the database of the victims of
the Pskov Oblast contained gross mistakes (thus, 250 Estonians had been,
after their execution, for the second time arrested and shot; in
processing the Excel Table only part of the fields had been selected). As
is the common knowledge – a small confusion can be caused by a historian
by himself, a chaos needs a computer.
Until 1991, the theme about the Estonians’ resettlement movement to
Russia was only episodically-fragmentarily researched (mainly based on the
reference book published in 1918 about Estonian villages in Russia).
Thanks to our links, we found a number of Estonian colonies in Russia, not
mentioned in the historiography to date. Novel data were found about the
Estonians who had settled down in towns and cities (incl. those who at the
beginning of the 20th century had gone to Russia to work as agronomists,
veterinarians, station masters, cheese makers, engineers, mechanics, car
drivers, teachers, etc.) and to make educational career (e.g., a great
number of Estonians had studied in Siberia, at the University of Tomsk).
We drew up a detailed map about the proceeding areas of emigration,
constructed emigration timelines (the ancient home of the surname, birth
place, various areas of settlement in Russia, a map of later more
important centres of attraction). We gained massive biographical and
genealogical information (e.g., about the people who at the beginning of
the 20th century were not given permission to resettle to Siberia but
finally they somehow reached their dreamland – we found their names on the
list of those shot in the Krasnoyarsk district in 1937). Likewise, the
search engines we created yielded important information about the
remigration of the Estonians from Russia, however, it has been a
suppressed topic (primarily for psychological reasons, to avoid being
repelled, etc.) for decades.
Elwin Koster
Mapping and analyzing the urban landscape
In 1960 the British geographer M.R.G.Conzen published his book on the
urban transformation of the townplan of Alnwick. In his introduction he
announced two other volumes on the building types and the land use
patterns of the same town. When he died in 2000 these volumes where still
not published. In his personal archives a number of notebooks were found
containing surveys of several towns, including two surveys of Alnwick, the
first one was carried out in 1953, the second one in 1964. With the use of
a Geographical Information System the data from these surveys are now
mapped. The main question is if specific queries can be used to answer the
questions that Conzen asked in order to write the two missing volumes of
Alnwick. A related question is how the urban landscape developed over the
last 50 years, now a third survey is made in 2003.
Peter Doorn (Netherlands Institute for Scientific Information
Services - NIWI) and Kees Mandemakers (International Institute of Social
Science History - IISH)
Life courses in context: a collaboratory based on
Dutch population registers and censuses (19th and 20th
century).
Summary
Two institutes of the Royal Netherlands Academy of Arts and Sciences
(KNAW) have started a programme to develop a database with about 40.000
individual life courses of people born in the Netherlands the period of
1863-1922. This micro-data will be supplemented with aggregate (tabular)
data from the Dutch censuses as they were taken between 1859 and 1947. In
a few years time, all Dutch censuses between 1795 and 1971 will be
available in electronic form.
Contrary to some other Western countries, census data on individuals
for the period before 1960 do not exist anymore for the Netherlands. To
bring research on comparable international stand-ards it is necessary to
reconstruct these data. But, the construction of this database will not
only eliminate the disadvantage of lacking data on the micro level, faced
by the Netherlands compared with other countries. The HSN database, drawn
from the Dutch population registers, will actually put the Netherlands in
a unique position in the international research world. Because the
popu-lation registers have recorded details of every change of residence
for every individual since 1850, investigators can have access to research
populations that are not limited to persons who stayed put in just one
municipality.
The database of the Historical Sample of the Population of the
Netherlands (HSN) will cover the entire country and contain micro-level
data on the life courses of over 40,000 individuals born between 1863 and
1922. These life courses are to include data on each successive family
situation in which the individuals lived, all the addresses where they
lived, as well as data on the religi-on and occupational title of each
subject and of every person with whom they co-resided (and, for married
subjects, data on the occupational title and place of residence of family
members of the subject’s spouse). In the social and historical sciences,
the life-course perspective is increasingly important in explaining
demographic and socio-structural processes. The HSN database will co-ver
the entire period from 1863 to 2000 and can be regarded as a chronological
expansion of the retrospective databases currently in use by the social
sciences.
Individual life courses have to be analysed in a rapidly changing
environment of an industrialising and modernising Dutch society. The Dutch
national censuses form a fundamental source of information for conditions
on the level of the municipality. In addition to the population size,
popula-tion censuses contain information on the structural characteristics
of the population, such as age, gender, marital status, religion,
household status, occupational activity, and nationality. In some years
the censuses were combined with an occupational census and a housing
census. The project includes the digitisation of the population and
occupational censuses of the Netherlands for the full period 1859 to 1947.
The data will be linked and integrated with those from the Historical
Database of Dutch Municipalities and will build on those censuses already
compiled by the Netherlands Institute for Scientific Information Services
(NIWI) and Statistics Netherlands (1795-1859, 1899, 1930, 1960 and 1971).
The expansion of the existing databases of HSN and NIWI into complete
systems will be of ma-jor significance in the long term. In the short term
it will provide a strong stimulus to academic research into social,
demographic and economic developments over the past two centuries. This
programme will result in a series of studies in historical de-mography,
social and economic history, human geography, sociology and epidemiology.
The work will be performed in stages so that the data will become
available for research before the project is completed.
Another aspect of methodological innovation envisaged by this programme
lies in the scope for connecting data from the micro and meso/macro
levels. The population censuses and other muni-cipal data offer a context
for the individual-level and family-level data. The combination of the
different sources will create new opportunities for multi-level or
cross-level analysis.
The results of the proposed project will be presented and communicated
in the form of a collaboratory for the humanities. A collaboratory is an
open meta-laboratory that spans multiple geographical areas with
collaborators interacting via electro-nic means - "working together
apart." The impact of these databases will however not be confi-ned to the
research world. The databases will be accessible on the Internet to the
general public. However, for privacy reasons files or data relating to
individuals still living will only be accessi-ble using a technique known
as remote execution.
Evan Roberts
Integrating international census data: the North Atlantic
Population Project
Integrating international census data: the North Atlantic Population
Project This paper discusses the North Atlantic Population Project (NAPP),
which is harmonizing and distributing complete-count historical census
microdata from Canada, Great Britain, Iceland, Norway and the United
States for the late nineteenth century. NAPP is a collaborative effort,
involving researchers from 7 institutions in five countries. The first
preliminary release of harmonized data are scheduled to be released
shortly before the IAHC meeting. This paper gives an overview of the data
made available in the preliminary release, and outlines what the database
will contain at the final release in summer 2005. Potential research
applications of the databas
Michelle van den Berk and Peter Doorn
A data archiving disaster: The Dutch population census of
1960
The 13th Dutch population census of 1960 was to be the first 'digital'
census in the country. Data from the questionnaires concerning about 11
million inhabitants were entered manually into computers, analyzed,
published, and then stored on punch cards. That is the digital part.
However, the archiving of punch cards is a different matter altogether,
which has far more to do with traditional than with digital archiving, and
at this stage things started to go crucially down-hill. When storage space
was needed for the storage of the cards of the 1971 census, the 1960 cards
were acquired by the Steinmetz foundation (the forerunner of the social
science data archive in The Netherlands) and transferred to the University
of Amsterdam. Optimal storage locations were not available, nor was money
to keep them in storage: boxes with cards got moist, punch cards crumbled,
were torn, and stuck together.
When the punch cards were read into a computer again in the early
1970s, some more mistakes were added: many cards were read twice, many
others were not read at all. Some 300,000 records are missing; some
250,000 records have been entered twice. In those days, the processing
time required on the mainframe computer of the academic computer centre to
investigate the full extent of the problems exceeded the computing budget
of the whole faculty of the social sciences. The owner of the material,
Statistics Netherlands, decided that the resulting dataset was so
incomplete and corrupted that it had no sufficient value and could be
discarded: fortunately, it was not.
A good 40 years after the census was held, the retrospective
digitization of the Dutch censuses forms the framework for a project in
which we are trying to reconstruct:
- What exactly went wrong and when
- What can be done to create a usable data set from the digital files
with the help of previously published tables?
- How can the data be made available for historical and social science
research?
Needless to say, data entry from the original forms or
cards is no option: they have all been destroyed a long time ago.
Rafal T. Prinke
Digitised sources and research strategies
Dynamically growing availibility of digitised historical sources and
secondary literature presents important new challanges to the whole
community of historians, ranging from ethical and economic issues related
to the ease of access to electronic resources, to what can be called
"research strategies". The focus of the intellectual process involved in
planning and doing research work is slowly moving away from the
traditional reading to computer-assisted searching, comparing and
analysing digital information. As a results the researcher gains on speed
and scope, but also loses on contextual and accidental aspects of her
perception of the source. New emerging standards for electronic editions
of sources require new technical abilities from the researcher - but also
a different mental approach in which the visualisation of the research
process must take into account the very nature of the application of
strict algorithmic methods to the imprecise matter of history. Sharing the
method in order to allow peer control of the results (which is the essence
of scholarly approach) must include sharing the algorithmic tools and
digitised data which is not always possible for various reasons. While
"historical computing" is becoming widely recognized and applied by all
historians, the major task for the former pioneers in the field is to make
them realize not only the advantages of using new technologies (which is
hardly needed anymore) but the challanges they have to face.
Patricia Kelly Hall
World War II Black Veteran Migration and the Distributive Impact
of the G.I. Bill
This paper uses census microdata from the IPUMS database to examine the
extent to which the government's investment in World War II veterans under
the "G.I. Bill" was unevenly distributed among regions and types of places
as a result of differential patterns of internal migration. The IPUMS
database contains all the information collected by the census bureau for a
1 percent or larger sample of the entire U.S. population from 1850 through
2000. With the application of computerized statistical techniques, the
power of this database can be harnessed to answer research questions not
readily addressed with traditional historical evidence.
Internal migration in the United States, which had begun to increase
during the Great Depression, rose rapidly during and after World War II.
Earlier research has shown that veterans migrated at significantly higher
rates than non-veterans and constituted a substantially higher proportion
of every migrant group. White migrants showed higher absolute gains to
migration than blacks with additional gains to veteran migrants. But black
veterans showed a more dramatic rise in relative income over non-veterans
than did whites. What has not been examined is where these veteran
migrants came from and where they settled.
The differential distribution of veterans within migrant groups
suggests that the direct economic benefits veterans were given with the
G.I. Bill - as well as the indirect returns to investment in human capital
- resulted in an uneven spatial distribution of these social welfare
dollars. During a critical period of economic growth and diversification,
some areas and types of places may have received a disproportionate share
of the government's veteran benefit dollars
Anne Lif Lund Jacobsen
Using GIS and Multivariable statistic in regional economic
analyses
Original situated within the field of marine environmental and economic
history, this paper will investigate the potential of using multivariat
statistic combined with GIS to outline the structure of a regional
economy.
The object for research is the Limfjord fishery from 1890-1920. During
my research the question was raised how the developments of new fishing
techniques had an effect on the local investment pattern in fishing gear,
and how these investments were distributed spatially. In order to do so a
combined analysis of qualitative and geographical data found in the annual
fishery statistics is made. Whereas there was no or little spatial
correlations between the single sets of data, a multivariate analyses of
several data sets shows more promising results.
The ultimate aim is to develop an integrated method of analysing
historical statistic data with both multivariable statistic and GIS
Andrei Volodin
Statistics Multimedia Teaching
The paper presents first results of Multimedia Teaching Project,
initiated by Prof. Joerg Baten (Tuebingen University), that aimed to
prepare multimedia presentations teaching statistics, econometrics, and
economic theory. My presentation is based on the teaching materials
prepared in Laboratory for Historical Computing of Moscow Lomonosov State
University.
Integral multimedia presentations with support of Microsoft PowerPoint
and TechSmith Camtasia Studio technologies are elaborated/ Each
presentation consists of several main parts: text explanations, images and
graphs illustrations, audio-files which pointing out main assertations,
and video-files showing how to use properly statistic programs such as
StatSoft "Statistica" and SPSS. Such notions and methods of statistics as
correlation and regression are treated, mostly "Statistica" computing
experience including. All multimedia files produced in this project based
on original Camtasia technologies and integration is executed in
PowerPoint framework.
The prospective product of Statistics Multimedia Teaching Project is a
number of CDs, proving full necessary information for lectures improving
and distance training. This year our product will be used in the course of
teaching applied statistics at the History Faculty of Moscow Lomonosov
State University.
Alla Polevaya (co-authors L.Borodkin, I.Garskova,
T.Valetov, A.Volodin)
Internet Resource on the History of Labor Relations in Russian
Industry: 1880-1930 1)
Since the beginning of 2002 the Electronic Library of the Faculty of
History (Moscow Lomonosov State University) includes a new
subject-oriented section devoted to labor history, 1880-1930. This complex
resource is named "The Evolution of Labor Relations in Russian Industry:
From Pre-Revolutionary Industrialization to NEP" ( http://www.hist.msu.ru/Labs/HisLab/
), it is constantly renewed. Nowadays the resource (short name: WHR -
Working History Resource) has the following subsections:
- Basic legislative and law materials concerning labor relations;
- Books and articles on state regulation of labor relations;
- Texts of contemporaries dealing with the problems of labor
relations, labor legislation and its realization;
- Bibliography;
- Texts written by the Project participants on the subject-matter (and
by some historians-collaborators);
- Data bases;
- Digitized archive and other documents (images);
- Electronic texts;
- Visual materials (photographs, newspaper and journal illustrations,
etc.);
- Links to the materials, placed on other web-sites.
By the
beginning of 2003 the electronic resource WHR has become the most
representative and imposing one in Internet regarding the quantity and
quality of information about labor relations in Russian industry during
the period under consideration. So, Section 5 contains a big collection of
publications of the last decade on the problem of motivation of industrial
labor during pre-revolutionary industrialization and NEP (New Economic
Policy). In Section 4 is represented the unique bibliography on working
history, composed of the materials from digests of the 1970s - 1980s and
contemporary Russian and foreign historiography (about 2000 items).
Section 8 is filling up with electronic texts of archive documents (from
Moscow and regional archives) and newspaper materials of the 1920s. So,
sources difficult of access are put into use. Section 3 contains a
representative set of related electronic texts of articles, published in
journals Vestnik Truda and Voprosy Truda in the 1920s - early 1930s.,
reflecting the discussions of that period on the issues of regulating
labor relations in Soviet industry. In the same Section is represented the
data base concerning 2577 articles published in those journals during the
period under consideration (made by the journal's contents). In Section 1
and 2 we go on collecting materials on law regulation of labor relations
and the role of the State in that process (especially the recent
publications). Section 7 and Section 9 reflect another aspect of labor
relations (anthropological); they contain materials of individuals'
(photographs, diplomas, identity cards, etc.). Section 10 is very
important, it includes links to other Internet resources on the Project's
subject-matter. The emphasis is done on the document collection giving
"visual images" of the epoch in the context of the issues of working
history (photographs, posters, etc.); here are also collected almost all
articles and other publications on working history of the period found in
Internet.
As the sections of WHR are filled up with new electronic documents the
necessity of providing the Resource with different services increased.
Spade-work has been done to provide the resource with ftp-access to
document collections.
1) The work is being done with the financial
support of Russian Foundation for Humanities (N 02-01-12001v).
Wolfgang Meixner
Regional and social background of expatriated illegal members of
the NSDAP in Austria between 1933 and 1938.
A central role in the research to NS-history takes the question about
the social and regional composition of the NSDAP. This is particularly of
importance, there the NSDAP changed in Austria 1932 from a very
job-specific party, consisting of officials of traffic, commercial
employees, officials and freelance academic graduates, to a mass party.
With the prohibition of the party in Austria (1933), membership registers
are missing in order to be able to reconstruct for instance social
recruiting of the party comrades. Newer research showed the fact that the
investigations accomplished by Gerhard Botz for the social structure of
the NSDAP only for the time up to the party prohibition are plausible but
for the time of "illegality" cannot convince. Its attempt, on the basis
after the "Anschluss" honour for the sake of lent membership numbers the
social and regional composition of these "alten Kämpfer" ("old fighters")
determine to be able, today one criticizes and is not longer practicable.
Arises however the question, which documents us then for the order, around
the social structure of the "NS-Gesinnungsgemeinschaft" to examine for the
time between 1933 and 1938? Such a source could be the more than 10.000 of
persons comprehensive "Ausbuergerungsverzeichnis", in that all expatriated
persons with birth, date of birth and residence as well as occupation data
gone merrily since the party prohibition in the sense of the prohibitory
law of the nationality are registered. The straight occupation data found
there place the research however again before problems, in particular with
the setting-up of horizontal and vertical stratum models. In my
contribution I will bring up these problems for discussion and will try to
present proposals for solution.
Irina Garskova
MA Programs in Historical Computing (the Experience of Russian
and CIS Universities): Application of Advanced Research Projects
Historical computing at Russian and CIS universities (as well as at
Western universities) is dramatically changing under the impact of new
information and communication technologies. Professional historians more
and more often deal with large scale innovations at libraries, museums,
archives, publishing and multimedia houses and other repositories of the
national cultural resources. Advanced skills in computing are extremely
needed to enable students to get a good jobs in the information society.
New professional disciplines emerge as a response to the challenges of
information technology. Humanities departments tend to innovate the
learning content and the learning methods.
Some CIS universities have already recognized the need to establish and
develop historical computing as a teaching subject, to include MA in
historical computing in the general curriculum. Moreover, historical
computing MA programs have recently been opened at the Belorussian State
University, Altai State University, Stavropol State University, and some
other CIS universities.
The established MA in historical computing curriculum include both
traditional approaches on one hand, and the knowledge and skills of
advanced computing methods and technologies, on the other hand, as two
important components of professional education. The second component
embraces:
- general theoretical and methological issues on advanced computing in
the humanities, adopted mathematical cources (mathematical logic, theory
of probability and mathematical statistics, etc.);
- quantitative methods in the historical research, computer simulation
as a tool in historical research;
- basic and advanced courses in formal methods and advanced
computational processing, database management, visual processing and
visual analysis, multimedia applications, text-based and
multimedia-based computing.
Historians in the information
society become more experienced in using historical sources in electronic
form. For this reason the forming a teaching repository of electronic
resources is very important, especially taking into account that a lot of
electronic resources are available today on the web. Many of these
resources came from research results and can be secondary used for
educational purposes. Forming the electronic library of readers and links
to existing Internet teaching resiurces is also very important for
teaching historical informatics disciplines.
Galina Mozhaeva
Humanitarian informatics: historical aspect and problems
Peculiarity of contemporary society's development is conversion to its
new formation - to informational society. The category of informational
society is topical question now at the context of the humanities, because
it integrates all sphere of humanitarian knowledge in itself. That is why
informational society is object of philosophic, philological, sociological
and historical investigations. The process of informatization of society,
connecting with penetration informational technologies to the humanitarian
investigation's tools, have lead to of principle modification of
humanitarian knowledge's structure, when information has become primary
category. This process of transformation of informatization as reflection
on technological progress in the humanities to the primary mechanism of
structure humanitarian knowledge's reorganization and change of
interpersonal relations is happening during several decades.
Historical researches, which are forming the base for philosophic
comprehension of contemporary society, are taking on special actuality in
understanding of informatization`s process.
In the informatization`s process of social humanitarian sciences are
marked out three stages of informatization.
First stage - technical (1960 - 70-th years). It is
characterized by using IBM in applied investigations, making instrumental
facilities.
Second stage - technological (1980 - 90-th years). Using of
informational technologies in social humanitarian investigations,
development of new research methods, which making on the base of
informational technologies, have begin at this stage. Technological stage
gives the start of development of «branch informatics» - historical
informatics, economic informatics and others. Now we understand that the
development of any «branch informatics» have faults to the side of applied
researches, basing into practical using of informational technologies, to
the side of preference logic of facilities rather than logic of matter of
humanitarian investigations. At the beginning of informatization in
humanitarian as the development and the introduction instrumental
facilities it was necessarily and helped to develop humanitarian sciences.
Contemporary informational surroundings is making new aspirations to
the system of humanitarian knowledge. The process of informatization is
deeply wounding the sphere of humanitarian education: its methods,
technologies and governing apparatus and sphere of basic researches, where
informational technologies are using not only in the stage of elaboration,
keeping and analysis of codification of traditional material and become
the object of humanitarian research.
Informatization of the humanities have discover deep social changes
which produce by process of informatization. The start of third stage -
methodological - connect with it at the 2000-th. At this stage
recomprehension of the understanding of informatics, change of scientific
views on informatics as science have place. Clarification of mechanisms of
informatization`s influence on the `s development, on forming of new stage
of culture, construction of theory of informational society is starting.
It is clear that at the field of contemporary information science vision
have to be appropriateness of beginnings and functioning all sorts of
information, appropriateness and consequences of informational processes
in the society.
Study of informational processes, sources and channels of information
will allow to understand reasons and disposition of social behavior,
social interactions. Investigation of humanitarian problems of informatics
is playing to leading outlook part, which manifests itself in formation of
whole system informational world's picture, in understanding of community
of the informational processes of operation in animate nature, society and
technology.
The necessity and importance of such researches is clearly defined by
number of factors.
In the first place, information has become determining category in the
economic and social development at the modern society. That is why
informatization (as the main mechanism of the conversion to informational
society) is becoming a subject of the researches of very wide sections of
scientists now.
Secondly, informatization of society is driving to modification of
social connections and relations between people. Creation and development
of the computer communication calls modification of communication
processes in the society and character of communication interactions. Mass
communications exchange system of social culture perception of the
information, allow to manipulate by people consciousness.
Thirdly, introduction of person computers and development of the
informational and communication technologies provide the impact on the
person development, on his world outlook change, on the system of personal
valuables. The man is plunging into virtual and real computer worlds more
and more, and he is confronting with necessity of change of life style, of
way of living and thinking, of the character of rapport with surrounding
world.
The further development of informatization demands not only computer
literacy, but certain lever of the informational culture, which is founded
on the understanding of the law-governed natures of informational
society's development. The understanding of the gist of the
informatization and informatics must change in the first place.
Development of the information, expansion of informational canals,
deepening of connects between these canals, intensification of there
influence to the man and society, but not only inculcation of
informational technologies in different spheres of the society's life are
understood as informatization now. That is why the social aspect of
informatics, determining its as the humanitarian science ore humanitarian
informatics, arises.
Humanitarian informatics is the science, which investigates the
law-governed natures of the information's beginning and development in
society, the law-governed natures and results of the informational
processes in society, the philosophy and methodology of the informational
society, the informatization as the social phenomenon.
The sufficient scientific material already has been made, and the
informational society's theory has been developed. This theory gives the
possibility to speak about humanitarian informatics not only as scientific
direction, but as knowledge's system about informational society, which
may be the base for the training of specialists in this field.
At the same time, the situation has formed in the education system,
when there aren't specialists in the field of humanitarian informatics.
The thorough research into social sphere's informatization only begins in
Russia. The concept of informational culture isn't studied in historical
development and philosophical comprehension. First of all, this fact has
an influence on education. The technical perfection in the field of
informational technologies is losing its meaning if the appropriate
pedagogical means are absent. These means may be made on the base of new
methodological principles only from the field of the humanities.
The specialists in the humanities are necessary for decision of these
problems. They will help to develop the informatics as the science, which
determines the modern society's development at all relations. It's
necessary to create the system of the train personnel - the specialists in
the humanitarian informatics's field, and to organize the scientific
research in the informatization's field by way of consolidation of the
specialists in different humanitarian fields and in the inmormatics's
field.
Marc St-Hilaire, CIEQ/Geography, Université Laval
Byron Moldofsky, Geography, University of Toronto
Mapping 20th Century census micro-data: The Canadian Century Research Infrastructure cartographic system
An interuniversity and multi-disciplinary team has recently undertaken the building of a set of census micro-databases
in order to facilitate historical research on the Canadian society in the first half of the 20th century. One innovative
aspect of the research infrastructure to be developed is the cartographic component associated with the micro-datasets,
a mapping system intending to provide the researchers with tools which will allow them to partly take into account the
spatial components of the phenomena they study. This paper aims to present the mapping system, from the principles that
guide its construction to the product to be delivered and the technical frame of its utilisation. A special attention is
paid to the concepts related to the geographical construction of the census, as to the methodology used to create the basic
cartographic files underlying the whole system.
|