Home


AHC 2003:
Program
Photos

Search in:
Censuses
Cen. Advanced
Parish registers
Matrikkel1886

Links



tr
tr tr
tr
tr  Norsk

XVth CONFERENCE OF THE INTERNATIONAL ASSOCIATION FOR HISTORY AND COMPUTING

Abstracts - July 31 2003

Peter Teibenbacher

Demographic transition - a relational data-base and statistical oriented solution

This paper deals with the topic of demographic transition in a small parish in Upper Styria/Austria between 1880 and 1940. The parish`s population mainly was an agrarian one, partially engaged in handicraft and traffic. Besides that the parish consisted of 67% catholics and 33% protestants. Thus also the question concerning the role of so-called cultural and so-called social-economic factors determining demographic behaviour can be tested. The analysis is done with the help of a relational data-base in SPSS-format. This solution does not want to present itself as the best one but as a functional one among others. The advantage is, that SPSS is a wide-spread program, well-known and usable in many other concerns. The data-matrix primarily wants to represent female demographic biographies - but also contains fathers and bridegrooms -, it depicts different events like cases of births, deaths and marriages and different persons with different roles involved. The process of identifying persons is done in a semi-automatic way with the held of stepwise oriented sorting-routines, selecting routines comprising different variables like maiden-names, first and last names, dates of birth, marriage or death of a person etc. The decisions at least have been made case-by-case, cause there is no real satisfying automatic record-linkage-procedure at our disposal. Using this mix of methods the central questions concerning demographic behaviour like starting, stopping, spacing, the amounts of births for each woman, series of sequential events in a woman`s demographic biography etc. can be tested in a satisfactory way. But with the help of different routines, especially the lag and lead-functions in SPSS it is also possible to insert automatically "demographic family"-codes, comprising all persons which are involved in common demographic events like births (mother, father, child) and marriages (brides and bridegrooms).


Onno Boonstra, Leen Breure & Peter Doorn

Past, Present and Future of Historical Information Science

The roots of historical information science are grounded in quantitative socio-economic history on the one hand and in computerized analysis of historical texts on the other. In the second half of the 1980s 'history and computing' got a strong impulse by the advent of the PC. In the 1990s a debate on source-oriented versus problem-oriented computing was important. The Internet stimulated many heritage institutes (archives, libraries and museums) to digitize collections. Since the late 1990s and in the first years of the21st century, the 'H&C movement' seems to have lost it's momentum: humanities computing courses and departments in universities are under pressure or cut back; journals or series are being discontinued; the number of new publications is diminishing; H&C conferences are less frequent and attract less participants.One might even speak of a 'crisis' in historical computing.

It is useful to reflect on what has been accomplished in the past 15 years, what the present situation is, and to set out lines for the required future development of the field. To this aim, we have formulated a research project, which should result in a conference, to be held in Amsterdam on 12-13 December 2003, and later in a book. The research questions have been formulated as follows:

What has been the effect of computing on history as a dicipline? Which new challenges lay ahead for history and computing?

Which innovations did computing bring the historical discipline? What innovations for the future are possible and desirable? What are the requirements of the historical field on computing?

What were the main topics for discussion? What new topics require discussion? Future agenda?

We intend to focus the research questions focus on the one hand on the quantitative approach to history (socio-economic history), on the other to the Qualitative approach to history (cultural history), text & images. Moreover, attention will be paid to the role of digitization and digital archiving in heritage institutions for research, on teaching in universities, and on the communication of the History and Computing specialists with the sectors of historical research at large, heritage institutions, and 'general' informatics.


Serguei Kachtchenko and Ekaterina Kostrigina

Population of Karelia (1800-1917). Computer Analysis of individual records.

In 2000-2003 the work on finding and studying parish registers of Olonetz and Pudozsk parishes of the Province of Olonetz with karelian and russian population has been continued. The research has been carried out by the chair of Source Studies Of Russian History of St.Petersburg State University, with the support of Russian Humanitarian research Fund of Russian Academy of Science (Grant ?01-01-78002 ?/?).

As a result of search works at The Central State Archives of The Republic Kareliya (Petrozavodsk) and The Central State Historical Archives of St. Petersburg numerable volumes of parish registers (since 90-th of the 18th century to 1917), confession records, revision materials and censuses of the Province of Olonetz were found.

The found items of information were formalized and entered into computer databases, those files, which contain the information on births (baptisms), marriages (weddings), deaths (burial services). Among the issues discussed the most attention was drawn to the principles of creating the computer databases on historical demography. The developments were towards the adaptation of the standard statistical programs and creation of the original. The analysis of parish registers, first of all issues of holes in infants' births and deaths registration is of a great importance. We managed to define the exact time when the registration of perinatal, neonatal and postneonatal mortality had begun. A number of questions, connected with the analysis of so-called "small selected sets" were investigated. The items, connected with mortality were studied (the peaks of deaths (connected with epidemics), the reasons of deaths, life expectancy (the dependence on sex and social status); rates of births and marriages (particular attention was paid to the changes in the marriage age during 150 years). Some pre-liminary results were compared with the data from the different regions of Russia and Netherlands at the international seminars, which were held in 1999 - 2001 in St. Petersburg and Groningen. Several people were busy in the research, apart from the authors of the letter, K. Valegina, M. Markova, S.Smirnova and a number of students of the History Faculty of St. Peterburg State University have taken part in the investigations


Siegfried Gruber

Migration in Albania in the early 20th century

There is hardly anything known about migration in Albania in the early 20th century on the basis of quantitative research. This is the consequence of a lack of appropriate data. A group of researchers has made a sample of the Albanian census of 1918 available for research and this sample can be used for research about migration since the census includes data about place of birth and place of present residence. This is of course only data about one point in time and does not allow for analysing migration histories of individuals, but it provides us with information about the percentage of people born in the place where they were staying at the time of the census. It allows also to analyse residence patterns following marriage and traces of migration histories can be found when children or siblings were born in different places. All places will be linked to a map for creating a historical GIS of Albania in 1918. This historical GIS enables research about the places of origin of the urban population and therefore about the different hinterland of the urban centres. The exact information about the place of residence in the cities can be used for finding the places within the cities where the rural immigrants lived and whether they tended to live near other immigrants from the same region or not.


Robert M. Schwartz

Making the Grade: An Application of Digital Elevation Modeling to the Study of Railways and Environment in Victorian England

This paper uses a digital elevation model to study the changing impact of railway construction on the landscape of England and Wales from the 1840s to the eve of World War I. It examines the degree to which the increasing power of locomotives and other technical advances after 1860 enabled the later rail system to negotiate the landscape less by cutting through hills and mountains via huge excavations than by steaming over it. In this way, branch lines to remote and topographically difficult areas became less costly and more feasible, and the geographical linkage of England and Wales was extended and enhanced. To examine this, I use a digital elevation model to calculate and compare the slopes of rail-line segments constructed up to 1855 and those built between 1856 and 1876. The initial results bear out my working hypothesis and illustrate the usefulness of this approach. I also plan to present a real-time fly-through of the virtual landscape of Wales to demonstrate another feature of GIS visualization.


Onno Boonstra

Information technology in history teaching. Learning history by writing history: the Low Countries History Museum

Undergraduate students at the University of Nijmegen need to present dozens of papers in order to pass on to the graduate courses. In 99 percent of the time, the papers they have written do not get full attention: neither from their fellow students, nor from their teachers. Although students realize that it is worthwile to put a lot of effort in writing papers, it is frustrating to experience that the merit a paper gets is in no proportion to the hours of hard work that have gone into it. In order to solve the discrepancy, the Department of History has set up a web site, called the Low Countries History Museum, which is hosting a collection of hundreds of items, each item relating to a major event in the history of the Low Countries. Instead of writing a paper, students are invited to write a lemma for the museum, thus learning history by adding to its historiography.


Mats Hayen

The coding of 3.000.000 titles into HISCO at the Stockholm Historical Database

Abstract: This paper presents the work with title standardisation and HISCO-coding at the Stockholm Historical Database. HISCO (Historical International Standard Classification of Occupation) is a collaborative project involving different institutions all over the world. In Stockholm work with HISCO began in 1999 and at the present time we have managed to code approximately 90-95% of all titles that occur in our database (currently containing more than 3 million person entries). The paper discusses the source material, coding methods, coding problems and how the material may be used by historians and other academics.


Marco van Leeuwen, Amsterdam

HISCO and the History of Work Website

Marco van Leeuwen, Amsterdam: HISCO and the History of Work Website

The International Institute for Social History will soon make public the History of Work Website. This website contains information on occupational titles, descriptions of the work involved, images on the world of work, essays about these images, measures of class and prestige as well as a bibliography. Ten thousands occupational titles from a dozen countries and languages all over the (as yet Western) world are made available in a comparable and systematic way. This has been made possible by the work of scholars from about 20 research institutes. The coding system used is HISCO, a structured coding schedule able to capture relevant details of the world of work in the past, and with short descriptions of the tasks and duties involved. Next to this the History of Work Website has at present about a thousand images on the world of work, as well as a number of essays by art historians on these images. The website offers HISCO-based measures of class and prestige in the past, recode jobs to current class and occupational schemes in use by statistical agencies, as well as a computer assisted module coding occupational titles into HISCO.


Marco Sunder

On the Biological Standard of Living of the Middle Class in 19th Century U.S.

Average physical stature of a population is often used as an indicator of (biological) living standards, insofar as human height reflects net nutritional status during the first two decades of life. The "Antebellum Puzzle" refers to the unusual situation in 19th century United States, when the average height of (male, non-slave) cohorts born in the decades preceding Civil War declined, although the economy was growing in terms of real per capita output. This has been called the "Antebellum Puzzle". The literature argues that economic forces during the American industrialization can explain a substantial proportion of this puzzle. The main argument is that increasing relative prices of foodstuffs induced an adverse dietary change among the average population. We address this issue on the basis of a newly collected sample on human heights from passport applications. It allows us to assess trends in net nutritional status of the richer parts of the society for the birth cohorts c. 1820-90. It primarily represents the urban North Eastern United States and covers not only males but also females. About 17,000 individuals have been linked to counties of birth so that differences in local conditions, especially in agricultural production, can be taken into account. Our analysis indicates that the local configuration did influence the biological living standard of the people in our sample, however the time trend suggests that they were more immune against the adversities of those years.


Markus Heller

Web Access to Classic Text Retrieval

The data format "Beta Code" in which the PHI and TLG corpora are encoded has ceased to be developed more than a decade ago. Being optimized for saving space, compactness is no longer a primary requirement for modern technology. Instead many new technologies in the field of "Information Retrieval" have developed and data exchange and ease of access are key factors of any corpus project.

Being topic of a Masters' Thesis, the current work was initially meant to be a proval of concept to show that it was possible to analyze, extract and process the corpus data into an index using inverted indexing technology as known from many search engines.

As a secondary step an access tool was built as to enable Web users to issue search commands using the highly versatile Perl Regular Expressions. Direct text access including translation into the appropriate fonts provided comfortable access for researchers interested in searching as well browsing the classical texts available.

Ongoing research which is to be presented at the AHC conference in Tromsoe concentrates on optimizing the corpus index and on transferring the index data into a relational model on the basis of a relational database.

The result will be searchable and browseable Web access to the corpus through efficient search technology on basis of a powerful relational database.


Luuk Schreven

Providing access to the Dutch population census of 1971

Although the first Dutch population census was held in 1795 by the occupying French, it wasn't until 1829 that the Dutch picked up on the idea and institutionalised the concept. From then on, there was a decennial census until 1930. The 1940 census was cancelled due to World War II, but soon thereafter the thread was picked up again, resulting in general population censuses in 1947 and 1960 and a housing census in 1956. The late sixties and seventies showed an increasing public concern with the protection of privacy. This led to a limited public ban on the 1971 census, only some .18 percent actually refused to cooperate. The 1981 census on the other hand was first postponed and later cancelled because of an average non-response of 26 percent during census trials.

Since 1997 the Netherlands Institute for Scientific Information Services (NIWI) and Statistics Netherlands (CBS) have been working on several projects to digitise the Dutch population censuses. The first results, consisting of two sets of CD-ROMs, a Website (http://www.volkstellingen.nl/), were presented in 1999. Through these CD-ROMs and the website images of the census publications from 1795 to 1971 were presented, also some 10,000 pages of published data were manually converted for the 1899 census, these are available through the CBS Statline system (http://www.cbs.nl/en/figures/statline/index.htm). More recently NIWI and CBS are cooperating in a project aimed to do the same for all the other censuses. What's more, the individual data of the last two censuses (1960 and 1971) will become available for research as well.

My paper will deal with the some of the problems encountered while examining the 1971 individual data, as well as the actions taken to ensure that individual citizens cannot be identified within the data. Furthermore I will present the end results of the project, as it is to be finished by the end


Donald Spaeth

Structured or Semi-Structured? Applying XML to Historical Data

Most historical sources take the form of semistructured data. Most computer-based historical research, however, represents sources as structured data, whether statistical or relational database management software is used. This is not a new problem. Proponents of 'source-oriented data processing' have argued since the late 1970s that, rather than fitting the data to software, the software should be fitted to the data. This paper will consider the suitability of Extensible Markup Language (XML) and related tools (such as XSLT) for representing and analysing historical data, using a database of probate inventories from seventeenth-century Thame (England) as a testbed.

Computer scientists have embraced XML as a means of representing semi-structured data. Developed initially to publish information on the World Wide Web, it is becoming clear that XML has an important role in representing and interrogating data. Documents on the Web present computer scientists with the problem of making sense of a heterogenous body of information, unlike the more regular business data used by commercial applications. Like historical documents, semistructured data are characterised by

  • irregularities in structure
  • missing data
  • duplicated fields
  • minor changes in representation
  • ambiguous data
  • information that does not conform to strict data types
  • data from heterogenous sources
XML provides a flexible user-definable approach to represent the, often implicit, structures of historical documents. Rather than copying data into the fields and records of a relational table, tags are inserted into a transcript of the text. The user may define any tags that he or she wishes, so long as a limited number of conventions are followed. The tagging system may be documented in a Document Type Definition (DTD), if desired. For those familiar with developing data models, the development of DTD, which can be used to validate and document the data, seems an obvious step. The DTD also facilitates analysis by providing a consistent and predictable structure. Yet proponents of semi-structured data have argued that the DTD itself imposes a structure, and thus a set of constraints, on the document. One must still balance ease of analysis against being true to the source.

This paper will compare the relative advantages of treating data as structured or semi-structured data. The Thame inventory database was devised using a DTD, so that the document confirms to a structure, although some slackness in definition loosens the constraints (e.g. by permitting some tags to appear at more than one level of the hierarchy). The data can be analysed in native XML, using XSLT, or can be exported to a relational database. Indeed, the ease of exporting the data confirms the structured form of its representation in XML; it is possible, for example, to relate information from one portion of the hierarchy (or indeed from a separate hierarchy) with another. The relational analogy can be used to cope with other complications created by source-orientation, e.g. the large numbers of variant spellings found in early modern documents.

The alternative is to code the data as they appear, without using a DTD. Since tags may then potentially appear at any level of the hierarchy, the data may be more difficult to analyse in one or more 'flat' matrices. XSLT and XPath provide means for retrieving content from fields wherever they may appear, but if one wishes to retrieve and relate several fields from different portions of the hierarchy navigation may be more difficult. It is necessary to develop new research strategies, both to explore the location of tags that have been used after markup is complete and to perform multi-dimensional analysis. The forms such analysis will take are only beginning to take shape, and this paper will provide a preliminary statement.


Marjan Balkestein, Annelies van Nispen and Peter Doorn

Round table on data archiving

Topics
1. The (changing?) role of historical data-archives in national data-infrastructures
Are there developments that change(d) the role and position of data-archives in your country? Does public awareness of the need for digital preservation influence centralized data-archives? What are the pros and cons of centralized and decentralized data-storage? How do you see the future of data-archives?

2. Trends in data acquisition How are your data-collections developing? Is there (still) a strong emphasis on quantitative historical datasets? What kind of trends do you see when you look at the datasets you acquire? What are the trends in computer-based historical research at the moment?

3. New developments
What new developments are of interest to our work as data-collectors and providers? XML, OAI, DDI?

4. Projects and opportunities for cooperation What (inter)national projects do you participate in? Are there opportunities for data-archives to participate in projects together?


Heinz Berger

Teaching History Online

I would like to present a paper about a project called Geschichte Online - History Online. In this project we are dealing with the possibilities of online teaching in various fields of history studies at the university. We ask the question what basic knowledge and skills could be taught by e-learning and online teaching. What is the didactic surplus of online teaching compared to face to face teaching or the reading of books? How can we find the best way of alternating between phases of presence and online phases? Can we expect progress in improvement and standardisation of qual-ity through public and transparent teaching? Is it possible to create metadata for historical con-tents?

Questions like these are focused in the web based project History Online started in October 2002 at the University of Vienna and supported by the Austrian Ministry for Education, Science and Culture. Our project group consists of historians of two departments of the Viennese University and free lance collaborators. Important parts of our work are frequent discussions about our conception with colleagues all over Austria. They also undertake the permanent evaluation of the resulting "virtual learning objects" in their history lessons at various Universities.

Elementary skills and knowledge should be presented and taught in the five following modules: Introduction into academic historical work, investigation of historical information and literature, edu-cation and training in the field of didactics for history teachers, education in didactics of media based on examples like TV-news and online teaching combined with seminars and lectures (based on a specific examples in the field of historical cultural studies).

I would also like to present the structure of the project (sitemap and requirements) and the system of metadata we use (based on Dublin core). The courses we offer start with a site of guidance fol-lowed by the main site which is divided into basic text, examples, exercises, literature and a glos-sary for historical, didactical and technical terms. In a short demonstration I will show one of the prototypes of History Online including the whole functionality.

Various didactic ways will be used in History Online like guided web investigation in traditional face-to-face lessons, interactive online learning where the users are introduced to the topic, online phases of practice with response about the success and tests, team oriented courses, presentation of historical subjects in lectures with a limited number of participants and in seminars and courses for training of multipliers.


Sergey Kornienko

Teaching the history of Stalinism in Internet The problem of our research is the opportunity to use Internet resources in history teaching. One of the historical themes in which the using of Interrnet technology and resources is "Stalin and Stalinism". The successful usage of Internet resources in the teaching of this problem demands to reveal, analyze and consider them. Method and procedures. The result of searching Internet recourses on the problem with "Yandex-system" (Russian segment of the net) and "Alta-Vista" permitted to obtain the whole amount of resources, their distribution according to types and problems. The quantitative results of research are organized in the tables which data are presented with different diagrams.

Results and Discussion. The comparative analysis of number data and graphics shows that the general amount of resources in Russian and world-wide Internet are approximately equal and that is why the educational potentials the both parts of Internet are equal. The analysis of types of documents on the problem "Stalin and Stalinism" shows that they can be classified as texts, articles, portraits and drawings of different kinds (Yandex: "Stalin" 1663). There are also audi and video documents, which are of special value in the educational process. The analysis of topics of the theme "Stalin and Stalinism" shows that they include the most valuable problems and aspects, such as Stalin's biography, the Stalin's regime, repressions, terror, the USSR in Stalin's time, foreign and inner policy, Stalin and bolshevism, Stalin and nuclear bomb, Stalin and cold war, Stalin's death and others. The detailed thematic distribution of resources is accompanied by tables and graphics.

The Internet materials allow to make a conclusion that the Internet resources are not only rather large, but unique as well. It is especially important for teaching the history of Stalinism. The more full and adequate discussion of the educational Internet resources of the history of Stalinism must be added by the inner quality analysis. We continue our research just in this direction. Nevertheless the quantitative analysis of Internet resources on the problem "Stalin and Stalinism" illustrates that it must be an important component in teaching the History of the 20-th century.


Janet Delve and Mark Allen

Data warehousing - a new approach for historians?

Relational databases are a standard tool for historical research [Harvey and Press, 1996; Hudson, 2000], with object-oriented databases receiving the occasional mention. [Bradley, 1994] highlighted the difficulties associated with relational modelling and [Breure, 1995] pointed out the advantages of inputting data into a model which matches it closely. [Burt (Delve) and James, 1996] considered the relative freedom when using source-oriented data modelling as compared to relational modelling with its restrictions due to normalisation, and drew attention to the forthcoming possibilities of data warehouses (DWs).

This paper considers in depth the nature of DWs, exploring how they differ from databases. DWs allow primitive data to be stored along with summarised and related integrated data. Normalisation is not enforced as it is with relational databases, but a degree of normalisation is possible where desirable. DWs comprise fact and dimension tables, which differ from relational tables. Fact tables are broadly geared towards numerical data whilst the dimension tables contain relatively unchanging descriptive data. DWs are loaded periodically with snapshots of data from a database, and inherently contain a time element. The issues surrounding the use of this primarily business-oriented technology for historical purposes is addressed, especially in view of the availability of appropriate DW software in universities with, for example, Oracle site licenses.

References
Bradley, J. 1994 'Relational Database Design', History and Computing, 6.2, 71-84 Breure, L. 1995 'Interactive data Entry', History and Computing, 7.1, 30-49 Burt (Delve), J. and James, T.B. 1996 'Source-Oriented Data Processing. The triumph of the micro over the macro?' History and Computing 8.3 160-9 Harvey, C. and Press, J. 1996 Databases in Historical Research, Basingstoke Hudson, P. 2000 History by numbers. An introduction to quantitative approaches, London

Biographical details
Janet Delve is a senior lecturer in Information Systems at Portsmouth University, England. She received her doctorate, 'The development of the mathematical department of the Educational Times 1847-62' in the History of Mathematics and Historical Computing, at Middlesex University in 1999 under Ivor Grattan-Guinness. Her research interests include nineteenth-century mathematics, early British computing and data warehousing. She contributed articles on William John Clarke Miller and Thomas Turner Wilkinson to the forthcoming New Dictionary of National Biography [2004]. Janet is a Co-director of the Winchester Project, King Alfred's College, and a committee member of the UK Branch of the AHC.

Mark Allen is a lecturer in history at King Alfred's College in Winchester, England. He works on the development of Winchester in the nineteenth century and the uses of the census as an historical source. Mark is also a Co-director of the Winchester Project, King Alfred's College.


Alexander Kobrinsky

Speeches of the Leaders of Factions in the State Duma of the Russian Federation (computerised analysis: applying TACT)

Making a close study of the activity of the State Dumas of the Russian Federation and analysing stenographs we can get answers to many questions which we are interested in concerning the a ctivity of the Dumas as a single whole legislative body, the activity of factions and deputies groups on the positions and views which they upheld as well as the work of separate deputies in the supreme legislative body and so on. Studying speeches of the leaders of deputy unions gives us the possibility to get more complete and distinct idea of the position of a faction or a group, of its participation in discussion of this or that question. The analysis allows to reveal factions priorities in the legislative activity.

Very often an appraisal of the activity of a deputy union in the State Duma is made on the basis of journalistic articles based, in their turn, on separate utterances of its leader given by mass media which are, as a rule, fragmentary, systemless and, besides, not always correct enough chosen and without proper scientific analysis. Argumentative questions presented by the press and television are sometimes not quite exact and make their own contribution to non-objective perception of this or that leader for the part of users of information and this is directly connected with the appraisal of work of his union in the State Duma.

The similar approach favours the appearance of definite stereotypes of perception of different politicians as well as of different parties, the Duma factions and groups in a society. The existing stereotypes prevent an objective appraisal of the activity of the elected representatives of the people.

The subject of the research constitute stenographic reports of meetings of the SD FM RF (the State Duma of the Federal Meeting of the Russian Federation) of 1993 - 1995 in their electronic version. The original source basis was composed, in fact, by stenographs of 1993 - 1995 plenary meetings of the SD FM RF (which are at our disposal in their electronic version). The present investigation proceeds the work on studying stenographic reports of the plenary meetings of the State Duma of the Federal Meeting of the Russian Federation. The results of this work were published in different scientific articles and monographs. 1

The aim of our research is to give a scientific analysis of speeches of the leaders of the main factions for revealing their positions as well as the dynamics and evolution of their views on actual problems of home and foreign policy of Russia. To reach the goal of the research work it is necessary: (1) to define the main problems touched upon by the leaders of factions in the course of their speeches, (2) to reveal their position on a number of questions discussed in the SD RF, (3) to find out the degree of their activity and priorities in the legislative activity.

Of special interest are speeches of the leaders of the CPRF, LDPR and YBL (now known to us as Yabloko). On December 19, 1999 the elections to the State Duma of the third convocation took place and today we can state that these three factions managed, for the third time (with different success), to overcome the 5 per cent barrier. Taking into consideration that the leaders of these factions remained to be the same the present research work creates the basis for the following comparative analysis: to what degree priorities of factions stated by their leaders appeared to be unchangeable and, as well, what changes took place in positions of the leader themselves and whether there were such.

Of the whole mass of the stenographs reports (in the electronic version it is about 25 mg bites that approximately makes 40 full value 600 pages books in the printed form containing materials of 140 meetings of the State Duma of the first convocation) we selected the parts which directly concern speeches of the characters we are interested in. We carried out the work on defining the main blocks of interest of each of the factions leaders in the course of his activity in the State Duma of the first convocation. We formed the list of categories reflecting the main directions in the speeches of leaders of the Duma factions. And, having selected a set of indicators (semantic units of a text ) making concrete a semantic meaning of each category, we made the analysis of speeches applying the computerised TACT programme. It was very important for us to use one and the same methodological set of instruments based on computer technologies applied to historical researches that gave us the possibility to get objective data for our analysis and made it real to make conclusions "obvious" to the maximum possible degree.

In the course of work we determined the frequency and intensity of speeches of the leaders of factions and the main accents put by them in the process of discussion of various problems. There was shown the dynamics of semantic categories appearance in their speeches. We studied the context of separate utterances and gave the estimation of it. We also showed the frequency of appearance of different terms having an important meaning for the legislative activity in speeches of the leaders of the deputies unions of the first convocation and we made their comparative analysis.
____________

  1. Kobrinsky, A. L. Problems of the State Building in the Russian Federation (on the materials of stenographs of the State Duma plenary meetings in 1993 - 1995), M.: MGU, 2001.
    Kobrinsky, A. L. Speeches of Zshirinovsky, Zjuganov, Yavlinsky in the State Duma (1993 - 1995), M., 2003.


Batyrbaeva Sh. D. Bishkek

Application of Information Technology in Database Creation at the Research of Demographic Processes in Kyrgyzstan of XX Century.

The demographic history of Kyrgyzstan in Soviet and post-Soviet period is not formed yet as the independent scientific trend. So, our investigation is devoted to the problems of database formation on the history of Kyrgyz population in retrospective view, on the basis of the censuses taken in the XX century with attraction of data related to genealogical tree on the level of separately taken settlements from 1999 to 1926.

Such methodology is conditioned by the fact, that the programs of conducted censuses differed from each other and groups of data had not been coordinated with previous data at their preparing for publication. Besides, the territory of Kyrgyz Republic, regardless of its small size, was divided into some typological regions differing greatly in their natural-climatic conditions, formed ancient traditions and customs, conditioning the differences in all areas of Kyrgyz population functioning, which also made difficulties in comparing of the available data. Hence, the selection of settlements for database formation was carried out depending on the undamaged condition of initial regional summary data.

At the formation of the database on census materials we faced with the problem of insufficient registration and inaccuracy of collected information. In order to settle this problem, we conducted the comparative analysis of initial censuses records and data from Official Registries of Village Soviets, but not always we got the desired results.

For this purpose, we conduct the additional research on the selected settlement with application of Computer technology, based on the informational logic modeling of Kyrgyz families genealogies, because according to traditions, the male representatives were to know their genealogical tree seven times removed.

The results of modeling of 3-4 generations show rather adequately the dynamics of number variations and migration mobility of men, starting from 13 years old in the separately taken settlement, because according to tradition, the male population was considered to be adult since that age and in case of their death, they were taken into account as the separate generations of definite ancestors.

For this purpose, while analyzing the clan-tribe settling of Kyrgyz people and using the list of informative objects, we shall make-up the diagram of data flows in the form of a scheme (see below).

At the making of diagram, here, the attributes of each object are being substantiated and the relations between them are being established. On the base of the collected materials for informational reflection of the clan-tribe settlement of Kyrgyz people, we created the database MS ACCESS in SQL.

This database allows to explain the number deviations in male population of some clans at the selected settlements, to make clear on their base the natural and mechanical motion for each inter-census period and to define hypothetically the dynamics of women's number variations in the definite age groups and therefore, to fill in the data, not-registered at database formation.

In a whole, the work in this direction is being continued. The determination of dynamics of the male portion of Kyrgyz population in retrospective towards the XV century will be the result of this research, because at present time we have already some published works of some connoisseurs of genealogical trees - "sanzharachy", revealing the genealogies of the Kyrgyz people.

Scheme of the genealogical tree of the Kyrgyz


Kees Mandemakers and Lisa Dillon

Best practices with large databases on historical populations

Program round table AHC 2003 Friday 8th of August 11.15 hours

  • 11.15 Introduction by Kees Mandemakers, International Institute of Social History Amsterdam and Lisa Dillon, Université de Montréal
  • 11.45 Comments by Gunnar Thorvaldsen, University Tromsø; Ólöf Garðarsdóttir, Statistics Iceland; and Marco van Leeuwen, International Institute of Social History, Amsterdam
  • 12.15 Short reaction by authors
  • 12.30 Discussion with the public
Abstract Since the late 1960s researchers who transform routinely-generated primary sources into machine-readable data have produced numerous methodological articles and book chapters detailing the process of data creation and describing how the peculiarities of primary sources can affect interpretation of the data. However, in such articles, the best practices for creating large databases have usually been implicit rather than explicit. Here a comprehensive list of best practices for the creation of large databases on historical populations is introduced, drawing upon the experiences of the Historical Sample of the Netherlands, the Canadian Families Project, the IPUMS and other projects.

The following guidelines represent a revised version of the protocol formulated on the occasion of the workshop of the Historical Sample of the Netherlands (HSN) which took place under the title 'HSN-workshop on large databases: Results and best practices' at Amsterdam on 17-18 May 2001. All the participants are to be thanked for their comments on this earlier version which have been incorporated as much as possible into these guidelines.

This protocol is not exhaustive, but it does address those principles which should be considered by all projects. We use the phrase 'should be considered' because we are well aware that other factors such as cost-benefit considerations may make it useful or necessary to make alternate choices than those advocated by this protocol. In general, diverging from one or more of the recommendations described below is not problematic as long as the argument for these choices is made clear to the users and future owners of the database. Thus, adherence to this protocol does not mean that every principle must be applied, yet the reasons for not following a specific rule or recommendation should be explained.

We consider here only those large databases which have the goal to be open for secondary analysis and which are essentially open systems, to which data may be periodically added.

The purpose of maintaining a set of best practices for creating large databases on historical populations is to:

  1. articulate the standards necessary to create and maintain high-quality databases and database documentation. Appropriate funding must be reserved to achieve these standards.
  2. ensure that the historical community can trust the results of research based on these data.
  3. benefit from previous experience in creating large databases. Thus, these rules can also be applied to data created by non-academic organizations such as genealogical societies or the Church of Jesus Christ of Latter-day Saints.
  4. ensure databases are of sufficient quality for use by secondary researchers (researchers outside the main database creation team).
For the sake of clarity and argument and to stimulate further discussion, this list of best practices is written in the form of a specific set of rules. However, since every organization bears ultimate responsibility for its own work, this protocol should be read as a specific set of recommendations. Practical examples have been supplied in those instances in which we felt they would help illuminate a particular recommendation; however to keep this set of rules lean, we added examples with some prudence.

Finally, we wish to stress that we are conscious that these recommendations are based on technology and expertise which will evolve with time. Rather than attempt to forecast future developments, we recommend that this protocol be revisited and periodically revised in keeping with new developments and insights. We invite readers to contribute to periodic revisions of this protocol by forwarding comments directly to the authors (Kees Mandemakers, kma@iisg.nl and Lisa Dillon, ly.dillon@umontreal.ca) or by addressing public comments to the H-DEMOG listserv.


Victoria Dmytrieva

Program Support for Content-Analysis: Ukraine project

TACT and TextQuest are wellknown as software for content-analysis. Both oriented mainly english texts (or in latin) to solve frequencies of words, form content categories and ets. But working is problemly with texts in cyrilic of the period before beginning of XX centure (for example, Lithuanian metrica, even texts of the end of XIX centure). The maine reason is to adapt program support for text speciffic particularly letters spelling and writing, the essence of words and phrases that went long ago out of everyday. The methodic, contents, alhorithm and other questions of software construction for content-analyse are developed in Dniepropetrovsk National University at the Department of Historiography and Source Science (Yuriy Svyatets and Victoria Dmytrieva are project authors).

Besides that the technic realize of program support calls the such question as choice of program environment for the product. Contemporary user is habit working with Windows and is disposed to interface and alhorithm of this software. Therefore we come to Visual Basic as more appropriate language to solve this task.

The scheme of program working is standard. It includes loading of main menu, operations with text file (load, edit, save and print), text analyse (structure of word vocabulary, frequencies and entropies of words, letters and phrases, form of cathegories, rezults preserve in files and ets.), help with hypertext links. We foresee connection of program with Microsoft Word application. The standard schemes of dialogs (when files open, save, print and ets.) from dinamic libraries of Window\System cataloges are adopted for the program needs.

This content-analysis program is Windows application. The start up of it realises from Worktable or Windows Explorer. Program has multi-window interface that allows to work in each of opened forms in autonomous regime.


Ian N. Gregory

Integrating space and time into analysis - the Portsmouth approach

The Portsmouth Geography Department acts as home base to one of the world's leading centres of excellence in the use of Geographical Information Systems (GIS) and spatial approaches to analyse of historical and other cultural data. The department employs eight full time academic staff with research interests in these areas backed up by dedicated support staff and a number of research assistants. Areas of technical and methodological expertise include the development of large spatial and spatio-temporal databases, surveying, photogrammetry, image processing, quantitative spatial and spatio-temporal analysis, qualitative analysis and cartography. These techniques have been applied to research areas as diverse as archaeology, historical demography, historical geography, local history and regional development. The department has an excellent track record in attracting funding for small, medium and large-scale research projects in these areas. It has links with the key individuals and organisations involved in GIS and the arts and humanities in the UK, Europe, North America, Asia and Australia.

This paper outlines work underway in the Department and provides guidance to scholars interested in integrating and analysing spatial and temporal aspects of their data.


David J. Bodenhamer

Online data resources and analytical tools

The Polis Center at Indiana University Purdue University Indianapolis is widely recognized for its expertise in historical and Internet GIS. It also has an ESRI-certified professional training program with clients across the United States. Polis has a full-time professional staff of 35, including 10 staff with GIS expertise. Its strengths are in needs assessment and requirements specification, system and application design and implementation, information architecture and database modelling, Internet data delivery and dynamic mapping, and computer cartography. Research interests are in distributed systems, semantic and technical interoperability, and adaptive systems. Polis also has a strong project management system, with over 200 successful GIS projects at local, state, national, and international levels. Its projects in humanities and social science GIS include several large-scale community information systems and major historical GIS databases served over the Internet.

The paper describes best practice making spatially referenced data available over the Internet.


Sheila Anderson

Lifelong preservation of digital data - a structured approach

The Arts and Humanities Data Service (AHDS) is a UK-wide national service to contribute to the UK arts and humanities digital information education environment by:

  • Preserving arts and humanities digital resources created by higher education
  • Providing rich, deep, access to arts and humanities digital resources created by and for Higher Education
  • Supplying advice and guidance in the creation of digital resources to quality standards that ensure their suitability for informed use in research and teaching, and their long-term viability
The AHDS is a distributed service with five subject centres providing services for archaeology; history; literature, language, linguistics and textual studies; performing arts; and visual arts, and a managing Executive at King's College, London. The AHDS subject centres identify and accession a wide range of digital resources from across the broad spectrum of the arts and humanities. These include collections of images and texts, databases, archaeological field archives, GIS, CAD, VRM, multi-media, and potentially sound and moving image collections. A range of contextual documentation and descriptive metadata accompanies these collections. The subject centres evaluate, validate, structure existing and add additional metadata, and incorporate the collections into AHDS resource discovery, delivery and preservation systems. The AHDS Executive provides overall strategic direction and policy planning.

This paper describes AHDS's approaches to data preservation, repackaging and reuse and its leading efforts to develop and promote best practice in these fields.


Paul S. Ell

Strategic datasets for the Broadband age

The Centre for Data Digitisation and Analysis at the Queen's University of Belfast is a provider of key data resources to the humanities, arts and social sciences. In essence it converts analogue research materials which are often difficult for scholars to obtain, or are analytically challenging and converts the material into digital format using a range of specialist equipment

Much of the Centre's work has involved the collection of historical numeric census data for a number of counties converting printed reports into machine-readable files. The unit also has considerable experience in the conversion to digital of alpha data and the capturing of images of objects. The Centre is working with bodies such as the Electronic Cultural Atlas Initiative and the Arts and Humanities Data Service to digitise contextual data from around the world.

This paper discusses best practice in data capture, outlines the centre's work, and reports on research agendas both in terms of methodological advances in digitisation and the use of large and complex resources in scholarship.


Adrianne Fielding

Late 19th Century Mortality in New England Mill Towns: Analyzing Linked Death, Census, and Tax Records

The construction of multiple-source databases through record-linkage is central to historical studies ranging from European family reconstitutions to cross-census studies of American occupational mobility. The construction of linked longitudinal data resources from panel data, e.g. censuses, in delimited geographic areas confronts methodological concerns that are profound yet seldom addressed. This paper details both the methods and findings to date of the Connecticut Valley Historical Demography Project (CVHDP), which has utilized longitudinal area studies and interactive, computer-assisted record linkage to examine various aspects of urban and industrial change that affect the continuing high levels of mortality and the eventual mortality decline in nineteenth-century New England.

The CVHDP has examined mortality and social change in the neighboring and rapidly growing towns of Northampton and Holyoke, Massachusetts, which differ in their economic and demographic characteristics. Holyoke was a rapidly growing and industrializing company mill town, while Northampton experienced more gradual growth and the emergence of a mixed commercial market economy. To analyze mortality in the two towns, a database for individual-level mortality analysis was constructed through computer-assisted record linkage of geographically sampled observations from decennial censuses from 1850 to 1910 and the deaths recorded within two years of each census. The CVHDP research design combines the strengths of a longitudinal panel study with the value of the continuous observation of mortality. The quasi-experimental research design also supports comparative studies of mortality in the two differently developing urban communities across time (Hautaniemi, Swedlund, and Anderton 1999). This database has facilitated cause of death analyses and GIS analyses of the mortality impacts of public health interventions in the two emerging urban centers.

The initial results from these analyses also suggested that wealth influenced cause-specific mortality, which led us to further expand the linked database. Until recently, the absence of wealth data in the late nineteenth-century U.S. federal censuses constrained analyses of wealth distribution and inequality in the U.S. and confounded longitudinal analysis. Historical demographers like Steckel (1994), Hautaniemi et al. (2001), and Ferrie (2001) have turned to tax valuation records as an alternate and supplemental source of wealth data for this critical period in U.S. history. Similarly, the CVHDP has also linked tax valuation records to the database of census and death records. This linked wealth data allows us to analyze the changing types and inequalities of wealth in the two communities over time and the effect of wealth on mortality, and the longitudinal comparison of wealth and inequality allows us to contrast the effects of the rapid industrial growth in Holyoke with the Northampton's emergence as a mixed economy market town. The findings support the influence of wealth on mortality and Steckel's (1994) suggestion that wealth inequality was uniformly high in nineteenth-century Massachusetts, and contribute to the emerging hypotheses on changing inequality in the late nineteenth-century United States.


Anastasia Leonova

Content-analysis of the data collected by NKVD among the German POWs in 1945-1949

Investigation of the attitudes of the former German servicemen who were kept in the Soviet POWs camps after the WW2 towards the different aspects of contemporary political situation was one of the important tasks of the Soviet secret services agents which were involved into the work with the POWs.

One of the key problems of the post-war German historiography was the responsibility for the beginning of the WW2 and the war crimes which caused the loss of many millions people lives.

The documents discovered in the secret services archives reveal the discussions held between the top German officers kept in the POWs camps on this topic. The former officers were divided into those who put responsibility on the whole German nation which let the national-socialist party come to power and those who reserved the opinion of the personal responsibility for the war crimes.

The report describes the contents of the discussions between the POWs on the problem of the responsibility and the dynamics of the different opinions in this case.


Boris Grekov

German Politician Walther Rathenau and his Ideas about Russia 1914-1922. (Content analysis of his political thinking.)

The paper focuses on W.Rathenau, President of the AEG, Head of the supply in raw materials in German industry during World War 1, owner of AEG factories in Russia, in 1921-1922 minister of economic rehabilitation and then foreign minister in Wirth's cabinet. An attempt is made to analyze the evolution of his attitude to Russia from the start of the First World War up to the Rapallo Treaty (1922). A database of Rathenau's works was made on the bases his private papers, which were found in the Moscow "Sonderarchive". Then, with the help of content analysis, a model was made of his political image of Russia. It was shown, that the policy towards


Anna Dzhaleva-Chonkova and Vladimir Doulov

Computer-aided possibilities of teaching history and humanities

Higher School of Transportation, Sofia, Bulgaria The main advantage of computers especially when they are connected in a network is that they may display electronic copies of documents, photos, old papers, etc. which is difficult to have an access to (they are kept in libraries and archives some of which have a special order for using). Teachers can show documentary films as well and that will give them a possibility to stop and look longer at some photos, to repeat some parts, etc.

Using computers in teaching history can present some museums in electronic version so it will be a great chance as sometimes it is difficult to visit them (if they are outside the university campus and even abroad). The virtual museum is a great idea but to develop it teachers need a collection of brochures, leaflets, books to be scanned. It is a hard work but the efforts will result in great effect.

Distance teaching and using ??? and H??L technologies makes possible to turn the teaching process from "communication" in the traditional meaning of the world into "imprinting" in the sense John Lock put in this word in preparing knowledge of preliminarily determined parameters.

The hypertext technology creates conditions to overcome one of the oldest defects of writing and teaching history - "Orwell paradox" - the effect of rewriting history according to the particular socio-communication conjuncture. Including historical issues in the multi-dimentional semantic space that authors interprete by the complex notion of perceptional and semantic continuum gives a possibility to make the history knowledge "free" from context overloading and phone expectations.

Unlike the traditional learning in history when all possible versions are interpreted through only one dominating structure of saying, the hypertext distance teaching makes possible the free perception of almost all versions of the history process according to the computer abilities and the communicator's competence.

This new communication position overcomes the alienation of knowledge characteristic for traditional education and sets the trainee in a completely different condition: he is directly involved in continuum outlined by the historical issues and he/she is made to "experience" them.

The paper presents the conceptional base of a new "virtual hermeneutics" as methods background for achieving authentical understanding of history and other humanitarian subjects and adequate teaching. Thus it is possible for history knowledge to function as "pure" history knowledge to a greater degree. The paper is based on the experience of using computer-aided methods in teaching humanities at the Higher School of Transport in Sofia, Bulgaria.


Tamara Izmestieva

Syndicate and its Impact on the Development of Russian Coal Mining Industry. The Late 19th - Early 20th Century. Methods of Analysis and Results

Russia was provided with great reserves of the two basic materials of the industrial revolution - coal and iron. Both railway construction and exploitation of very rich iron deposits in the Krivoi Rog region gave substantial impetus to the establishment of sizeable coal industry in Russia. The industry grew rapidly from the 1890s at the expense of the emergence of some large-scale enterprises.

When crises struck in the early twenties century, many of them were faced with difficulties. The current selling price of coal barely covered the costs of production. Many firms produced losses and only a handful of firms offered dividends to shareholders. In this condition, competition became destructive. What was way out?

Coal producers tried to solve coal' problems by formation of a coal syndicate. The syndicate (in Russia it was known by the name of Produgol') began to operate in February 1906. How did syndicate affect the development of coal industry?

Produgol' acted as a sales agent for its members. Because the syndicate did have the right to assign sales quotas and prices to its contracting firms, it was able to reduce market fluctuations and affect prices on coal market. So, the syndicate created favorable conditions for both survival and development of coal mining enterprises. How did members of Produgol' exploit these advantages?

Naturally, all firms were similar in its attempts to get good profit, but approach to an allocation of profits was different. Some firms used profits to pay dividends while others plowed back profit to improve their production. Some of the latter firms were able to reduce average costs at larger levels of output.

In this paper we consider a special group of those enterprises, which potentially may use increasing economies of scale. The group to be analysed is notable for many reasons, including a good correlation between interests of each of its members and interests of the coal industry as a whole.

Our analysis is based on decomposition of total cost into fixed cost and variable cost. It is not easy to determine what is a fixed cost and what is a variable cost, because of problems in accounting practice. One sensible approach is to apply an approximate method which makes possible estimating two components of total costs. Decomposition of total cost helps us to formulate a cost equation which reflects potential production capability of the enterprise.

When investigating Produgol' and its impact on the development of Russian coal mining industry, we introduce in our analysis two external factors - prices and production limits, which were under control of the syndicate. This method of attack serves to clarify the subject under consideration.

Framework of our research includes some steps (calculations of elasticity of cost with respect to output, estimation of costs' structure, formulation of equations, and so on). Although each step of analysis is rather simple, this procedure as a whole is cumbrous and tedious. So, we created in Excel special template, which make possible to perform these operations automatically.

Methods of our analysis in detail, some results of computation and comments will be presented on the conference.


Michael Shovman

A Seminar on Record Linkage Solution Framework

In the past couple of years our team has been developing several Record Linkage projects. As a result, we had created a RL solution framework that could be adjusted and applied to a wide range of RL processes. We think that as such, our framework may be of interest to the developers of fine RL tools, as well as to the historians doing RL work. Therefore, we offer to organize at the upcoming XVth AHC conference a Seminar on Cooperation in RL Systems Creation. In this seminar, we would discuss the following topics:

  • General RL process classification:
    • Desirable multiplicity vs. undesirable one.
    • "Historian", "statistician" and compound cases.
    • One-time vs. continuous operations.
    • RL Lab and RL Factory cases, etc.
  • Basic construction units of RL applications
    • Human-Machine Interface.
    • DB structure and thesauri.
    • Decision-making and saving algorithms.
  • Specific RL tools.
  • Calibration tools.

We are ready to make a presentation covering our vision of the first two topics, followed by a discussion on the third and fourth ones.


Garry Keyes & Jens Toftgaard Jensen

Mapping Urban History
GIS in Spatial Analyses of a Multi-Source Database

The paper presents a method of reconstructing a historical city map and examines its usefulness in data representation, analysis and visualisation. The core data-set for the GIS-model is a nominally linked multi-source database, which describes people, place and property in early 19th century Aarhus, Denmark. As no historical land register map of the city existed for this time period the usage of GIS required the construction of a credible map, based on the geographical information and structure inherent to the textual sources. The resulting digital map enables spatial analysis to uncover patterns in urban social and economic structures.


Lars Göran Carlsson

Record Linkage on Swedish Parish Registers - Some Methodological Aspects

The Demographic Data Base in Umeå, Sweden provides historical data from parish registers and parish statistics for researchers from both Sweden and other countries. The primary materials computerized by the Demographic Data Base consist of entries on individuals in 19th-century parish registers. A distinguishing characteristic for the material is its wealth of detail and high quality.

In order to carry out demographic analyses at individual and group levels, a method of linking individual records from parish registers has been developed. This paper discusses the DDB method for record linkage. Main outlines of the method is the combination of computerized record linkage and computer aided record linkage where the majority of the linking is performed by a computer program while remaining record are linked manually with computer aid.

Some general rules for computerized record linkage used by DDB are;

  • Well defined algorithms and rules are used to decide if a link will be established.
  • All individuals is given a unique identity number.
  • For records impossible to link to any other record a new individual will be "created" with only one record.
  • Matching is done between pairs of records or between groups of records, i.e. pages in the source.

In the computerized record linkage exact matches and clear mis-matches are processed automatically using well-tried algorithms for searching and matching , the program is able to match and link records without having a unique identifier for the individuals.


Timur Valetov, Moscow State University

Wage Inequality Simulation (on the materials of Russian industries, 1890-1910s)

The problem of income distributions and wage inequalities during different historical periods is one of interest and important aspects of social history. Wage inequality is the theme of researches by a lot of historians. The beginning of these researches is related to S.Kuznets, who suggested that income inequality dynamics for industrialization period is inverted-U curve ("Kuznets curve"). J.Williamson and P.Lindert did also research this problem on the base of a lot of materials on the USA and Great Britain and they actually confirmed the Kuznets's hypothesis, but the latest researchers of social history of different countries sometimes made another conclusions on the basis of their data. It is necessary to tell, that the question is generally not easy to study because of difficulties in collecting the authentic sources. For example, for Russia during the industrialization (1890s - 1910s) dynamics of workers' wage inequality remains completely unknown. Besides the dynamic of wage inequality itself, the problem of explanation of inequality growth or reduction is possibly even more important. This task is also still insufficiently investigated, mostly because of lack of available data.

Maybe a mathematical stimulation modeling of wage inequality could help to understand the mechanisms inequality dynamics. In the current report I attempt to make such computerized modeling at a microlevel, without taking into account the changes having place in the national economy during industrial development.

After researching on wages of workers on the basis of contemporary records from one of the textile factories of Moscow province I managed to collect the database where annual rates of all the workers-engravers' wages of the factory during 20 years (1890s-1910s, about 400 people) are recorded. This database allows to trace some factors which defined the wage rates of the workers. The most important factors are workers' age, experience and qualification levels. The wage rates were usually increased from time to time both for skilled an unskilled workers, but the wages for unskilled were limited by a low wage maximum and the unskilled workers often left the factory, so their wage could not reach even this maximum. This is why the average wage of unskilled workers was lower than of the skilled while the annual wage increases were almost the similar.

This idea is the basis of some modifications of the simulation. The model consists of 10 differential equations. At the first step of the model the simulated-factory hires the workers of different qualification levels and each of the levels is characterized by its own parameters, similar to those got from the historic source. These are the wage minimum and the possible wage maximum, the workers' possibilities to leave the factory or to increase the wage rate, possible size of the increase etc. The model calculates the main wage inequality coefficients dynamics (i.e. Gini coefficient, the percentage of common wage got by 10% of well-paid workers etc.)

The main result of the model is the following. During the first steps all the dynamics increase (because the first step simulates impossible conditions of the perfect equality), then become stable. It is interesting and important that these stable values of the coefficients and the time the system becomes stable are almost independent on probability fluctuations included into the model and they are little dependent on the values of the model constants. I guess this shows that it is impossible to explain serious changes in wage inequality just on the basis of natural processes of wage growth or reduction important for all the workers taken separately.


George M. Welling

New points of view on Price History, based on analysis of the Paalgeld Portbooks, 1771-1778

In the early thirties of the last century an international group of economic historians gathered to set standards for the writing of national price-histories of the participating countries. The Second World War made contacts between the historians involved very difficult and most of them decided to continue their work based on the agreements made earlier. Since then these price histories have been the solid foundations for various discussions, a.o. for the standard of living debate. A crucial aspect of the price-histories is the choice of goods to create a price index. Because very little large scale research had been done about the composition of the total of goods going around, the choice was mainly based on pre-conceptions. In an earlier publication I have shown that this choice was especially unfortunate for the Dutch historian Posthumus, because his sources were far from perfect. But based on my recent research on the imports of Amsterdam at the end of the 18th century, I will argue that the composition of the group of goods for the indexes does not reflect the realities of the trade. My statistical analysis will show that a different set of goods as a basis for indexes will allow a much more reliable view of the economical fluctuations.


Jane E. Dabel

Census Database of the African-American Population in Nineteenth-Century New York City

My research focuses on the unique role that African-American women played in nineteenth-century New York City. One of the most significant tools for my project is a database that contains profiles of over 50,000 African Americans residing in Manhattan. I have collected this information from the original United States Manuscript Census from New York City for 1850, 1860, 1870, and 1880. This database provides a breadth of information about free blacks in general and black women in particular with specific information about women's employment cycles, the role of women in black family structure, female migration patterns, and the finances of black women. This is such an important demographic source because it furnishes an unprecedented amount of data about black women who often did not leave written records. After examining my database for information integral to my manuscript, I intend to publish my database on a searchable CD-ROM for use by historians, independent scholars, genealogists, and teachers.

A presentation at the XVth Conference on the Association for History and Computing at the Norwegian Historical Data Centre will provide me with the opportunity to share the information in my valuable database. It will also allow me to discuss strategies for examining such sizable amounts of information.


Olga Porshneva

The Mentality of the Russian People: The 1917 Revolution in the Focus of Mass Consciousness: a Multi-Dimensional Statistical Analysis of the Letters Posted by Workers, Peasants and Soldiers to the Central Organs of the Soviets of Workers' and Soldiers' Deputies

The goal of the paper is to reconstruct the content and structure of the mass consciousness of workers, peasants and soldiers during the 1917 Revolution, their major economic, social and political preferences, ideals and values, and also the mechanism of the transformation of their mental orientations.

The paper under consideration presents an original source based on a great number of letters posted by workers, soldiers and peasants to the central organs of the Soviets of Workers' and Soldiers' Deputies and gives an opportunity to study the problems posed by the author. The content-analysis of all the letters taken from archive (GARF) files can be regarded as a quantitative analysis done by the method of random selection. I made a content-analysis of the correspondence by singling out the meaningful units - indicators, expressed by the authors of the letters in the form of statements (views) on social, political, economic and cultural problems. 238 meaningful units-indicators - have been singled out. I have also elucidated the interconnections of the indicators in each source of information and their total number, the system of relationships of each indicator with the others. The number of categories - combined indicators, greater meaningful units, was 64. The system of their interrelationships was also investigated. As a result, I have created the structure of four relatively stable layers, "blocks" of social consciousness (traditionalistic, revolutionary-defensive, radically anti-war, socialist). I have studied the nature and the level of overlapping of these "blocks".

Chronologically the correspondence was divided into three groups: spring, summer and autumn, which reflects the presence of qualitative peculiarities in the process of the revolution at each of this stages. The distribution of the meaningful categories in the aforementioned periods visually demonstrates the evolution of the priorities and the content of mass consciousness throughout 1917.

The method used in the quantitative analysis of the interconnections of the meaningful categories is based on calculating the coefficient of correlation. An analysis of the distribution of the totality of meaningful categories was made on the X2 criterion using STATGRAPHICS. The sum total of meaningful categories was distributed according to a normal law which accounted for the use of the coefficient of linear correlation as a quantitative characteristic of the interrelationships of meaningful categories.

I have also investigated the mass consciousness and system of mental priorities of each analysed social group. I have defined the character of typical verbal reactions in mass consciousness and social psychology pertaining to different social groups of people and dominant models of solving current social issues of that time.

The spring-autumn period witnessed the evolution of mass consciousness of the "lower" classes, the modification of its content in the direction of the growing radicalisation, mobilisation of communally-equal in origin moral orientations of social justice and equality. Mass consciousness of the people was fairly contradictory and multi-layered. The obvious prevalence in mass consciousness of the two "blocks" - revolutionary-defensive and radically anti-war - testified to the dominance at that period of one of the basic archetypes of people's mentality - the rebellious one. Over a third of the correspondents (intentionally or unintentionally) expressed their support for the traditional values and championed them. Part of them adhered to the old monarchic rule; others, while remaining loyal to firm power, strove to establish it in a new, revolutionary aspect. The choice of the priority themes for their confessions, demands, complaints which included the problems of war and peace, agrarian reformations, establishment of people's power, social justice, and the stereotyped variants of how to solve them, - demonstrate the lower classes' basic concepts of justice in an orderly and just social system.


Vladimir Pereverten

An approach of presentation of interdisciplinary lecture simultaneously for two categories of students.

The author has been developing the special course "Computer information technologies for historical research".

In this paper an approach of presentation of content of the didactically complex interdisciplinary lecture of this course simultaneously for two different categories of students: historians and "computer scientists" is discussed. The subject of the lecture is "Organization of information for historical research".

The approach is based on a suggestion that historians are more predisposed to verbal-imaginative perception of information but "computer scientists" - to symbol- imaginative and uses the new educational technologies. Our approach consists in making the lecture-presentation on given subject with the use of such possibilities of new educational technologies as multimedia and multiscreen demonstration.

The most important advantages of multimedia for our lecture are the hyperlinks which allow to change the scenario of reading the lecture in accordance with reaction of students and animation with sound accompaniment allowing graphically present the dynamic aspects of concepts and definitions.

Using multiscreen demonstration, we can present the same information object in several forms simultaneously. In our lecture-presentation these are ordinary text, graphics with animation and sound effects, mathematical text, which we will accordingly conditionally name: "word", "image", "symbol".

"Word" is a text without special symbolic designations. "Symbol" is a text with special symbolic designations and expressions, which requires for understanding special (in our case, mathematical) preliminary education. The destination of "image" consists in providing the clarity in presentation of information both for "word", and for "symbol". In our lecture graphs are used as "images".

The presentation of information in the form of "word"+"image" is intended for students-historians and in the form of "symbol"+"image" - for students-"computer scientists".

It is obvious that realization of considered educational approach requires the special facilities providing multimedia and multiscreen presentation of information.

The lecture-presentation has been made in two versions: for traditional presentation directly before listeners and for remote presentation by Internet.


Nina V. Piotukh, A. Frolov

Electronic Historical Atlas of Derevskaya Pyatina, Great Novgorod Region, XV-XVIIIth c.

The aim of the project is to create an electronic historical atlas of Derevskaya pyatina 1) since the end of XVth till the second half of the XVIIIth c. The main task is to put on the electronic map all the settlements, which are possible to localize using cadastral surveys. This task is conditioned by the fact that all the XVth-XVIIIth c. survey's demographic, social and economic characteristics describe separate settlement, not administrative unit. Derevskaya Pyatina's GIS is regarded as a basement for the further spatial-statistical analysis of the North-Western region of Russia to understand its socio-economic development between XV-XVIIIth c., for clarifying historical geography and for stimulation of archaeological investigations of medieval Novgorod's region. The developed technique of settlements localization may be regarded as a basement for the further creation of electronic historical maps for other regions of Russia.

Databases modelling information about settlements and population in 1770th and 1495/96 were created by now. Settlements of the southern part of Derevskaya Pyatina were localised which are more than 4,5 thousands for 1770th and about 1200 for 1495/96. Series of maps presenting distribution of settlement's types and population were made using databases.

_______________________
1) "Pyatina" is the term for one of the five districts which Great Novgorod region consisted of in XVth c.


Norbert Winnige

TASC B (Trans-national Database and Atlas of Saints' Cults.) A GIS for Dedications of Saints in Europe

The TASC-project (Trans-national Database and Atlas of Saints' Cults) aims at establishing an inventory of religious devotion in Europe. Though evidence of cults is produced from all kinds of sources, the project focuses on dedications of churches and chapels. TASC is a joint, interdisciplinary enterprise of researchers from all over Europe and beyond, initiated by Graham Jones (University of Leicester). Designed to permanently attract more collaborators it will always be work-in-progress.

Collaborators are free to choose any software to build up local datasets according to their needs. In a model database run at the Max-Planck-Institute for History, Göttingen examples of data modelling are developed. This presupposes establishing a set of core data, which will allow for a standardized presentation of data, and developing common tools for identification and encoding of data. Moreover, an encoded database of saints is set up based on the Acta Sanctorum (AS), saints canonized and venerated after the 17th century are added to it according to the standards of the AS. Further tools needed are gazetteers of place-names. Starting with lists of contemporary names based on data from ordnance surveys this collection will grow constantly. Up to now data models are more or less relational, in future more sophisticated data structures will be included.

The data on saints and veneration are presented and analyzed using geographical information systems (GISs). A GIS will be the main user interface to the data. In order to set up this interface commercial software (ESRI's ArcIMS 4.0) is used for reasons of availability. Yet, offering alternative solutions, which consist in open source software GISs free of charge, is considered.


Ingo H. Kropac

'Historical Informatics and Documentation': Positioning, Contents and Integration in Academic Course Schemes"

Abstract:
The area of "History and Computing" is treated by several disciplines, which can be found under different denominations and which contains different topics and approaches. What are the central contents of these subjects, where are the delimitations and the coherencies to other disciplines? After more than fifteen years a repositioning of this subject - at Graz university it is called 'Historical Informatics and Documentation' - seems to be needful. This paper will reflect upon possible definitions of the subject, its scientific basics, its scope of duties and the role of people concerning themselves about formal procedures and IT-application within history. The institutional setup, especially in academic curricula, should complete the review.

To this effect the paper intends to resume the discussion on the positioning of a subject located between applied computer science, information science and the various historically oriented studies themselves. For this purpose the way we followed at Graz to establish the discipline institutionally and to build curricula for several course schemes will also be presented.

________________________________

Workshop: History & Computing - quo vadis?

Proposal:
What does "History and Computing" really mean at the beginning of the 21st century? Is it just "using IT in historical research and education, and the related methodology" (conference announcement) or has H & C been developed to a new discipline within the humanities - either in the group of the historical studies or in the more formal and younger subjects like computational linguistics or computational philology?

To answer these questions the Austrian branch of the AHC proposes a substantial and international discussion about the notion, the contents and the institutional classification of this subject. The AHC seems to be the best and only frame to provide for the potential success of this attempt. This workshop should initiate the discussion and should give an overview about the different points of view, represented by the members of the AHC. For that reason we propose a forum, where the representatives of the national branches - or delegates of them - meet to compare notes and to discuss the further procedures.


Valeriy Kanishev and Roman B Konchakov

The Databases for study the Social-Demographic History of Russia(in XIX B beg. XX c.) on microlevel: formation, record linkage, make GIS, problem analysis.

The given statement is devoted to some results of long-term work on creation and processing of the electronic social - demographic databases based on initial sources of the account of the population. One of the first problems in the researches connected to definition of changes of demographic shape of separate persons and families, appeared variographical (a different spelling of personal data same people). Algorithms of the phonetic analysis of Russian nominative data (people names , names of settlements etc.) are applied to overcoming this problem and linkage of records about one person.

It is known, that the basic problem at work with sources with a low degree of structure is construction of model of a database which optimum would reproduce the data of a source.

At a difficult and non-formalized kind of representation of the information in a source usually determine the general attributes and standardize the data on a course of work.

For example, breaking a line " In 1881 has graduated a faculty of law of the Moscow university" on fields "education", "years of graduation ", "place of studying", "Speciality" etc. This problem has certain limit of complexity behind which reduction of the data of a source standard variants results in loss of the information because the form of the message of those or other data may be the helpful information and may not be described in model of a database.

In our work we came to a conclusion about high efficiency of creation focused on a source databases the model which reflects not formal, but logic structure of a source.

Separate question is research of processes (for example, social mobility) the dynamics which is reflected in the mass-sources different by the form and contents .

These sources are difficult for correlating in view of the different form of representation of the information

For example, the property of the person may be appreciated in one case as the sum of money, and in the other case the real estate is simply listed. In the form, thus, sources are difficultly combined, but on sense contain the same data.

The model of the data should be focused not only on the description of a classical parity object - attribute, but also on the description of properties of attributes. For example, in a database on studying social mobility in Tambov (XIX-the beginning XX cnt.) "property" is not only attribute of object "owner", but also itself has attribute "type" accepting values: "many", "real estate" etc. "Property" may accept different values on the attribute and be described accordingly by the sum of roubles, the description and quantity of constructions etc.

Using "sub-attributes" the model of a database provides not only ample opportunities of scaling but also to provide effective mechanisms of linkage of records.

For visual representation of historical-demographic, historical-geographic, historical-ecological processes use connection databases with geographical data (electronic maps) by GIS.

In a result created by us GIS, containing various demographical and ecological information on the Tambov's region for the period of 17-20 cnt., which allows to present development of region in the variable time (historical) and spatial kind.


Sofia Salomatina

Bargain or Reputation: The Cluster Analysis of Discount in Russian Banks, 1864-1894

The subject of the research is the practice of crediting of business deals in Russian joint-stock banks during their early development from 1864 to 1894. It was a period of severe crisis in the second half of the 1870s and deep depression until the 1890s.

Joint-stock banks were credit institutions designed mainly for short-term crediting of businessmen. In the second half of 19th century the main credit operation of these Russian banks were discount of the bill of exchange. In theory it was a credit without any collateral basing on debtor's business reputation and, importantly, on quality of the bargain caused the bill. Inasmuch as banking theory consider the discount as a tool just for bargain on credit, bank's discount customers must be mainly active businessmen.

But in 1885 economist Petr Gambarov published a paper asserting that any wealthy per-son, even not a businessman, could discount a bill of exchange in Russian joint-stock bank if he had guarantors reliable to bank's manager. If this point of view was true, Russian banks were the institutions crediting local elite without sufficient guarantee, because if the subject of crediting was not a short-term bargain, the customer had to pawn valuable. The successful short-term bargain was one of the conclusive factor of repayment of a short-term credit.

Petr Gambarov doubted that these short-term bargains were an essential part of Russian discount. His appraisal must be verified using quantitative approach. For that it's necessary to reveal some quantitative traces of bargains underling discount.

This task requires non-standard approaches, because the bill of exchange was a formal and abstract promissory note containing no information on underling bargain. Moreover, there was no immediate data containing information on the subject of our interest. There is only a set of quantitative materials on Russian discount in 1864-1894: total sum of monthly discount in every joint-stock bank, derived from balance sheets that banks had to provide Ministry of Finance and lay open to public monthly.

Possible information whether banks discounted the bills of businessmen have to be ex-tract from disposable data using multivariate analysis. In that case the general indicator of bargains' crediting might be the seasonal factor, which would be substantial if demand for credit conformed to any yearly business cycle. But only third of about 50 of Russian joint-stock banks have evident seasonal discount. The rest of the banks needs the other characteris-tics estimating "the quality" of discount, and especially confirming or, on the contrary, deny-ing that banks' customers were diversified. But all quantitative indicators of discount might be derived from two disposal time series from 1864 to 1894: 1) monthly total discount and 2) monthly total asserts. In additional to seasonal factor these indicators are available for each of the banks:

  1. mean of total asserts for the 1880s - 1894;
  2. number of branches in 1881 (no other data is available);
  3. discount to asserts ratio for 3 major time period (initial - 1864-1875; crisis - 1876-1880; depressed - 1881-1894);
  4. average annual increase of discount (mean for 3 above-mentioned periods).

As the next step banks were ranged on these indicators using cluster analysis (distance measure by method '1-Pearson r', linkage rule - weighted pair-group average). The banks in revealed clusters differ by discount characteristics.

This analysis partly disproves Gambarov's appraisal. It reveals that considerable part of Russian banks were crediting short-term bargains during a very long period of negative eco-nomic situation. Let's review these results.

There were 47 joint-stock banks in Russia from 1864 to 1894. 18 from them had evident seasonal discount and therefore most likely credited bargains. 7 banks bankrupted until the 1880s and were excluded from the research because they operated approximately 5 years on average and had no time to develop individual business strategy.

Discount operations in 6 banks were 'credits on reputation' as it was described by Petr Gambarov. It can be concluded from the following factors: these banks were very small and had almost no other credit operations except for the discount, hence they had no diversified customers. Moreover, time series of discount operations of these banks contain no traces of either seasonal factor or any other economic cycles (discount trend does not correspond to the general discount market conjuncture).

16 largest Russian banks had off-seasonal diversified discount portfolios; 9 of them were gradually taking their funds out of discount operations, 7 of them had obvious problems with discount management. But nobody was able to get stable growth of discount operations and all large banks were strongly depending on extremely negative general economic situa-tion. Scanty market conditions and elitist and closed character of Russian banking didn't im-ply the loss of economic basis of discount.


Svetlana Ashmarina

Dynamics of social insurance and traumatism indicators at Russian industry in 1892-1907.

The issue of social insurance for workers in Russia in late XIX - early XX centuries has not been adequately explored. This issue comprises two main components, namely institutional aspect - organization of industrial worker's insurance in tsarist's Russia to include development of legislation (and adoption of fundamental insurance laws), as well as practical aspect - implementation of legislative regulations. The practice of social insurance, reflected in statistical sources, remains poorly studied. Social insurance was applied to workers, employed in industries, posing most risks of personal injury, e.g. in metallurgy.

On the brink of centuries (late XIX - early XX) the Ural region was the centre of metallurgical industry, but it bit by bit yielded the palm to the young Southern region of the Russian Empire. Metal mining and processing enterprises and over 50% of workers were concentrated there. The Ural was one of the first region, where social legislation was applied to both state and privately-owned workers. Due to it this area is of special interest in relation to studying the development of insurance practices. Inadequately researched social insurance at privately-owned enterprises of the Ural region is also worth attention. Published data and statistical materiel of the Ural archives on personal injury levels and composition as well as data on insurance payments for accidents in metallurgical enterprises of the Urals and other Russian regions in 1892 - 1907 allowed one to trace the dynamics of trauma and social support in case of accidents. The report provides analysis results, obtained by means of statistical soft ware «Statistica».


George Vascik

Sophisticates or Neophytes: Charting the Campaign Behavior of Candidates in Imperial Germany Using Historical GIS

In her recent monograph on Imperial German elections, Margaret Anderson observed that German voters became more active and discerning participants across 50 years of electoral experience. In this paper, I will demonstrate that politicians became more sophisticated campaigners. Using a digital electoral atlas that I have created for the Prussian province of Hanover, I will examine candidate behavior in one constituency (the 19th Hanoverian electoral district) across a series of five elections between 1893 and 1912. I can use the GIS to chart the timing and coordination of rallies, attempts by party activists to disrupt other party's rallies, leafleting and petition campaigns. Analyzing these factors in conjunction with relevant income, occupational and confessional data, I can demonstrate that parties and candidates became more sophisticated and nuanced in their appeals, while developing complex campaign strategies to achieve their ends.


Yuriy Svyatets

Fluctus numerare: Difficulties in agreement of aggregate statistics with changeable programme

Research of social and economical processes in Ukraine agriculture during the new economical policy (1921 - 1927) needs to analyse statistical information. The Central Statistical Department of Ukraine organized and headed every year sample (10% of total) spring and autumn censuses of farms. Unfortunately primary sources (questionnaires) are not preserved. Materials of the censuses are published as aggregated data.

GIS is created with the research object. Changeable of the program with published data is the main problem of creation. Territorial and administrative reform continued during the period of new economical policy. Therefore quantity and borders of administrative units had been changed a few times. The Ukraine territory included 53 districts in 1923 year, 49 districts and Moldavan Autonomy in the spring of 1925 year, 41 districts in the spring of 1926 year. Farms of each of the districts are groupped according to size of sowing squares. Quantity and sizes of sowing squares intervals are already changeable in published materials. Quantity and list of variables that describe farm groups are not stable.

The aim of paper is to discuss main principles of GIS constructuring and analysis of aggregated censuses data with changeable program.


Lotta Vikström

Mastering and Measuring Historical Data: The Possibilities and Obstacles to Explore Individuals' Life-Courses Using Statistical Analyses on Swedish Parish Registers in the Case of Migrants to Late Nineteenth-Century Sundsvall, Sweden

Using advanced statistical analyses has enabled scholars to better grasp demographic change and its impact on both the individual and societal scale. The last few decades, historical demographers have increasingly turned to issues concerning how to statistically deal with demographic changes over lifetime. However, applying statistical methods on historical data is a complicated process especially for historians who often lack experience of such methodology.

This presentation addresses the possibilities and obstacles to explore individual migrants' dynamic life-courses by using multiple regression models. It shows how longitudinal data formed by Swedish parish registers meet the demand for applying such models. However, the presentation also highlights some difficulties linked to the application of multiple regression models on historical data and suggests methods to help overcome them.

Two types of regression models illustrate the issues addressed above. The first model measures the duration of residence of migrants in the town of Sundsvall and the other one concerns social mobility in times characterized by rapid industrialization. Demographic characteristics such as gender, age, geographical background and social and marital status are analyzed to examine the impact of these covariates on the two events of out-migration and upward social mobility. The data is drawn from computerized parish registers stored at the Demographic Data Base (DDB) at Umeå University, Sweden.


Valeriy V. Kanishev

The project "Micromigrations processes in the Russian Eurasia of the second half XIX - the beginnings XX cnt. (the Tambov region - the Altay region)" : The preliminary results.

Among researches of migrations from the European Russia to Siberia in second half XIX - the beginning XX ct. there are no the works studying this process at a microlevel on the basis of the initial data and connection with the help of the computer of the information from different sources. Such approach allows to find out motives of resettlement and its consequence for concrete families and people.

The majority of initial documents on a theme was postponed in the Altay archive - hundreds archival documents. In the Tambov's archive there are tens documents about families of immigrants, the most part from which is secondary, which arose in connection inquiries of the Altay's authorities to Tambov about necessity of legal fastening the resettlements. It speaks about prevalence of autocratic resettlements. It were roundabout means of often negative decisions of the Tambov authorities under requests for resettlement. But in sometimes in Tambov's documents the interdictions were substantiated by authorities of a financial inconsistency of immigrants. Following by Altay's data's 1888-1892, about half of migrants from Tambov's region could not get an independent facilities and send in farm laborers. But results of resettlements (in 1880th came back ~ 3 % of the Tambov's peasants) speak that the poor peasants chose difficult, but real perspective of improvement of the position on Altai instead of gloomy poverty on the native land where the accrued agrarian overpopulation.

Demographic and economic positions tambov's people on Altai region it is at present investigated on census sheets by agricultural census(1917 y.) of villages Podojnikovo and Krutiha of Barnaul district., where lived ~1200 immigrants. On a parameter of birth rate (26 o/oo) the Altay's village is near to Tambov (in 1916 birth rate around 30 o/oo). Death rate of the Tambov's peasants on Altai was much lower than their fellow countrymen - 15 o/oo against 25-27 o/oo. Very low on Altai in comparison with ??????????? a parameter of infants mortality rate (less than 2 o/oo against 20o/oo and more). The natural increase on new places at the Tambov's peasants appeared the high - 10 o/oo against 5 o/oo in the Tambov villages.. The data for one year on 2-nd Altay's villages in comparison with 3-4 Tambov's villages do not give the bases for wide conclusions. . We assumption , that the high natural increase on Altai is connected to absence "race" rates of mortality behind birth rate, characteristic for the overpopulated territories.

Comparisons of economic parameters of 1917 on villages Podojnikovo and Krutiha with average indices on Tambov's region are interesting. A new peasant sufficed quantity the plot of land on Altai's region.

In Tambov's reg. in 1910-years took in rent the ground of 27 % of peasants. In using immigrants was on the average on 8 desjatines (1 desjatina = around 1 hectare.) on one household. In Tambov provinces - 6.3 desjatines. As a whole on Altai the Tambov's peasants have received the grounds not more, but means for rent here were not spent. Census of 1917 year has fixed in household of immigrants with Podojnikovo and Krutiha it is significant the big quantity cattle animals, than in Tambov's household. It speaks presence Altai of the big fodder resources, than in the Tambov's province. There to beginning XX.cnt. in view of wide ploughing the meadows and haymakings area was sharply reduced.

By calculations of Tambov's historian V.L.D'jachkov, in 1897 in the provinces like Tambov, the males dominated only at senile age, younger were in a zone "women's kingdom". In such situation the need(requirement) for resettlement from old agrarian regions grew in connection with a problem of distribution in marriage superfluous brides. Feature of sexual structure of villages population of Siberia was the numerical superiority of the male's population. By the end of 1890th on the each 100 males were 98 females. It is difficult to explain, how peasants of the European's Russia have guessed this situation, but at the end of XIX - the beginning XX cnt. the part of women among immigrants to Altai began to grow. During 1889-1896 among immigrants in the age of 10-20 years the girls were 62 %. Among immigrants, begin to live in "old based" settlements, at age till 14 years the percent of boys was much lower, than girls. Probably, in the new environment, where males were enough, the chance of brides to marriage was more. Thus, peasants of agrarian regions of the central Russia moved to Siberia not only behind the ground, but also for grooms for the brides. However, it was normal for a traditional society with are practically overall the marital of population.

So already the first results of the Tambov's and Barnaul's project have particularly and obviously shown efficiency of resettlement to Altai region as to a way of overcoming of system crisis of an agrarian society of overpopulation provinces of the European Russia. About hundred families of the Tambov peasants could have more prosperous household than the staying peasants in native territories, than at the staying in native territories and also to solve the demographic problems.


Inez Egerbladh / Sören Edvinsson

The Swedish Parish Statistics "Tabellverket" 1749-1859. Advantages and disadvantages for research of computerized long-term statistics.

Tabellverket is the designation of the earliest systematized population censuses in Sweden for the period 1749-1859. It consists of two statistical serials focussing different demographical aspects, the size and composition of the population with certain time intervals - mostly five years - in Population forms respectively demographical events yearly in Mortality forms. For the first time all this unique statistics is accessible at one place by the Demographic Data Base.

This source enables studies at the lowest administrative level (the parish) on population development and social structure, fertility, illegitimacy, mortality, nuptiality and migration, all from several aspects. Digitisation of this source strengthens the research potential. It allows for creation of new variables in order both to detect miscalculations by the clergyman and to facilitate comparisons within and between different form editions of statistics. It enables validation of data, substitution and estimation of missing data.

The general problems in analysing long term statistics refer to changes in adminstrative areas over time, loss of data certain years or time periods, damages in the sources as well as changes in the character of information about events and the population. Digitisation involve additional complications as for interpretation and analysis of data, especially when made with the DDB demands for a literal reproduction. For instance, the changing structure and meaning of information in the sources (e.g. occupations and causes of deaths) complicate the database structure and coding. Usage of fixed prepared inregistration formats adapted to the tables in the sources restricted the possibilities to consider all kinds of additional or deviate information by the clergyman.


Nanna Floor Claussen

Widowhood in Denmark 1801
An analysis of the living conditions of a specific social group - based on the complete 1801 census.

The complete 1801 census database presents new demographic research possibilities. This paper demonstrates the use of this large database in working with a complete social group. The use of databases and computers is essential for analysing the data. Especially when working with specific social groups the availability of a complete digitised census is essential.

The paper analyses the living conditions of widows and widowers. This social group is easily identified uniquely in the census records and consists of persons that often had difficult living conditions. The main areas covered will be analyses of the size of the households, and whether widowhood meant that a person was living alone or within a family. The investigation will also include an analysis of the situation in different regions of Denmark: Copenhagen, the cities and the rural areas. Furthermore, it will be studied if and how differences exist between the two sexes. It is only possible to make this analysis due to enrichment made on the computerised data, as this information is not in the original nominal lists. The paper will analyse the age of the widowed persons and try to establish whether and how age influences the size of the households.

During the last year great effort has been done to add HISCO codes to the 1801 census and this work will be used to analyse the occupations for this group.

It is the intention that this paper will demonstrate how it is possible to make both a detailed description and to make general statistics in order to describe the living conditions of a selected group. A great deal of these analyses is only possible due to the enrichment of the material and the completeness of the data.

The paper will also discuss the methods used for producing the statistics and analyses.


Zlobine Evgueni

Moscow secret police department (Okhranka) undercover agents records - the problems of database creation.

The Department for the Defence of Public Security and Order in Moscow (the Moscow Okhrana or Okhranka) was the most important body of political investigation of Russia at the end of XIXth - the beginning of XXth century. It was created on November, 1, 1880 by the order of Minister of Internal Affairs count M.T.Loris-Melikov. The Okhrana submitted to the Department of police. The direct control of the Okhrana was carried out by the governor of Moscow. The department search activity exceeded the limits of Moscow and province. The Okhrana carried out a role of all-Russian center of political detection. The structure of the Okhrana included five departments. The Special office department of the Okhrana in Moscow was established in 1902. It supervised (was responsible) the organization of external supervision (correspondence on external supervision, development of addresses of persons behind which it was necessary to establish supervision, daily data on external supervision and drawing up of reports and circuits of external supervision). The Secret-service department was founded in 1910. The Department managed of the an internal agents: development of the data received from confidential collaborates. The documents of fund contain data about politicians, public figures, scientists of the end XIXth - the beginning of XXth centuries (writers, journalists, lawyers: publishers, editors of newspapers and magazines, etc.), members of political parties (Russian Social-Democratic Labour Party, parties of Socialists - revolutionaries, Cadets, Octyabrists, Anarchists, Labours, etc.), national parties ("Bund", The Socialist Party of Poland, etc.), participants of mass actions, political demonstrations, strikes, the peasants' revolts, the armed actions against imperial autocracy, political exiled and convicted, etc.

Documents (more than 30000 cards) include all first and seconds names, a rank, a home address, a revolutionary nickname, a nickname of external supervision by undercover agents, an occupation, the name of the organization (party) to which the suspect belonged to. On cards there are photos (not everywhere), the list of measures (searches, arrests), a nickname of the agent, delivered data on the person, the contents of the report.

The main mission of our project is to create full collection of the documents images and database on one DVD disk probably. The given project is the joint one sold by archive and historical Faculty of Moscow State University. Together with workers of archive in the project students of historical faculty who work with archival documents during passage of educational archival practice are borrowed. Such practice is a component of educational process. As taking out of the true archival cards of fund from archive is impossible, students entered materials into a database, using the scanned images of the cards which have been written down on CD disks. According to results of input of the first file of cards (more then 10 000), pilot research of database had been led.

This project was supported by Soros Open Society Institute (Moscow).


Svetlana Mintz

Cancelled !

The Place and Function of University Course in Quantitative Methods for Historians

This paper put an actual topic for Russian universities as they are ready to take part in the Bologna reform.

The paper compares two periods in teaching University course in Quantitative methods for historians in Russia (1969-1988 and 1989-2003).

It describes a paradoxical result of intensive training of historians in statistics and computer science. Historians became familiar with the computer and the internet, but ten years of crash courses has been to return many historians to the "pre-information era". Many historians reverted to conceptualizing events in the manner of the eighteenth and nineteenth centuries. They began to utilize numbers as a metaphor. A paradoxical situation occurred - the widespread mastery of the results of advanced information technology caused scholarship to regress and deprived it of basic methodological achievements.

Main thesis of this paper is that in the history classroom, the mathematician or specialist in information science becomes the bearer of a historical-philosophic paradigm. This aids future historians to inscribe knowledge of history, and, in a broader sense, of the humanities and social sciences, in an existing hierarchy of scholarly knowledge.

The paper makes several assumptions about the role and place of courses of quantitative methods and computer science in contemporary humanities and social science higher education.


Ivan Sintchouk  

The onomastic cross-sectional study of "new belorussians": the modern databases in the special historical research.

The purpose of this work is the reception of the quantitative characteristics for an estimation of economic activity of the different ethnoses representatives and ethnic groups living in Belarus during 1993-1997 years. As there are no statistical data on this period the method of the reception of the required information to indirect attributes, which contains in surname's endings, was used. As a source for creation of databases focused on the decision of research tasks, the modern databases intended for needs of the machinery of management and information and publicity services (for example, lists of the telephone subscribers) were used.

The common number of Belarussian surnames in graphic variants is roughly close to 100 thousand and is reduced to 20-30 thousand phonetically close variants.

The complication of the analysis of surnames consists in their last graphic mobility, that has resulted in variety of the close forms, which are indistinguishable during the use of standard means of sorting and indexing.

In the given research the revealing, except for direct comparison of surnames, conformity and discrepancy of a share of surnames of elite and usual for region of a share of surnames of the population was most important. The applied toolkit well fixes within republic migrations of the Soviet period, first of all among the persons with high education, to which "new Byelorussians" mainly concern.

As far as Minsk, the capital of Belarus, grew up at the account of newcomers from different regions the distribution of formants for Minsk is close to the average for the country which is vivid from comparison of the telephone subscribers of Minsk and private businessmen of republic.

The share of participation of Jewish ethnos representatives in economic life is found out in some times less expected. Small share of Jews can have 2 main reasons: traditional from the Soviet Union times disguising of nationality by changing surnames or uneven migration of Jews according to age structure (young and middle-aged educated people emigrated first of all). Such a disproportion makes us pay more attention to the identification of social and economic activity of Belarussian Jews. There is ground to believe that traditional activity of Jews in this sphere has changed and social stereotype of a Jew does not correspond to the reality any longer.

It is rather essentially the small increase of a share of ethnic Russian in the sphere of economic activity of the country in comparison with their share among the population of republic, as the general share of Russian people in Belarussian population is rather significant. The phenomena of disproportionate representation in "postperestroika" business ethnic Russian is not an exclusive feature of Belarus - for example, the similar processes are marked in the neighbouring countries: Lithuania, Latvia, Estonia.

On the whole it is possible to speak about the smaller economic activity of the representatives of the title ethnos in comparison with the representatives of others ethnoses living in Belarus.


Andy Carver and Eve A Anderson

Data Integration and Modeling in the Social History Domain: An Open Source Project

Data exchange among genealogical applications and/or web-accessible data repositories involves expectations about the sorts of data that will be exchanged and what the data means. To be more precise: data exchange deals not with mere un-interpreted data (text strings, bit strings etc.) but information, propositions that are known, supposed, or hypothesized (by one or more persons) to be true. Primarily then, we must define the types of assertions that are to be stored, along with the relevant business rules (constraints and derivation rules) that limit the allowed fact populations and possible transitions between database states. Such expectations, collectively, define a data model. The structure of such a model is a conceptual schema.

In the analysis phase of a genealogy system development, a conceptual schema may be used to describe the information structure of the application domain in a way that is easily understood and validated by the domain expert. Once validated, the conceptual schema can then be mapped to logical/physical/external schemas using automated and/or manual processes. Neither XML DTDs nor XML schemas afford an easily understandable and semantically adequate way to model an application domain, as they lack mechanisms for representing several common constraints and relationships (e.g., exclusion constraints or many-to-many relationships). While the high level data modeling may be performed using attribute-based approaches such as Entity Relationship (ER) modeling [2] or Unified Modeling Language (UML) class diagrams [11], there are many benefits to be gained by using an attribute-free approach to conceptual information modeling, as found in the Object-Role Modeling (ORM) [6, 7] and Object-oriented Systems Model (OSM) [3, 4] approaches, which offer greater semantic stability, as well as graphical and textual languages for capturing a wider range of business rules. These approaches may also be fruitfully used to reengineer databases.

As part of a new project at Northface University to create an open-source genealogy system for pedagogic use, the ORM methodology was used to reverse engineer some well known genealogical schemas into ORM schemas. This process exposed various problems with these schemas. After briefly reviewing these genealogical models, we discuss a number of their problems, and identify some relevant design trade-offs. We then propose a possible solution, using ORM to visually specify business rules that typically cannot be displayed using attribute-based approaches.


Andrey Andreev and Leonid Borodkin

Nonlinear Modelling of Historical Social Conflicts: Self-Organization and Chaotic Behavior

As it was said in the discussion on applicability of chaos theory in historical research, "accurate dynamical models of history, were we able to produce them, would be nonlinear". In this paper we discuss nonlinear nature of historical social conflicts dynamics. We propose nonlinear model of strikes dynamics which includes three differential equations. Three variables of the model are: level of strikers activism, level of agitation and level of authorities pressure. Workers self-organization plays an important role in nonlinear effects of model behavior.

Computerized research of the model dynamics reveals chaotic behavior in some areas of parameters. The paper contains classification of chaotic regimes generated by the model.


Leonid Borodkin and Anna Konovalova

Chaotic Dynamics of Share Prices at St. Petersburg Stock Market in the First Decade of the 20 th century: Metal and Oil

Analysing historical time series one should realise that sometimes unexpeted behavior of the system under consideration could happen without any substantial external cause, so the metaphor "Big-Big" ("big causes generate big changes") is not valid generally speaking. Chaos theory gives an appropriate tool to study unstable social processes which can be characterised by sensitive dependence on initial conditions. With the development of specialised software the "new" sciences of chaos and complexity have arisen largely.

Stock market dynamics is one of "classical" examples of unstable behavior. The standard question is: what accounts for stock market booms and busts? Analysis of share price fluctuations is a complicated task - with too many contradictory factors (both internal and external) which must be taken into consideration. In our paper we analyse the factors influenced on fluctuating share prices of large Russian joint-stock companies, the securities of which were quoted in the St. Petersburg stock market between 1900 and 1909 (during the "decade of industrial stagnation" in Russia). Out of the two-hundred companies whose share prices were quoted at the beginning of the period, we have selected six of the most well-known joint-stock Russian companies: three of them are machine-building companies and three others are oil companies. The primary source for our research was a newspaper Birzhevyie Vedomosti ("The Stock Market News") of St. Petersburg, information from which was used to construct six time series - daily values of share prices of the six companies within the decade. As a result we have about two thousand points for each time series so they are rather long to be analysed by chaos theory methods to detect chaotic behavior.

The main aim of our research is to define the relative role of "internal" factors of the market dynamics, those related to the interrelationships of market players. The copies of Birzhevyie Vedomosti (among other periodicals from the beginning of the 20th century) contain quite a lot of descriptions of the stock market events and their interpretations which give an evidence of the noticeable influence of external factors on the stock market behavior and share prices changes (reports from foreign stock markets, political and economic news, etc.) However, we cannot avoid noticing the certain "insensitivity" of share prices as they relate to external events.

To analyse the dynamics of the six time series we used special software "Chaos Data Analyzer: The Professional Version". It should be noted that we detected chaotic behavior in all time series, however it was expressed more clearly for three machine-building companies. The results permit us to generate the hypothesis concerning the greater role of "internal" (homogeneous) factors of the functioning of the St. Petersburg stock market in the long-run consideration and substantial role of external (heterogeneous) factors in short run at the beginning of the 20th century.


Lars Nygaard

Methods and tools for testing, preserving and utilising electronic archives

Since the start of the era of preserving electronic archives (about 1980) the National Archives of Norway have received and preserved more databases from governmental information systems than "real" records from registry systems with electronic documents. According to the Norwegian tradition in this field information value is regarded as an equally important criterion for preserving records than evidential value. These information systems typically contain data about the population, real property, buildings, cars, ships etc. Fore more than 30 years we have also digitised (by transcription) paper based material of this type (e.g. census material and parish registers) to make it more available and utilisable to researchers and to the public, for the recent years through "The Digital Archive" on the Internet.

The information systems or their databases are not preserved in their original form. Instead, the data (complete or subsets) are exported from the database to hardware and software independent text files, which are stored for the future. Through the years it will probably be necessary to convert the data files to new formats (character sets etc.). This is called the migration strategy. In addition to the data files themselves, different kinds of metadata about the original system and the extracted data are collected and stored. Metadata about the content and the context of the system and its data will be kept in our archival description system Asta. Metadata about the structure of the data (often called technical metadata) have to be very detailed and standardised for the data to be effectively re-used by the computer. To achieve that the National Archives have developed an XML based "language" called ADDMML for describing the outer and inner structure of the data files. The structure is described on four levels; data set, data file, data record type and data item (field) type.

Arkadukt is an interactive program to be used by the records creators to describe the archival extracts of their databases in ADDMML before transfer to an archival depository. Arkadukt is based on Microsoft Access.

Arkade is a large system to be used by archival institutions to "prepare" received data files for long term storage, i.e. by analysing further the structure and the contents (e.g. the range of values in particular data fields) of the files, verifying the metadata in the accompanying ADDMML file, or converting the data files to another text file format (e.g. XML). Arkade will also be used to describe and manage archival material digitised by transcription. Arkade is developed on top of The SAS System.

Arkade also has a module which creates and fills a relational database in SAS with the data from the read text files. One of our plans is to let researchers and other skilled users (mastering SQL or the SAS language) access and utilise such archival data sets by means of SAS (e.g. for statistical analysis). Another plan is to load open electronic archives into MySQL databases and offer the users (even the public) access to them on the Internet through a general web interface.


Eila Williamson

Digitisation of Archive Material

The outcomes of projects which have digitised archive material are increasingly available over the WWW to the history researcher. This paper will survey what Scottish archival and library material has been digitised, and is available over the WWW, and will explore the subsequent implications which this has for research. Topics to be discussed will include: reasons why digitisation has taken place; how these projects have been funded; ease of access for users; whether or not there is any commercial gain for the archive or library; and what users are being catered for. Particular consideration will be given to the needs of different user groups, such as family historians and academics, and the possible dangers that prioritising one user group over others may have. Do projects which have digitised material with one particular category of researcher in mind actually inhibit research for another category of user?


Ole Martin Sørumgård and Arnfinn Kjelland

Putting a main part of Norwegian Local History - the old farm- and genealogical history genre - into the Computer

Norway has an old but still flourishing genre of farm- and genea-logical histories. Such histories describe the history of all farms and other dwellings in the selected local community, combined with genealogical data of past and present residents. The information is usually presented in a narrative form with dwellings ordered geographically, residents arranged in «families» or other groups and listed below dwellings they belonged to. Further, each individual has cross references to other places where he or she has stayed, place of birth etc. Because of this it is possible to trace individuals as they moved from place to place, map the pedigree of a selected person and so on. A large number of such books are still published each year in Norway. The area of a project is usually a parish, and the result is usually several large volumes.

Traditionally, manipulation and analysis of the huge amounts of historical data needed as background material are based on more or less manual, highly time consuming methods. In recent years, though, there have been some improvements, especially through faster and cheaper computers available. But each historian has still to rely heavily on his or her own skills and ingenuity in developing the appropriate tools.

It should be noted that historians working in this genre are forced to use somewhat different methods in their analysis, compared to both historical demographers and traditional genealogists. The demographers' approach yields valid results on the statistical level, but not necessarily about individuals in the material which historians and genealogists have to survey. On the other hand, genealogists often map specific families, leaving out the considerable number of individuals with no links to any known family, or no descendants. Clearly, historians in the «farm and genealogy» field have a point of view somewhere between the two others, and must in addition cover the general history of the farms and other dwellings in the researched area.

It should also be noted that there are not two such projects that are identical; they vary considerably as far as the settlement structure, size of population, amount and quality of source material etc. of the investigated areas (parishes) are concerned. Also the willingness of the project organisers (usually the municipality) to fund accurate collection, treatment and analysis of sources and the qualifications of the reseacher (often a local historian without formal training in history) can vary to a great deal. The results are books of very variable quality. However, good books in this genre can generate considerable income and contribute to the funding of the project. Such projects - at least 114 are ongoing in 2002 - were funded with a minumum of NOK 21,5 mill. (EURO 2,7 mill.) in Norway this year.

Despite several commercial software packages available, serving both demographers (statistical analysis tools) and genealogists (family record programs), none of them quite fullfills the needs of historians in the field. This is primarily due to their lack of capacity to handle farms and other dwellings, and the fact that they rarely can produce references between persons and places.

The purpose of this project is to develop a useful tool for historians working in this field, a tool to assist researchers in all stages of work, including final presentation of the achieved result as a book manuscript. The title of the system is «Busetnadssoge» («Settlement history»), abbreviated BSS, or BSS-application (under Windows). This application will be able to handle all such projects, regardless of settlement structure, sources etc. in the area investigated; the only limit is population size. BSS is now tested in a bygdebok project of Volda, with a present population of 8.500 (4.556 in 1900, 3.939 in 1801).

Some documentation in Norwegian is found at http://tilsett.hivolda.no/ak/BSS/Busetnadssoge.html


Aada Must

"The Estonians'History in the Databases of the Russian Memorial: an Analysis of the Results of Linking the Databases

At the 1998 International Conference of AHC, I introduced the databases I had created about the Estonians’ surnames and their ancient homes (by 1835 there were 41,000 different surnames, among them 29,000 unique ones that only occurred in one concrete parish), the shift of surnames (in the years 1935-1940 about 20% of the Estonians’ foreign-sounding surnames were Estonianized) as well as the preliminary results of the use of the database, gained in the research on internal-Estonian migration.

In the years 2001-2003, we have applied the mentioned database and new search engines to the research on the history of the Estonians living in Russia (in 1918 c. 130,000 persons). Luckily, I found an important source of individual biographical data of the communist regime victims (incl. their place of birth and that of residence when they were arrested, etc.) in the databases of the Russian Memorial, published on the CD ROM (partly supplemented on the Internet).

The Memorial’s databases, written in Russian (in the Slavic alphabet) cannot directly be linked with the database of the Estonians’ surnames in the Roman type, there are curious examples in the practice of transliteration of place names, cf. “St Isaac’s Parish” = “439 kg”). Based on an analysis of practical data we have created a respective search engine. We had to eliminate German-origin surnames from our research (denoting mainly Germans, seldom Estonians, in the Memorial’s database). This elimination was supported by the search for specific Estonian first names (Endel, Virve, etc.). However, in most cases the Estonian first names became Russianized in Russian-language documents (cf. Andres ?Andrei, Tõnis ? Anton, the end of the typical Russian marker of gender “a” of Russian female first names was added to Estonian names, etc.).

The internal source-criticism of the Memorial’s databases revealed essential errors. E.g., 100% of entries in the database of the victims of the Pskov Oblast contained gross mistakes (thus, 250 Estonians had been, after their execution, for the second time arrested and shot; in processing the Excel Table only part of the fields had been selected). As is the common knowledge – a small confusion can be caused by a historian by himself, a chaos needs a computer.

Until 1991, the theme about the Estonians’ resettlement movement to Russia was only episodically-fragmentarily researched (mainly based on the reference book published in 1918 about Estonian villages in Russia). Thanks to our links, we found a number of Estonian colonies in Russia, not mentioned in the historiography to date. Novel data were found about the Estonians who had settled down in towns and cities (incl. those who at the beginning of the 20th century had gone to Russia to work as agronomists, veterinarians, station masters, cheese makers, engineers, mechanics, car drivers, teachers, etc.) and to make educational career (e.g., a great number of Estonians had studied in Siberia, at the University of Tomsk). We drew up a detailed map about the proceeding areas of emigration, constructed emigration timelines (the ancient home of the surname, birth place, various areas of settlement in Russia, a map of later more important centres of attraction). We gained massive biographical and genealogical information (e.g., about the people who at the beginning of the 20th century were not given permission to resettle to Siberia but finally they somehow reached their dreamland – we found their names on the list of those shot in the Krasnoyarsk district in 1937). Likewise, the search engines we created yielded important information about the remigration of the Estonians from Russia, however, it has been a suppressed topic (primarily for psychological reasons, to avoid being repelled, etc.) for decades.


Elwin Koster

Mapping and analyzing the urban landscape

In 1960 the British geographer M.R.G.Conzen published his book on the urban transformation of the townplan of Alnwick. In his introduction he announced two other volumes on the building types and the land use patterns of the same town. When he died in 2000 these volumes where still not published. In his personal archives a number of notebooks were found containing surveys of several towns, including two surveys of Alnwick, the first one was carried out in 1953, the second one in 1964. With the use of a Geographical Information System the data from these surveys are now mapped. The main question is if specific queries can be used to answer the questions that Conzen asked in order to write the two missing volumes of Alnwick. A related question is how the urban landscape developed over the last 50 years, now a third survey is made in 2003.


Peter Doorn (Netherlands Institute for Scientific Information Services - NIWI) and Kees Mandemakers (International Institute of Social Science History - IISH)

Life courses in context: a collaboratory based on Dutch population registers and censuses (19th and 20th century).

Summary

Two institutes of the Royal Netherlands Academy of Arts and Sciences (KNAW) have started a programme to develop a database with about 40.000 individual life courses of people born in the Netherlands the period of 1863-1922. This micro-data will be supplemented with aggregate (tabular) data from the Dutch censuses as they were taken between 1859 and 1947. In a few years time, all Dutch censuses between 1795 and 1971 will be available in electronic form.

Contrary to some other Western countries, census data on individuals for the period before 1960 do not exist anymore for the Netherlands. To bring research on comparable international stand-ards it is necessary to reconstruct these data. But, the construction of this database will not only eliminate the disadvantage of lacking data on the micro level, faced by the Netherlands compared with other countries. The HSN database, drawn from the Dutch population registers, will actually put the Netherlands in a unique position in the international research world. Because the popu-lation registers have recorded details of every change of residence for every individual since 1850, investigators can have access to research populations that are not limited to persons who stayed put in just one municipality.

The database of the Historical Sample of the Population of the Netherlands (HSN) will cover the entire country and contain micro-level data on the life courses of over 40,000 individuals born between 1863 and 1922. These life courses are to include data on each successive family situation in which the individuals lived, all the addresses where they lived, as well as data on the religi-on and occupational title of each subject and of every person with whom they co-resided (and, for married subjects, data on the occupational title and place of residence of family members of the subject’s spouse). In the social and historical sciences, the life-course perspective is increasingly important in explaining demographic and socio-structural processes. The HSN database will co-ver the entire period from 1863 to 2000 and can be regarded as a chronological expansion of the retrospective databases currently in use by the social sciences.

Individual life courses have to be analysed in a rapidly changing environment of an industrialising and modernising Dutch society. The Dutch national censuses form a fundamental source of information for conditions on the level of the municipality. In addition to the population size, popula-tion censuses contain information on the structural characteristics of the population, such as age, gender, marital status, religion, household status, occupational activity, and nationality. In some years the censuses were combined with an occupational census and a housing census. The project includes the digitisation of the population and occupational censuses of the Netherlands for the full period 1859 to 1947. The data will be linked and integrated with those from the Historical Database of Dutch Municipalities and will build on those censuses already compiled by the Netherlands Institute for Scientific Information Services (NIWI) and Statistics Netherlands (1795-1859, 1899, 1930, 1960 and 1971).

The expansion of the existing databases of HSN and NIWI into complete systems will be of ma-jor significance in the long term. In the short term it will provide a strong stimulus to academic research into social, demographic and economic developments over the past two centuries. This programme will result in a series of studies in historical de-mography, social and economic history, human geography, sociology and epidemiology. The work will be performed in stages so that the data will become available for research before the project is completed.

Another aspect of methodological innovation envisaged by this programme lies in the scope for connecting data from the micro and meso/macro levels. The population censuses and other muni-cipal data offer a context for the individual-level and family-level data. The combination of the different sources will create new opportunities for multi-level or cross-level analysis.

The results of the proposed project will be presented and communicated in the form of a collaboratory for the humanities. A collaboratory is an open meta-laboratory that spans multiple geographical areas with collaborators interacting via electro-nic means - "working together apart." The impact of these databases will however not be confi-ned to the research world. The databases will be accessible on the Internet to the general public. However, for privacy reasons files or data relating to individuals still living will only be accessi-ble using a technique known as remote execution.


Evan Roberts

Integrating international census data: the North Atlantic Population Project

Integrating international census data: the North Atlantic Population Project This paper discusses the North Atlantic Population Project (NAPP), which is harmonizing and distributing complete-count historical census microdata from Canada, Great Britain, Iceland, Norway and the United States for the late nineteenth century. NAPP is a collaborative effort, involving researchers from 7 institutions in five countries. The first preliminary release of harmonized data are scheduled to be released shortly before the IAHC meeting. This paper gives an overview of the data made available in the preliminary release, and outlines what the database will contain at the final release in summer 2005. Potential research applications of the databas


Michelle van den Berk and Peter Doorn

A data archiving disaster: The Dutch population census of 1960

The 13th Dutch population census of 1960 was to be the first 'digital' census in the country. Data from the questionnaires concerning about 11 million inhabitants were entered manually into computers, analyzed, published, and then stored on punch cards. That is the digital part.

However, the archiving of punch cards is a different matter altogether, which has far more to do with traditional than with digital archiving, and at this stage things started to go crucially down-hill. When storage space was needed for the storage of the cards of the 1971 census, the 1960 cards were acquired by the Steinmetz foundation (the forerunner of the social science data archive in The Netherlands) and transferred to the University of Amsterdam. Optimal storage locations were not available, nor was money to keep them in storage: boxes with cards got moist, punch cards crumbled, were torn, and stuck together.

When the punch cards were read into a computer again in the early 1970s, some more mistakes were added: many cards were read twice, many others were not read at all. Some 300,000 records are missing; some 250,000 records have been entered twice. In those days, the processing time required on the mainframe computer of the academic computer centre to investigate the full extent of the problems exceeded the computing budget of the whole faculty of the social sciences. The owner of the material, Statistics Netherlands, decided that the resulting dataset was so incomplete and corrupted that it had no sufficient value and could be discarded: fortunately, it was not.

A good 40 years after the census was held, the retrospective digitization of the Dutch censuses forms the framework for a project in which we are trying to reconstruct:

  • What exactly went wrong and when
  • What can be done to create a usable data set from the digital files with the help of previously published tables?
  • How can the data be made available for historical and social science research?
Needless to say, data entry from the original forms or cards is no option: they have all been destroyed a long time ago.


Rafal T. Prinke

Digitised sources and research strategies

Dynamically growing availibility of digitised historical sources and secondary literature presents important new challanges to the whole community of historians, ranging from ethical and economic issues related to the ease of access to electronic resources, to what can be called "research strategies". The focus of the intellectual process involved in planning and doing research work is slowly moving away from the traditional reading to computer-assisted searching, comparing and analysing digital information. As a results the researcher gains on speed and scope, but also loses on contextual and accidental aspects of her perception of the source. New emerging standards for electronic editions of sources require new technical abilities from the researcher - but also a different mental approach in which the visualisation of the research process must take into account the very nature of the application of strict algorithmic methods to the imprecise matter of history. Sharing the method in order to allow peer control of the results (which is the essence of scholarly approach) must include sharing the algorithmic tools and digitised data which is not always possible for various reasons. While "historical computing" is becoming widely recognized and applied by all historians, the major task for the former pioneers in the field is to make them realize not only the advantages of using new technologies (which is hardly needed anymore) but the challanges they have to face.


Patricia Kelly Hall

World War II Black Veteran Migration and the Distributive Impact of the G.I. Bill

This paper uses census microdata from the IPUMS database to examine the extent to which the government's investment in World War II veterans under the "G.I. Bill" was unevenly distributed among regions and types of places as a result of differential patterns of internal migration. The IPUMS database contains all the information collected by the census bureau for a 1 percent or larger sample of the entire U.S. population from 1850 through 2000. With the application of computerized statistical techniques, the power of this database can be harnessed to answer research questions not readily addressed with traditional historical evidence.

Internal migration in the United States, which had begun to increase during the Great Depression, rose rapidly during and after World War II. Earlier research has shown that veterans migrated at significantly higher rates than non-veterans and constituted a substantially higher proportion of every migrant group. White migrants showed higher absolute gains to migration than blacks with additional gains to veteran migrants. But black veterans showed a more dramatic rise in relative income over non-veterans than did whites. What has not been examined is where these veteran migrants came from and where they settled.

The differential distribution of veterans within migrant groups suggests that the direct economic benefits veterans were given with the G.I. Bill - as well as the indirect returns to investment in human capital - resulted in an uneven spatial distribution of these social welfare dollars. During a critical period of economic growth and diversification, some areas and types of places may have received a disproportionate share of the government's veteran benefit dollars


Anne Lif Lund Jacobsen

Using GIS and Multivariable statistic in regional economic analyses

Original situated within the field of marine environmental and economic history, this paper will investigate the potential of using multivariat statistic combined with GIS to outline the structure of a regional economy.

The object for research is the Limfjord fishery from 1890-1920. During my research the question was raised how the developments of new fishing techniques had an effect on the local investment pattern in fishing gear, and how these investments were distributed spatially. In order to do so a combined analysis of qualitative and geographical data found in the annual fishery statistics is made. Whereas there was no or little spatial correlations between the single sets of data, a multivariate analyses of several data sets shows more promising results.

The ultimate aim is to develop an integrated method of analysing historical statistic data with both multivariable statistic and GIS


Andrei Volodin

Statistics Multimedia Teaching

The paper presents first results of Multimedia Teaching Project, initiated by Prof. Joerg Baten (Tuebingen University), that aimed to prepare multimedia presentations teaching statistics, econometrics, and economic theory. My presentation is based on the teaching materials prepared in Laboratory for Historical Computing of Moscow Lomonosov State University.

Integral multimedia presentations with support of Microsoft PowerPoint and TechSmith Camtasia Studio technologies are elaborated/ Each presentation consists of several main parts: text explanations, images and graphs illustrations, audio-files which pointing out main assertations, and video-files showing how to use properly statistic programs such as StatSoft "Statistica" and SPSS. Such notions and methods of statistics as correlation and regression are treated, mostly "Statistica" computing experience including. All multimedia files produced in this project based on original Camtasia technologies and integration is executed in PowerPoint framework.

The prospective product of Statistics Multimedia Teaching Project is a number of CDs, proving full necessary information for lectures improving and distance training. This year our product will be used in the course of teaching applied statistics at the History Faculty of Moscow Lomonosov State University.


Alla Polevaya (co-authors L.Borodkin, I.Garskova, T.Valetov, A.Volodin)

Internet Resource on the History of Labor Relations in Russian Industry: 1880-1930 1)

Since the beginning of 2002 the Electronic Library of the Faculty of History (Moscow Lomonosov State University) includes a new subject-oriented section devoted to labor history, 1880-1930. This complex resource is named "The Evolution of Labor Relations in Russian Industry: From Pre-Revolutionary Industrialization to NEP" ( http://www.hist.msu.ru/Labs/HisLab/ ), it is constantly renewed. Nowadays the resource (short name: WHR - Working History Resource) has the following subsections:

  1. Basic legislative and law materials concerning labor relations;
  2. Books and articles on state regulation of labor relations;
  3. Texts of contemporaries dealing with the problems of labor relations, labor legislation and its realization;
  4. Bibliography;
  5. Texts written by the Project participants on the subject-matter (and by some historians-collaborators);
  6. Data bases;
  7. Digitized archive and other documents (images);
  8. Electronic texts;
  9. Visual materials (photographs, newspaper and journal illustrations, etc.);
  10. Links to the materials, placed on other web-sites.
By the beginning of 2003 the electronic resource WHR has become the most representative and imposing one in Internet regarding the quantity and quality of information about labor relations in Russian industry during the period under consideration. So, Section 5 contains a big collection of publications of the last decade on the problem of motivation of industrial labor during pre-revolutionary industrialization and NEP (New Economic Policy). In Section 4 is represented the unique bibliography on working history, composed of the materials from digests of the 1970s - 1980s and contemporary Russian and foreign historiography (about 2000 items). Section 8 is filling up with electronic texts of archive documents (from Moscow and regional archives) and newspaper materials of the 1920s. So, sources difficult of access are put into use. Section 3 contains a representative set of related electronic texts of articles, published in journals Vestnik Truda and Voprosy Truda in the 1920s - early 1930s., reflecting the discussions of that period on the issues of regulating labor relations in Soviet industry. In the same Section is represented the data base concerning 2577 articles published in those journals during the period under consideration (made by the journal's contents). In Section 1 and 2 we go on collecting materials on law regulation of labor relations and the role of the State in that process (especially the recent publications). Section 7 and Section 9 reflect another aspect of labor relations (anthropological); they contain materials of individuals' (photographs, diplomas, identity cards, etc.). Section 10 is very important, it includes links to other Internet resources on the Project's subject-matter. The emphasis is done on the document collection giving "visual images" of the epoch in the context of the issues of working history (photographs, posters, etc.); here are also collected almost all articles and other publications on working history of the period found in Internet.

As the sections of WHR are filled up with new electronic documents the necessity of providing the Resource with different services increased. Spade-work has been done to provide the resource with ftp-access to document collections.


1) The work is being done with the financial support of Russian Foundation for Humanities (N 02-01-12001v).


Wolfgang Meixner

Regional and social background of expatriated illegal members of the NSDAP in Austria between 1933 and 1938.

A central role in the research to NS-history takes the question about the social and regional composition of the NSDAP. This is particularly of importance, there the NSDAP changed in Austria 1932 from a very job-specific party, consisting of officials of traffic, commercial employees, officials and freelance academic graduates, to a mass party. With the prohibition of the party in Austria (1933), membership registers are missing in order to be able to reconstruct for instance social recruiting of the party comrades. Newer research showed the fact that the investigations accomplished by Gerhard Botz for the social structure of the NSDAP only for the time up to the party prohibition are plausible but for the time of "illegality" cannot convince. Its attempt, on the basis after the "Anschluss" honour for the sake of lent membership numbers the social and regional composition of these "alten Kämpfer" ("old fighters") determine to be able, today one criticizes and is not longer practicable. Arises however the question, which documents us then for the order, around the social structure of the "NS-Gesinnungsgemeinschaft" to examine for the time between 1933 and 1938? Such a source could be the more than 10.000 of persons comprehensive "Ausbuergerungsverzeichnis", in that all expatriated persons with birth, date of birth and residence as well as occupation data gone merrily since the party prohibition in the sense of the prohibitory law of the nationality are registered. The straight occupation data found there place the research however again before problems, in particular with the setting-up of horizontal and vertical stratum models. In my contribution I will bring up these problems for discussion and will try to present proposals for solution.


Irina Garskova

MA Programs in Historical Computing (the Experience of Russian and CIS Universities): Application of Advanced Research Projects

Historical computing at Russian and CIS universities (as well as at Western universities) is dramatically changing under the impact of new information and communication technologies. Professional historians more and more often deal with large scale innovations at libraries, museums, archives, publishing and multimedia houses and other repositories of the national cultural resources. Advanced skills in computing are extremely needed to enable students to get a good jobs in the information society. New professional disciplines emerge as a response to the challenges of information technology. Humanities departments tend to innovate the learning content and the learning methods.

Some CIS universities have already recognized the need to establish and develop historical computing as a teaching subject, to include MA in historical computing in the general curriculum. Moreover, historical computing MA programs have recently been opened at the Belorussian State University, Altai State University, Stavropol State University, and some other CIS universities.

The established MA in historical computing curriculum include both traditional approaches on one hand, and the knowledge and skills of advanced computing methods and technologies, on the other hand, as two important components of professional education. The second component embraces:

  1. general theoretical and methological issues on advanced computing in the humanities, adopted mathematical cources (mathematical logic, theory of probability and mathematical statistics, etc.);
  2. quantitative methods in the historical research, computer simulation as a tool in historical research;
  3. basic and advanced courses in formal methods and advanced computational processing, database management, visual processing and visual analysis, multimedia applications, text-based and multimedia-based computing.
Historians in the information society become more experienced in using historical sources in electronic form. For this reason the forming a teaching repository of electronic resources is very important, especially taking into account that a lot of electronic resources are available today on the web. Many of these resources came from research results and can be secondary used for educational purposes. Forming the electronic library of readers and links to existing Internet teaching resiurces is also very important for teaching historical informatics disciplines.


Galina Mozhaeva

Humanitarian informatics: historical aspect and problems

Peculiarity of contemporary society's development is conversion to its new formation - to informational society. The category of informational society is topical question now at the context of the humanities, because it integrates all sphere of humanitarian knowledge in itself. That is why informational society is object of philosophic, philological, sociological and historical investigations. The process of informatization of society, connecting with penetration informational technologies to the humanitarian investigation's tools, have lead to of principle modification of humanitarian knowledge's structure, when information has become primary category. This process of transformation of informatization as reflection on technological progress in the humanities to the primary mechanism of structure humanitarian knowledge's reorganization and change of interpersonal relations is happening during several decades.

Historical researches, which are forming the base for philosophic comprehension of contemporary society, are taking on special actuality in understanding of informatization`s process.

In the informatization`s process of social humanitarian sciences are marked out three stages of informatization.

First stage - technical (1960 - 70-th years). It is characterized by using IBM in applied investigations, making instrumental facilities.

Second stage - technological (1980 - 90-th years). Using of informational technologies in social humanitarian investigations, development of new research methods, which making on the base of informational technologies, have begin at this stage. Technological stage gives the start of development of «branch informatics» - historical informatics, economic informatics and others. Now we understand that the development of any «branch informatics» have faults to the side of applied researches, basing into practical using of informational technologies, to the side of preference logic of facilities rather than logic of matter of humanitarian investigations. At the beginning of informatization in humanitarian as the development and the introduction instrumental facilities it was necessarily and helped to develop humanitarian sciences.

Contemporary informational surroundings is making new aspirations to the system of humanitarian knowledge. The process of informatization is deeply wounding the sphere of humanitarian education: its methods, technologies and governing apparatus and sphere of basic researches, where informational technologies are using not only in the stage of elaboration, keeping and analysis of codification of traditional material and become the object of humanitarian research.

Informatization of the humanities have discover deep social changes which produce by process of informatization. The start of third stage - methodological - connect with it at the 2000-th. At this stage recomprehension of the understanding of informatics, change of scientific views on informatics as science have place. Clarification of mechanisms of informatization`s influence on the `s development, on forming of new stage of culture, construction of theory of informational society is starting. It is clear that at the field of contemporary information science vision have to be appropriateness of beginnings and functioning all sorts of information, appropriateness and consequences of informational processes in the society.

Study of informational processes, sources and channels of information will allow to understand reasons and disposition of social behavior, social interactions. Investigation of humanitarian problems of informatics is playing to leading outlook part, which manifests itself in formation of whole system informational world's picture, in understanding of community of the informational processes of operation in animate nature, society and technology.

The necessity and importance of such researches is clearly defined by number of factors.

In the first place, information has become determining category in the economic and social development at the modern society. That is why informatization (as the main mechanism of the conversion to informational society) is becoming a subject of the researches of very wide sections of scientists now.

Secondly, informatization of society is driving to modification of social connections and relations between people. Creation and development of the computer communication calls modification of communication processes in the society and character of communication interactions. Mass communications exchange system of social culture perception of the information, allow to manipulate by people consciousness.

Thirdly, introduction of person computers and development of the informational and communication technologies provide the impact on the person development, on his world outlook change, on the system of personal valuables. The man is plunging into virtual and real computer worlds more and more, and he is confronting with necessity of change of life style, of way of living and thinking, of the character of rapport with surrounding world.

The further development of informatization demands not only computer literacy, but certain lever of the informational culture, which is founded on the understanding of the law-governed natures of informational society's development. The understanding of the gist of the informatization and informatics must change in the first place. Development of the information, expansion of informational canals, deepening of connects between these canals, intensification of there influence to the man and society, but not only inculcation of informational technologies in different spheres of the society's life are understood as informatization now. That is why the social aspect of informatics, determining its as the humanitarian science ore humanitarian informatics, arises.

Humanitarian informatics is the science, which investigates the law-governed natures of the information's beginning and development in society, the law-governed natures and results of the informational processes in society, the philosophy and methodology of the informational society, the informatization as the social phenomenon.

The sufficient scientific material already has been made, and the informational society's theory has been developed. This theory gives the possibility to speak about humanitarian informatics not only as scientific direction, but as knowledge's system about informational society, which may be the base for the training of specialists in this field.

At the same time, the situation has formed in the education system, when there aren't specialists in the field of humanitarian informatics. The thorough research into social sphere's informatization only begins in Russia. The concept of informational culture isn't studied in historical development and philosophical comprehension. First of all, this fact has an influence on education. The technical perfection in the field of informational technologies is losing its meaning if the appropriate pedagogical means are absent. These means may be made on the base of new methodological principles only from the field of the humanities.

The specialists in the humanities are necessary for decision of these problems. They will help to develop the informatics as the science, which determines the modern society's development at all relations. It's necessary to create the system of the train personnel - the specialists in the humanitarian informatics's field, and to organize the scientific research in the informatization's field by way of consolidation of the specialists in different humanitarian fields and in the inmormatics's field.


Marc St-Hilaire, CIEQ/Geography, Université Laval

Byron Moldofsky, Geography, University of Toronto

Mapping 20th Century census micro-data: The Canadian Century Research Infrastructure cartographic system

An interuniversity and multi-disciplinary team has recently undertaken the building of a set of census micro-databases in order to facilitate historical research on the Canadian society in the first half of the 20th century. One innovative aspect of the research infrastructure to be developed is the cartographic component associated with the micro-datasets, a mapping system intending to provide the researchers with tools which will allow them to partly take into account the spatial components of the phenomena they study. This paper aims to present the mapping system, from the principles that guide its construction to the product to be delivered and the technical frame of its utilisation. A special attention is paid to the concepts related to the geographical construction of the census, as to the methodology used to create the basic cartographic files underlying the whole system.



Norwegian Historical Data Centre (NHDC)
The Faculty of Social Sciences, University of Tromsø N-9037 Tromsø, NORWAY
Updated: March 16th 2004