Workshop II: Data sources and methods in bibliometrics

Krešimir Zauder (Institute for Social Research in Zagreb, HR)

 

Abstract: Bibliographic and citation metadata form a complex data specialization with its own sources, formats and challenges in data preparation and analysis. Because this data is textual and often semi-structured, indicators calculated automatically by online databases are frequently biased or inaccurate. Preparing and analyzing this data well, on the other hand, requires significant expertise.

The goal of the workshop is to familiarize participants with the broad range of problems in preparing and analyzing bibliographic and citation metadata for bibliometrics. The workshop will be organized into four parts of 45 minutes each. The first part will explain basic problems in scientometrics (e.g. citation indexing and problems in calculating various indicators). The second part will be dedicated to collecting and migrating bibliographic and citation metadata. The last two parts will provide an introduction to algorithmic data handling with the Python programming language, which opens many possibilities that are essential for bibliometrics but also useful in the broader context of handling bibliographic data. These parts of the workshop do NOT presuppose prior programming experience; Python is frequently cited as a good language for learning to program. However, given the scope of the subject matter, the workshop does require participants to be proficient in computer use.

 

Translated from the Croatian:

Data sources and data processing in bibliometrics

Bibliographic and citation metadata form a complex body of data with its own sources, formats and challenges in data processing. Given the textual and often semi-structured nature of this data, indicators as calculated by online databases are hard to rely on. Quality local processing of this data, in turn, requires considerable specialized computing knowledge.

The goal of the workshop is to introduce participants to the problems of retrieving and processing bibliographic and citation metadata for bibliometric purposes. The workshop is planned to last four class periods. The first period will be devoted to general problems in bibliometrics (e.g. the nature of citation indexes and the accuracy of calculating indicators such as citation counts and the h-index). The second period will give an overview of the different ways of retrieving and exchanging bibliographic and citation metadata. Given the nature of the data, the last two periods will be devoted to algorithmic data processing with the Python programming language, which opens many possibilities that are necessary for bibliometrics but also useful in the broader context of processing bibliographic metadata. This part of the workshop does NOT presuppose prior programming knowledge, and Python is also ideal for a first encounter with programming. Still, given the subject matter, the workshop requires a solid grounding in computer use.

Location: Old Campus
Date: September 19, 2014
Time: 2:30 pm - 5:30 pm