OAIUNI

Sergio Contador
September 2019

Introduction

This application collects scientific sources from 60 repositories of spanish universities that uses the OAI-PMH communication protocol.

Sections

The application is divided into four parts or sections, two of them divided into three and two subsections each:

  • Open Access:
    • OA.
    • OAI-PMH.
    • Spanish Data Providers.
  • Universities:
    • Information.
    • Location.
  • Search.
  • Statistics.

Open Access: OA

What is Open Access?
Free way to access scientific information.

Where to find the scientific information?
In the repositories.

What are the repositories?
Centralized sites where digitized information is stored and maintained.

The information is distributed through a computer network that communicates with different interoperability protocols.

One of the main protocols used is OAI-PMH.

Open Access: OAI-PMH

OAI-PMH: Open Access Initiative - Protocol for Metadata Harvesting.

Dublin Core metadata communication protocol.

It uses multiple Data Providers that communicate with multiple Service Providers.

Data Providers store and maintain repositories.

Service Providers looks for Data Providers and use them for the creation of value-added services.

Service Providers makes a metadata request to the Data Providers.

In response, Data Providers sends a set of records in XML format.

Open Access: Spanish Data Providers

Some of the spanish Data Providers based on Dublin Core metadata communication protocol are shown in a table with five variables the user can select:

VARIABLE DESCRIPTION
name name of data provider
url uniform resource locator of data provider
url_oai uniform resource locator for metadata access of data provider
type type of data provider
spreader data provider institution

Universities: Information

Information about spanish universities included in the application are shown in a table with eight variables the user can select:

VARIABLE DESCRIPTION
university name of the university
acronym acronym of the university
type type of university
city city where the university is
fundation year year of fundation of the university
number of teachers number of teachers of the university
number of students number of students of the university
repository size repository size of the university

Universities: Location

The application shows each university in Google Maps locating with its own logo.
The user can move through the map and see the exact position of the university.
The user can also access to the url of the university and the repository.

Search

To access at the information store in the repositories, the application uses an advance search engine where the user can perform and advance search using a query filtering the information through three text boxes, and four zones with multiple boxes of selection.

Search

The name of each text box and zones, and their description, are shown in the following table:

NAME DESCRIPTION
title title that appears in the source
creator author/authors who have written the source
subject keywords that appear in the source
type type of the source
language language of the source
year year of publication of the source
university university owner of the source

Search: title, creator, subject

The search engine allows the user to perform independent searches of the title, author/authors and subject of the source.

The search is done using boxes with text entries.

Natural language processing techniques are used to correct tildes, capitals and other incorrect entries of text.

Search: type, language, year, university

The search engine distinguishes 30 different types of sources, 40 languages and 626 years. The search is done through boxes that the user can select.

The user can accept the selection made by pressing the accept selection button.

Next, the system returns the results founded based on the selection that the user has made.

Also, the user can reject the selection by pressing the delete selection button, and start a new search.

Search: results visualization

The results are displayed as text in 3 parts:

  • part blue : title and link of the source.
  • part green : description of the metadata associated with the source (type, language, year and university).
  • part black : summary of the source content.

10 sources per page are displayed.

The user can scroll through the pages using the forward and backward buttons (previous and next), or perform a new search by pressing the new search button.

Search: organization

The results are organized taking into account their relevance, where the most relevant results appear in the first positions. Specifically, the date on which the sources have been published in the repository is taken into account.

Sources that are published most recent are displayed at the beginning, while the oldest are displayed at the end.

Statistics: University

The user can select different variables of the universities for plotting in 3D pie chart:

  • number of teachers.
  • number of students.
  • size of the university.
  • size of the repository.
  • rwwu ranking.
  • alexa ranking.

The number of universities included in the plot can be selected.

Statistics: Repository

The user can select different variables of the repositories included in the search engine for plotting in 3D pie chart:

  • type.
  • language.
  • year.

The repository included in the plot and the numbers of categories of the variable selected can be chosen.

Statistics: Network

A network showing the relations between universities based on relations between creators of the different sources included in the repositories is plotted.

The user can select different number of relations and the vertex size included in the plot.

We recommend you try all these features here!!