Sergio Contador
September 2019
This application collects scientific sources from 60 repositories of spanish universities that uses the OAI-PMH communication protocol.
The application is divided into four parts or sections, two of them divided into three and two subsections each:
What is Open Access?
Free way to access scientific information.
Where to find the scientific information?
In the repositories.
What are the repositories?
Centralized sites where digitized information is stored and maintained.
The information is distributed through a computer network that communicates with different interoperability protocols.
One of the main protocols used is OAI-PMH.
OAI-PMH: Open Access Initiative - Protocol for Metadata Harvesting.
Dublin Core metadata communication protocol.
It uses multiple Data Providers that communicate with multiple Service Providers.
Data Providers store and maintain repositories.
Service Providers looks for Data Providers and use them for the creation of value-added services.
Service Providers makes a metadata request to the Data Providers.
In response, Data Providers sends a set of records in XML format.
Some of the spanish Data Providers based on Dublin Core metadata communication protocol are shown in a table with five variables the user can select:
VARIABLE | DESCRIPTION |
---|---|
name | name of data provider |
url | uniform resource locator of data provider |
url_oai | uniform resource locator for metadata access of data provider |
type | type of data provider |
spreader | data provider institution |
Information about spanish universities included in the application are shown in a table with eight variables the user can select:
VARIABLE | DESCRIPTION |
---|---|
university | name of the university |
acronym | acronym of the university |
type | type of university |
city | city where the university is |
fundation year | year of fundation of the university |
number of teachers | number of teachers of the university |
number of students | number of students of the university |
repository size | repository size of the university |
The application shows each university in Google Maps locating with its own logo.
The user can move through the map and see the exact position of the university.
The user can also access to the url of the university and the repository.
To access at the information store in the repositories, the application uses an advance search engine where the user can perform and advance search using a query filtering the information through three text boxes, and four zones with multiple boxes of selection.
The name of each text box and zones, and their description, are shown in the following table:
NAME | DESCRIPTION |
---|---|
title | title that appears in the source |
creator | author/authors who have written the source |
subject | keywords that appear in the source |
type | type of the source |
language | language of the source |
year | year of publication of the source |
university | university owner of the source |
The search engine allows the user to perform independent searches of the title, author/authors and subject of the source.
The search is done using boxes with text entries.
Natural language processing techniques are used to correct tildes, capitals and other incorrect entries of text.
The search engine distinguishes 30 different types of sources, 40 languages and 626 years. The search is done through boxes that the user can select.
The user can accept the selection made by pressing the accept selection button.
Next, the system returns the results founded based on the selection that the user has made.
Also, the user can reject the selection by pressing the delete selection button, and start a new search.
The results are displayed as text in 3 parts:
10 sources per page are displayed.
The user can scroll through the pages using the forward and backward buttons (previous and next), or perform a new search by pressing the new search button.
The results are organized taking into account their relevance, where the most relevant results appear in the first positions. Specifically, the date on which the sources have been published in the repository is taken into account.
Sources that are published most recent are displayed at the beginning, while the oldest are displayed at the end.
The user can select different variables of the universities for plotting in 3D pie chart:
The number of universities included in the plot can be selected.
The user can select different variables of the repositories included in the search engine for plotting in 3D pie chart:
The repository included in the plot and the numbers of categories of the variable selected can be chosen.
A network showing the relations between universities based on relations between creators of the different sources included in the repositories is plotted.
The user can select different number of relations and the vertex size included in the plot.
We recommend you try all these features here!!