Skip to content

Latest commit

 

History

History
132 lines (69 loc) · 8.33 KB

README.md

File metadata and controls

132 lines (69 loc) · 8.33 KB

EEXCESS

This is the Github repository of the EU-funded project EEXCESS.

Overview

EEXCESS develops open source software for augmenting web channels like web sites or social networks with cultural heritage and scholarly content. It consists of several sub-projects that combine into a privacy-preserving, personalized recommendation service for cultural heritage and scholarly content. It consists of the following components:

  1. A federated recommender system, that recommends resources from distributed cultural heritage and scholarly data providers. Recommendations are personalized while at the same time retaining the user's privacy. Besides the federated recommendation, we also provide tools for data mapping based on Semantic Web standards and for privacy-perservation (see the Privacy Proxy).

  2. A Partner Wizard which allows to integrate new partner systems using a simple web-form. A partner system is a content provider which is used by the federated recommender to generate recommendations. So without coding you can integrate most search APIs.

  3. Client-side augmentation and visualisation technologies to inject cultural heritage and scholarly content into habitually used user channels. This includes injecting recommendations into HTML pages via a Chrome Extension, into content management systems like Wordpress, into Google Apps for Works, into Learn Management Systems like Moodle and into social networks like Twitter.

  4. Several proof-of-concepts that take up additional usage routes for cultural heritage and scholarly content. For example, a Cultural Heritage Game with a Purpose (CGWAP), an Android App for recommending Europeana Content and an Angular JS based pargraph injection called Jarvis.

The project is currently under development, so not all functionality is realised yet. Below you can find more details on the architecture and furhter information on the source code managment/components. If you have any qeustion, idea or issue, please get in contact with us.

Data Sources

Currently the following data sources are intergrated into the Federated Recommender:

Partner System Objects
[Europeana] (http://www.europeana.eu/portal/) ca. 44 millionen cultural heritage objects
[ZBW's Econbiz] (https://www.econbiz.de/) ca. 10 millionen publications in the field of Economics
KIM - Museums in Baselland, Switzerland more than 35.000 objects from 16 museums
Mendeley ca. 85 millionen publications
Brockhaus Wissensserver
Kierling 5.600 images
Rijksmuseum 1 million objects
Wikipedia for Schools 6000 pages for children
Core - COnnecting REpositories more than 25 Million Objects
Digital Public Library of America millions (>10 Million) of photographs, manuscripts, books, sounds, moving images, and more from libraries, archives, and museums around the United States.

The integration of the following partner system is currently in progress

Partner System Objects
Carinthia Archive 150.000 Objects
Styrian Archive about 150.000 Objects
Flickr Bookarchive Images around 1 Million images
Canadian Discovery Portal 60 million pages
German National Library ca. 17.7 million publications

Architecture

Architecture of the EEXCESS Framework

Source Code Management

The EEXCESS source is organised in Git submodules. So every individual sub-module maintains its own Git repository and also provides a standalone solution. This EEXCESS main repositories brings all the parts together.

Note that when cloning the repo you need to init and update the submodule once:

git clone https://github.com/EEXCESS/eexcess.git
cd eexcess
git submodule init
git submodule update

Alternatively you can use '--recursive' as option

git clone --recursive https://github.com/EEXCESS/eexcess.git

Client-side Technologies

  1. The Wordpress Plugin for recommending cultural and scholarly content while editing a Wordpress blog. It is foreseen to deploy the Wordpress Plugin Directory.

  2. The Google Chrome Extension for recommending cultural and scholarly content while browsing the web. The Extension is also available through the Chrome Web Store.

  1. A Twitter Bot recommending scholarly content to Twitter users. The bot listens to certain keywords and send recommendations to users. If a user starts following the Twitter Bot, she gets recommendations regularly.

  2. A Google Docs Plugin/App that allows to query for cultural and scholarly resources when writing a Google Docs document. For example, found images can be reused in the document (if they have an apropriate license) or citation can be inserted.

Server-side Technologies

  1. The Federated Recommender queries distributed partner recommenders and merges their results. Partner recommenders are available for collections of Europeana, ZBW, Collections Trust (via Europeana), Wissenmedia's Brockhaus and KBL-AMBL's KIM Portal.
  1. The Privacy Proxy serves as privacy ensuring filter for request to the federated recommender. The developed protocol ensures that user id and query are separated. Further, queries can be obfuscated to ensure a high degree of privacy while retaining recommender accuracy.

Supporting Technologies

  1. Tools for Data Quality Assurance.
  2. Prototype for the Blog Crawler
  3. Prototype for the Blog Analyzer

Proof-of-Concept

  1. An Android App that recommends Europeana results based on your current location.

  2. A Cultural Game with a Purpose (CGWAP) which allows players to ask and answer cultural questions. Answers have to be given as links and the goal is to be as fast as possible in finding the answer on Europeana.

Visualisations and Widgets

  1. The FacetScape presents a visualisation widget usable for faceted browsing of cultural heritage resources. The visualisation can be used independently of the recommender framework.

Documentation

Detailed descriptions are given in the individual components. The overall documentation is available in the Wiki of this repository.

Branching model

All contibutions shoud follow the best practice branching model, as oultined here

Data Quality