Greenstone tutorial exercise

Back to wiki
Back to index
Sample files:
Devised for Greenstone version: 2.60|3.06
Modified for Greenstone version: 2.87|3.10

Looking at a multimedia collection

  1. Copy the entire folder

    sample_files → beatles → advbeat_large

    (with all its contents) into your Greenstone collect folder. If you have installed Greenstone in the usual place, this is

    My Computer → Local Disk (C:) → Users → <Username> → Greenstone3 → web → sites → localsite → collect

    where <Username> is the username under which Greenstone is installed.

    Put advbeat_large in there. Then go into the advbeat_large folder and delete its index subfolder.

  1. Next, go into the advbeat_large folder's etc subfolder and rename the Greenstone 2 specific collect.cfg file there to collect.cfg.bak. Doing so will allow GLI to detect only the Greenstone 3 collection configuration file, collectionConfig.xml, enabling GLI to rebuild the collection based on that.

  1. Start up GLI and open the Advanced Beatles -- large collection. Switch to the Create panel and build the collection. Preview the result.
  1. Explore the Beatles collection. Note how the Browse button divides the material into seven different types. Within each category, the documents have appropriate icons. Some documents have an audio icon: when you click these you hear the music (assuming your computer is set up with appropriate player software). Others have an image thumbnail: when you click these you see the images.

  1. Look at the titles browser. Each title has a bookshelf that may include several related items. For example, Hey Jude has a MIDI file, lyrics, and a discography item.

  1. Observe the low quality of the metadata. For example, the five items under A Hard Day's Night (under "H" in the titles browser) have different variants as their titles. The collection would have been easier to organize had the metadata been cleaned up manually first, but that would be a big job. Only a tiny amount of metadata was added by hand—fewer than ten items. The original metadata was left untouched and Greenstone facilities were used to clean it up automatically. (You will find in Building a multimedia collection that this is possible but tricky.)

  1. In the file browser, take a look at the files that make up the collection, in the

    sample_files → beatles → advbeat_large → import

    folder. What a mess! There are around 450 files under seven top-level sub-folders. Organization is minimal, reflecting the different times and ways the files were gathered. For example, html_lyrics, discography and tablature_txt are excerpts of web sites, and images contains various images in JPEG format. For each type, drill down through the hierarchy and look at a sample document.

Copyright © 2005-2019 by the New Zealand Digital Library Project at the University of Waikato, New Zealand
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled “GNU Free Documentation License.”