Creating a Collection
Only PARADISEC administrators can create a collection in the catalog. When you are ready for us to create a collection, contact the CoEDL Data Manager firstname.lastname@example.org. You will need the following information:
- Collection ID (see below)
- A brief title of your collection
- A more detailed description of your collection
- Funding body (if applicable) (e.g. ARC, ELDP, etc.)
- Grant identifier (if applicable) (e.g. CoEDL = CE140100041 or Wellsprings = FL130100111)
You can edit the collection information after it has been created, adding details such as the originating university, country, language(s), etc. You also can set the map, define access, and assign editor privileges at the collection level.
Once you are added as a user, I can help you start a collection. First you need to create a name for your collection; it should be unique and include a project prefix if relevent (COE / WS) + your initials (all caps and no spaces) + a digit:
Example collection IDs:
General researcher : JM1
The digit is required and it distinguishes you from someone with the same initials (John Mansfield, Julia Miller, etc), but also allows for multiple collections, should you require, e.g. JM1, JM2... JM85.
What constitutes a collection?
A collection is a personal corpus you have created that contains recordings of a language/performance/group of people you have researched, recordings that address a particular research question, or includes all the recordings collected during the life of a researcher. Each individual depositor will have one or more collections, distinguished by the collection ID. Below are some examples of collections held by PARADISEC:
LB1 - Theatre performances recorded in different locations and years, but all within the Philippine province of Ilocos Sur.
GB1-GB50 - 50 collections, each containing fieldnotes of a different Australian language.
TH1-TH4 - Each collection based upon different field trips, but the same language.
NT collections - NT2 contains .pdf files of PhD thesis; NT8 contains dictionary files and texts of a language; NT4 is specifically for images, mainly plant identification photos.
SOCCOG - Collection for all participants of the Social Cognition project using the “Family Problems” picture task to collect data from different languages around the world.
PARADISEC catalog at the collection level
Come up with a brief descriptive title for your collection. Then provide a more detailed description of what is in your collection. Details could include discussion of elicitation tools used, particular research questions, or motivations for creating your corpus.
What is an item?
Items are the next level of organisation below the collection. An item once referred to a physical tape, such as a reel-to-reel or a cassette tape. Now that much of the new data will be collected already in digital format, with multi-media, concurrent recording devices, it may be better to refer to an item as a recording session or event.
The item names within a collection do not have to be meaningful; ALL important information about each item will be contained in the metadata.
Within a collection you can have multiple items. In this collection, there are 182 items. The collector has chosen to name them “001”, “002”, “003”, etc. Do not use more than 30 characters for an item name.
Each item can have multiple content files. Under item 014, there are 6 content files, differentiated by the third part of the file name (“A”, “A1”, and “A2”). Three of these files were archived by the collector (the .wav files) and three were generated by PARADISEC during the ingest process as online display versions (.mp3 files).