Chapter 5. DSpace System Documentation: Configuration

Table of Contents

5.1. General Configuration
5.1.1. Input Conventions
5.1.2. Update Reminder
5.2. The dspace.cfg Configuration Properties File
5.2.1. The dspace.cfg file
5.2.2. Main DSpace Configurations
5.2.3. DSpace Database Configuration
5.2.4. DSpace Email Settings
5.2.4.1. Wording of E-mail Messages
5.2.5. File Storage
5.2.6. SRB (Storage Resource Brokerage) File Storage
5.2.7. Handle Server Configuration
5.2.8. Stackable Authentication Method(s)
5.2.8.1. Authentication by Password
5.2.8.2. X.509 Certificate Authentication
5.2.8.3. Example of a Custom Authentication Method
5.2.8.4. Configuring IP Authentication
5.2.8.5. Configuring LDAP Authentication
5.2.9. Shibboleth Authentication Configuration Settings
5.2.10. Logging Configuration
5.2.11. Configuring Lucene Search Indexes
5.2.12. Proxy Settings
5.2.13. Configuring Media Filters
5.2.14. Configurable MODS Dissemination Crosswalk
5.2.15. XSLT-based Crosswalks
5.2.15.1. Testing XSLT Crosswalks
5.2.16. Configurable Qualified Dublin Core (QDC) dissemination crosswalk
5.2.17. Configuring Crosswalk Plugins
5.2.18. Configuring Packager Plugins
5.2.19. Event System Configuration
5.2.20. Checksum Checker Settings
5.2.21. Item Export and Download Settings
5.2.22. Subscription Emails
5.2.23. Settings for the Submission Process
5.2.24. Configuring Creative Commons License
5.2.25. WEB User Interface Configurations
5.2.26. Browse Index Configuration
5.2.26.1. Defining the Indexes.
5.2.26.2. Defining Sort Options
5.2.26.3. Browse Index Normalization Rule Configuration
5.2.26.4. Other Browse Options
5.2.27. Author (Multiple metadata value) Display
5.2.28. Links to Other Browse Contexts
5.2.29. Recent Submissions
5.2.30. Syndication Feed (RSS) Settings
5.2.31. Content Inline Disposition Threshold
5.2.32. Multi-file HTML Document/Site Settings
5.2.33. Sitemap Settings
5.2.34. Upload File Settings
5.2.35. Statistical Report Configuration Setting
5.2.36. JSP Web Interface (JSPUI) Settings
5.2.37. Configuring Multilingual Support
5.2.37.1. Setting the Default Language for the Application
5.2.37.2. Supporting More Than One Language
Changes in dspace.cfg
Related Files
5.2.38. Item Mapper
5.2.39. Display of Group Membership
5.2.40. SFX Server
5.2.41. Item Recommendation Setting
5.2.42. Controlled Vocabulary Settings
5.2.43. XMLUI Specific Configuration
5.2.44. OAI-PMH Configuration and Activation
5.2.44.1. OAI-PMH Configuration
5.2.44.2. Activating Additional OAI-PMH Crosswalks
DIDL
5.2.45. Delegation Administration
5.2.46. Batch Metadata Editing
5.2.47. Hiding Metadata
5.3. Optional or Advanced Configuration Settings
5.3.1. The Metadata Format and Bitstream Format Registries
5.3.1.1. Metadata Format Registries
5.3.1.2. Bitstream Format Registry
5.3.2. XPDF Filter
5.3.2.1. Installation
5.3.2.2. Install XPDF Tools
5.3.2.3. Installation within DSpace
Fetch and install jai_imageio JAR
Edit DSpace Configuration
Build and Install
5.3.3. Creating a new Media/Format Filter
5.3.3.1. Creating a simple Media Filter
5.3.3.2. Creating a Dynamic or "Self-Named" Format Filter
5.3.4. Configuration Files for Other Applications
5.3.5. Configuring Usage Instrumentation Plugins
5.3.5.1. The Passive Plugin
5.3.5.2. The Tab File Logger Plugin
5.3.5.3. The XML Logger Plugin
5.3.6. SWORD Configuration
5.3.7. OpenSearch Support
5.3.8. Embargo

There are a numbers of ways in which DSpace may be configured and/or customized. This chapter of the documentation will discuss the configuration of the software and will also reference customizations that may be performed in the chapter following.

For ease of use, the Configuration documentation is broken into several parts:

5.1. General Configuration

In the following sections you will learn about the different configuration files that you will need to edit so that you may make your DSpace installation work. Of the several configuration files which you will work with, it is the dspace.cfg file you need to learn to configure first and foremost.

In general, most of the configuration files, namely dspace.cfg and xmlui.xconf will provide a good source of information not only with configuration but also with customization (cf. Customization chapter)

5.1.1. Input Conventions

We will use the dspace.cfg as our example for input conventions used through out the system. It is a basic Java properties file, where lines are either comments, starting with a '#', blank lines, or property/value pairs of the form:

property.name = property value

Some property defaults are "commented out". That is, they have a "#" proceeding them, and the DSpace software ignores the config property. This may cause the feature not to be enabled, or, cause a default property to be used when the software is compiled and updated.

The property value may contain references to other configuration properties, in the form ${property.name}. This follows the ant convention of allowing references in property files. A property may not refer to itself. Examples:

property.name = word1 ${other.property.name} more words
property2.name = ${dspace.dir}/rest/of/path

[Tip]

Property values can include other, previously defined values, by enclosing the property name in ${...}. For example, if your dspace.cfg contains:

dspace.dir = /dspace
dspace.history = ${dspace.dir}/history

Then the value of dspace.history property is expanded to be /dspace/history. This method is especially useful for handling commonly used file paths.

5.1.2. Update Reminder

Things you should know about editing dspace.cfg files.

It is important to rememeber that there are two dspace.cfg files after an installation of DSpace.

  1. The "source" file that is found in [dspace-source]/dspace/config/dspace.cfg

  2. The "runtime" file that is found in [dspace]/config/dspace.cfg

The runtime file is supposed to be the copy of the source file, which is considered the master version. However, the DSpace server and command programs only look at the runtime configuration file, so when you are revising your configuration values, it is tempting to only edit the runtime file. DO NOT do this. Always make the same changes to the source version of dspace.cfg in addition to the runtime file. The two files should always be identical, since the source dspace.cfg will be the basis of your next upgrade.

To keep the two files in synchronization, you can edit your files in [dspace-source]/dspace/config/ and then you would run the following commands:

cd /[dspace-source]/dspace/target/dspace-<version>-build.dir
ant update_configs

This will copy the source dspace.cfg (along with other configuration files) into the runtime (/[dspace]/config) directory.

[Note]

You should remember that after editing your configuration file(s), and you are done and wish to implement the changes, you will need to:

  • To run ant -Dconfig=/[dspace]/config/dspace.cfg update if you are updating your dspace.cfg file and wish to see the changes appear. Follow the usual sequence with copying your webapps.

  • If you edit dspace.cfg in [dspace-source]/dspace/config/, you should then run 'ant init_configs' in the directory [dspace-source]/dspace/target/dspace-1.5.2-build.dir so that any changes you may have made are reflected in the configuration files of other applications, for example Apache. You may then need to restart those applications, depending on what you changed.

5.2. The dspace.cfg Configuration Properties File

The primary way of configuring DSpace is to edit the dspace.cfg. You will definitely have to do this before you can run DSpace properly. dspace.cfg contains basic information about a DSpace installation, including system path information, network host information, and other like items. To assist you in this endeavor, below is a place for you to write down some of the preliminary data so that you may facilitate faster configuration.
Server IP: ______________________________
Host Name (Server name) ______________________________
dspace.url ______________________________
Administrator's email: ______________________________
handle prefix: ______________________________
assetstore directory: ______________________________
SMTP server: ______________________________

5.2.1. The dspace.cfg file

Below is a brief "Properties" table for the dspace.cfg file and the documented details are referenced. Please refer to those sections for the complete details of the parameter

Table 5.1. The dspace.cfg Main Properties

Property Ref. Sect.
General Configurations
dspace.dir
dspace.url
dspace.baseUrl
dspace.oai.url
dspace.hostname
dspace.name
5.2.2
Database Configurations
db.name
db.url
db.driver
db.username
db.password
3.2.3 or 5.2.3
Advanced Database Configuration
db.schema
db.maxconnection
db.maxwait
db.maxidle
db.statementpool
db.poolname
5.2.3
Email Settings
mail.server
mail.server.username
mail.server.password
mail.server.port
mail.from.address
feedback.recipient
mail.admin
alert.recipient
registration.notify
mail.charset
mail.allowed.referrers
mail.extraproperties
mail.server.disabled
5.2.4
File Storage
assetstore.dir
[assetstore.dir.1
assetstore.dir.2
assetstore.incoming]
5.2.5
SRB File Storage
srb.hosts.1
srb.port.1
srb.mcatzone.1
srb.mdasdomainname.1
srb.defaultstorageresource.1
srb.username.1
srb.password.1
srb.homedirectory.1
srb.parentdir.1
5.2.6
Handle Configuration
handle.prefix
handle.dir
5.2.7
Authorization System Configuration
core.authorization.community-admin.create-subelement
core.authorization.community-admin.delete-subelement
core.authorization.community-admin.policies
core.authorization.community-admin.admin-group
core.authorization.community-admin.collection.policies
core.authorization.community-admin.collection.template-item
core.authorization.community-admin.collection.submitters
core.authorization.community-admin.collection.workflows
core.authorization.community-admin.collection.admin-group
core.authorization.community-admin.item.delete
core.authorization.community-admin.item.withdraw
core.authorization.community-admin.item.reinstatiate
core.authorization.community-admin.item.policies
core.authorization.community-admin.item.create-bitstream
core.authorization.community-admin.item.delete-bitstream
core.authorization.community-admin.item-admin.cc-license
core.authorization.collection-admin.policies
core.authorization.collection-admin.template-item
core.authorization.collection-admin.submitters
core.authorization.collection-admin.workflows
core.authorization.collection-admin.admin-group
core.authorization.collection-admin.item.delete
core.authorization.collection-admin.item.withdraw
core.authorization.collection-admin.item.reinstatiate
core.authorization.collection-admin.item.policies
core.authorization.collection-admin.item.create-bitstream
core.authorization.collection-admin.item.delete-bitstream
core.authorization.collection-admin.item-admin.cc-license
core.authorization.item-admin.policies
core.authorization.item-admin.create-bitstream
core.authorization.item-admin.delete-bitstream
core.authorization.item-admin.cc-license
5.2.45
Stackable Authentication Methods
plugin.sequence.org.dspace.authenticate.AuthenticationMethod
5.2.8 or 5.2.9
LDAP Authentication
ldap.enable
ldap.provider_url
ldap.id_field
ldap.object_context
ldap.search_context
ldap.email_field
ldap.surname_field
ldap.givenname_field
ldap.phone_field
webui.ldap.autoregister
ldap.login.specialgroup
5.2.8.5
Hierarchical LDAP Settings:
ldap.search_scope
ldap.search.user
ldap.netid_email_domain
5.2.8.5
Shibboleth Authentication Settings
authentication.shib.email-header
authentication.shib.firstname-header
authentication.shib.lastname-header
authentication.shib.email-use-tomcat-remote-user
authentication.shib.autoregister
authentication.shib.role-header
authentication.shib.default-roles
5.2.9-
Log Configuration
log.init.config 5.2.10
Lucene Search Indexes
search.dir
search.max-clauses
search.analyzer
search.operator
search.maxfieldlengthsearch.index.n
5.2.11
Proxy Settings
http.proxy.host
http.proxy.port
5.2.12
Media Filter
filter.plugin5.2.13
plugin.named.org.dspace.app.mediafilter.FormatFilter
     filter.org.dspace.app.mediafilter.PDFFilter.inputFormats
     filter.org.dspace.app.mediafilter.HTMLFilter.inputFormats
     filter.org.dspace.app.mediafilter.WordFilter.inputFormats
     filter.org.dspace.app.mediafilter.JPEGFilter.inputFormats
  filter.org.dspace.app.mediafilter.BrandedPreviewJPEGFilter.inputFormats
5.2.13
pdffilter.largepdfs
pdffilter.skiponmemoryexception
5.2.13
Crosswalks (MODS, QDC, XSLT, etc.)
crosswalk.mods.properties.MODS
crosswalk.submission.MODS.stylesheet
crosswalk.qdc.namespace.QDC.dc
crosswalk.qdc.namespace.QDC.dcterms
crosswalk.qdc.schemaLocation.QDC
plugin.named.org.dspace.content.crosswalk.IngestionCrosswalk
plugin.named.org.dspace.content.crosswalk.DisseminationCrosswalk
5.2.18
Event Settings
event.dispatcher.default.class
event.dispatcher.default.consumers
event.dispatcher.noindex.class
event.dispatcher.noindex.consumers
event.consumer.search.class
event.consumer.search.filters
event.consumer.browse.class
event.consumer.browse.filters
event.consumer.eperson.class
event.consumer.eperson.filters
event.consumer.test.class
event.consumer.test.filters
testConsumer.verbose
5.2.19
Checksum Checker
plugin.single.org.dspace.checker.BitsreamDispatcher
checker.retention.default
checker.retention.CHECKSUM-MATCH
5.2.20
Item Export and Download
org.dspace.app.itemexport.work.dir
org.dspace.app.itemexport.download.dir
org.dspace.app.itemexport.life.span.hours
org.dspace.app.itemexport.max.size
5.2.21
Bulk Metadata Editing
bulkedit.valueseparator
bulkedit.fieldseparator
bulkedit.gui-item-limit
bulkedit.ignore-on-export
5.2.46
Subscription Email Option
eperson.subscription.onlynew 5.2.22
Submission Process
webui.submit.blocktheses 5.2.23
webui.submit.upload.required 5.2.23
webui.submit.enable-cc 5.2.24
WEBUI Configurations [General]
webui.browse.thumbnail.max.height
webui.browse.thumbnail.max.width
5.2.25
webui.browse.thumbnail.linkbehaviour5.2.25
thumbnail.maxwidth
thumbnail.maxheight
5.2.25
webui.preview.enabled5.2.25
webui.preview.maxwidth
webui.preview.maxheight
5.2.25
webui.preview.brand
webui.preview.brand.abbrev
5.2.25
webui.preview.brand.height5.2.25
webui.preview.brand.font
webui.preview.brank.fontpoint
5.2.25
webui.preview.dc5.2.25
webui.strengths.show
webui.strengths.cache
5.2.25
Browse Index Configuration
webui.browse.index.n5.2.26
webui.itemlist.sort-option.n5.2.26
webui.browse.medata.case-insensitive5.2.26.3
webui.browse.value_columns.max
webui.browse.sort_columns.max
webui.browse.value_columns.omission_mark
plugin.named.org.dspace.sort.OrderFormatDelegate
5.2.26.4
Mulitple Metadata Value Display
webui.browse.author-field5.2.27
webui.browse.author-limit5.2.27
Other Browse Contexts
webui.browse.link.n5.2.28
Recent Submission
recent.submission.sort-option
recent.submissions.count
5.2.29
Syndication Feed (RSS) Settings
webui.feed.enable
webui.feed.items
webui.feed.cache.size 
webui.cache.age
webui.feed.formats
webui.feed.localresolve
webui.feed.item.title
webui.feed.item.date
webui.feed.item.description
5.2.30
Content Inline Disposition Threshold
webui.content_disposition_threshold
xmlui.content_disposition_threshold
5.2.31
Multifile HTML Settings
webui.html.max-depth-guess
xmlui.html.max-depth-guess
5.2.32
Other General Configuration Settings
sitemap.dir5.2.33
sitemap.engineurls5.2.33
upload.temp.dir5.2.34
default.locale5.2.37
itemmap.author.index5.2.38
webui.mydspace.showgroupmembership5.2.39
sfx.server.url5.2.40
webui.suggest.enable
webui.suggest.loggedinusers.only
5.2.41
webui.controlledvocabulary.enable5.2.42
oai.didl.maxresponse5.2.44
JSP Web Interface Settings
webui.licence_bundle.show
webui.itemdisplay.default
webui.resolver.1.urn
webui.resolver.1.baseurl
webui.resolver.2.urn
webui.resolver.2.baseurl
plugin.single.org.dspace.app.webui.util.StyleSelection
webui.itemdisplay.thesis.collections
webui.itemdisplay.metadata-style
webui.itemlist.column
webui.itemlist.width
webui.itemlist.browse.<index name>.sort.<sort name>.columns
webui.itemlist.sort<sort name>.columns
webui.itemlist.browse.<browse name>.columns
webui.itemlist.*lt;sort or index name>.columns
webui.itemlist.dateaccessioned.columns
webui.itemlist.dateaccessioned.widths
webui.itemlist.tablewidth
5.2.36
XMLUI Settings (Manakin)
xmlui.supported.locales
xmlui.force.ssl
xmlui.user.registration
xmlui.user.assumelogon
xmlui.user.logindirect
xmlui.theme.allowoverrides
xmlui.bundle.upload
xmlui.community-list.render.full
xmlui.community-list.cache
xmlui.bitstream.mods
xmlui.bitstream.mets
xmlui.google.analytics.key
xmlui.controlpanel.activity.max
xmlui.controlpanel.activity.ipheader
5.2.43

5.2.2. Main DSpace Configurations

Property: dspace.dir
Example Value: /dspace
Informational Note:Root directory of DSpace installation. Omit the trailing '/'. Note that if you change this, there are several other parameters you will probably want to change to match, e.g. assetstore.dir.
 
Property:dspace.baseUrl
Example Value: http://dspacetest.myu.edu:8080
Informational Note:Main URL at which DSpace Web UI webapp is deployed. Include any port number, but do not include the trailing '/'.
 
Property:dspace.url
Example Value:dspace.url = ${dspace.baseUrl}/jspui
Informational noteDSpace base URL. Include port number etc., but NOT trailing slash. Change to /xmlui if you wish to use the xmlui (Manakin) as the default, or remove "/jspui" and set webapp of your choice as the "ROOT" webapp in the servlet engine.
 
Property:dspace.oai.url
Example Value:dspace.oai.url = ${dspace.baseUrl}/oai
Informational note:The base URL of the OAI webapp (do not include /request).
 
Property: dspace.hostname
Example Value: dspace.hostname = dspace.mysu.edu
Informational Note:Fully qualified hostname; do not include port number.
 
Property: dspace.name
Example Value: dspace.name = DSpace at My University
Informational Note:Short and sweet site name, used throughout Web UI, e-mails and elsewhere (such as OAI protocol)

5.2.3. DSpace Database Configuration

Many of the database configurations are software-dependent. That is, it will be based on the choice of database software being used. Documentation is below shows PostgreSQL and Oracle examples.

Property: db.name
Example Value: db.name = postgres
Informational Note:In dspace.cfg you choose either postgres or oracle.
 
Property: db.url
Example Value: db.url = dbc:postgresql://localhost:5432/dspace-services
Informational Note:The above value is the default value when configuring with PostgreSQL. When using Oracle, use this value: jbdc.oracle.thin:@//host:port/dspace
 
Property: db.username
Example Value: db.username = dspace
Informational Note:In the installation directions, the administrator is instructed to create the user "dspace" who will own the database "dspace".
 
Property: password
Example Value: password = dspace5
Informational Note:This is the password that was prompted during the installation process (cf. 3.2.3. Installation)
 
Property: db.schema
Example Value: db.schema = vra
Informational Note:If your database contains multiple schemas, you can avoid problems with retrieving the definitions of duplicate objects by specifying the schema name here that is used for DSpace by uncommenting the entry. This is commented out
 
Property: db.maxconnections
Example Value: db.maxconnections = 30
Informational Note:Maximum number of DB connections in pool
 
Property: db.maxwait
Example Value: db.maxwait = 5000
Informational Note:Maximum time to wait before giving up if all connections in pool are busy (in milliseconds).
 
Property: db.maxidle
Example Value: db.maxidle = -1
Informational Note:Maximum number of idle connections in pool. (-1 = unlimited)
 
Property: db.statementpool
Example Value: db.statementpool = true
Informational Note:Determines if prepared statement should be cached. (Default is set to true)
 
Property: db.poolname
Example Value: db.poolname = dspacepool
Informational Note:Specify a name for the connection pool. This is useful if you have multiple applications sharing Tomcat's database connection pool. If nothing is specified, it will default to 'dspacepool'

5.2.4. DSpace Email Settings

The configuration of email is simple and provides a mechanism to alert the person(s) responsible for different features of the DSpace software.

Property: mail.server
Example Value: mail.server = smtp.my.edu
Informational Note:SMTP Mail Server
 
Property:
mail.server.username
mail.server.password
Example Value:
mail.server.username = myusername
mypassword
Informational Note:SMTP mail server authentication username and password, if required.
 
Property: mail.server.port
Example Value: mail.server.port = 25
Informational Note:SMTP mail server alternate port (Defaults to port 25 when commented out).
 
Property: mail.from.address
Example Value: mail.from.address = dspace-noreply@myu.edu
Informational Note:The "From" address for email. Change the 'myu.edu' to the site's host name.
 
Property: feedback.recipient
Example Value: feedback.recipient = dspace-help@myu.edu
Informational Note:When a user clicks on the feedback link/feature, the information will be send to the email address of choice. This configuration is currently limited to only one recipient.
 
Property: mail.admin
Example Value: mail.admin = dspace-help@myu.edu
Informational Note:Email address of the general site administrator (Webmaster)
 
Property: alert.recipient
Example Value: alert.recipient = john.doe@myu.edu
Informational Note:Enter the recipient for server errors and alerts.
 
Property: registration.notify
Example Value: registration.notify = mike.smith@myu.edu
Informational Note:Enter the recipient that will be notified when a new user registers on DSpace.
 
Property: mail.charset
Example Value: mail.charset = UTF8
Informational Note:Set the default mail character set. This may be over-ridden by providing a line inside the email template "charset: <encoding>", otherwise this default is used.
 
Property:mail.allowed.referrers
Example Value:mail.allowed.referrers = localhost
Informational Note:A comma separated list of hostnames that are allowed to refer browsers to email forms. Default behaviour is to accept referrals only from dspace.hostname
 
Property:mail.extraproperties
Example Value:
mail.extraproperties = mail.smtp.socketFactory.port=465, \
	                   mail.smtp.socketFactory.class=javax.net.ssl.SSLSocketFactory, \
	                   mail.smtp.socketFactory.fallback=false
Informational Note:If you need to pass extra settings to the Java mail library. Comma separated, equals sign between the key and the value. By default, commented out.
 
Property:mail.server.disabled
Example Vavlue:mail.server.disabled = false
Informational Note:An option is added to disable the mailserver. By default, this property is set to false. By setting value to 'true', DSpace will not send out emails. It will instead log the subject of the email which should have been sent. This is especially useful for development and test environments where production data is used when testing functionality.

5.2.4.1. Wording of E-mail Messages

Sometimes DSpace automatically sends e-mail messages to users, for example to inform them of a new work flow task, or as a subscription e-mail alert. The wording of emails can be changed by editing the relevant file in [dspace]/config/emails. Each file is commented. Be careful to keep the right number 'placeholders' (e.g.{2}).

Note: You should replace the contact-information "dspace-help@myu.edu or call us at xxx-555-xxxx" with your own contact details in:

config/emails/change_password
config/emails/register

5.2.5. File Storage

This is the default "technique" that is used by DSpace to store the bitstreams. DSpace can also use SRB (Storage Resource Brokerage) as an alternative. See section 5.2.6 for details regarding SRB.

Property: assetstore.dir
Example Value: assetstore.dir = ${dspace.dir}/assetstore
Informational Note:This is Asset (bitstream) store number 0 (Zero). You need not place your assetstore under the /dspace directory, but may want to place it on a different logical volume on the server that DSpace resides. So, you might have something like this: assetstore.dir = /storevgm/assestore.
 
Property:
assetstore.dir.1
assetstore.dir.2
Example Value:
assetstore.dir.1 = /second/assetstore
assetstore.dir.2 = /third/assetstore
Informational Note:This property specifies extra asset stores like the one above, counting from one (1) upwards. This property is commented out (#) until it is needed.
 
Property: assetstore.incoming
Example Value: assetstore.incoming = 1
Informational Note:Specify the number of the store to use for new bitstreams with this property. The default is 0 [zero] which corresponds to the 'assestore.dir' above.
[Tip]

In the examples above, you can see that your storage does not have to be under the /dspace directory. For the default installation it needs to reside on the same server (unless you plan to configure SRB (cf. below)). So, if you added storage space to your server, and it has a different logical volume/name/directory, you could have the following as an example:

assetstore.dir = /storevgm/assetstore
assetstore.dir.1 = /storevgm2/assetstore
assetstore.incoming = 1

Please Note: When adding additional storage configuration, you will then need to uncomment and declare assestore.incoming = 1

5.2.6. SRB (Storage Resource Brokerage) File Storage

An alternate to using the default storage framework is to use Storage Resource Brokerage (SRB). This can provide a different level of storage and disaster recovery. (Storage can take place on storage that is off-site.) Refer to http://www.sdsc.edu/srb/index.php/Main_Page for complete details regarding SRB.

The same framework is used to configure SRB storage. That is, the asset store number (0..n) can reference a file system directory as above or it can reference a set of SRB account parameters. But any particular asset store number can reference one or the other but not both. This way traditional and SRB storage can both be used but with different asset store numbers. The same cautions mentioned above apply to SRB asset stores as well. The particular asset store a bitstream is stored in is held in the database, so don't move bitstreams between asset stores, and do not renumber them.

Property: srb.hosts.1
Example value: srb.hosts.1 = mysrbmcathost.myu.edu
Property: srb.port.1
Example value: srb.port.1 = 5544
Property: srb.mcatzone.1
Example value: srb.mcatzone.1 = mysrbzone
Property: srb.mdasdomainname.1
Example value: srb.mdasdomainname.1 = mysrbdomain
Property: srb.defaultstorageresource.1
Example value: srb.defaultstorageresource.1 = mydefaultsrbresource
Property: srb.username.1
Example value: srb.username.1 = mysrbuser
Property: srb.password.1
Example value: srb.password.1 = mysrbpassword
Property: srb.homedirectory.1
Example value:
srb.homedirectory.1 = /mysrbzone/home/mysrbuser.mysrbdomain
 
Property: srb.parentdir.1
Example value: srb.parentdir.1 = mysrbdspaceassetstore
Informational Note:Several of the terms, such as mcatzone, have meaning only in the SRB context and will be familiar to SRB users. The last, srb.paratdir.n, can be used for additional (SRB) upper directory structure within an SRB account. This property value could be blank as well.

The 'assetstore.incoming' property is an integer that references where new bitstreams will be stored. The default (say the starting reference) is zero. The value will be used to identify the storage where all new bitstreams will be stored until this number is changed. This number is stored in the Bitstream table (store_number column) in the DSpace database, so older bitstreams that may have been stored when 'asset.incoming' had a different value can be found.

In the simple case in which DSpace uses local (or mounted) storage the number can refer to different directories (or partitions). This gives DSpace some level of scalability. The number links to another set of properties 'assetstore.dir', 'assetstore.dir.1' (remember zero is default), assetstore.dir.2', etc., where the values are directories.

To support the use of SRB DSpace uses the same scheme but broaden to support:

  • using SRB instead of the local file system

  • using the local file system (native DSpace)

  • using a mix of SRB and local file system

in this broadened use of the 'asset.incoming' integer will refer to one of the following storage locations:

  • a local file system directory (native DSpace)

  • a set of SRB account parameters (host, port, zone, domain, username, password, home directory, and resource

Should the be any conflict, like '2' referring to a local directory and to a set of SRB parameters, the program will select the local directory.

If SRB is chosen from the first install of DSpace, it is suggested that 'assetstore.dir' (no integer appended) be retained to reference a local directory (as above under File Storage) because build.xml uses this value to do a mkdir. In this case, 'assetstore.incoming' can be set to 1 (i.e. uncomment the line in File Storage above) and the 'assetstore.dir' will not be used.

5.2.7. Handle Server Configuration

Property: handle.prefix
Example Value handle.prefix = 1234.56789
Informational Note:The default installed by DSpace is 123456789 but you will replace this upon receiving a handle from CNRI.
 
Property: handle.dir
Example Value: handle.dir = ${dspace.dir}/handle-server
Informational Note:The default files, as shown in the Example Value is where DSpace will install the files used for the Handle Server.

For complete information regarding the Handle server, the user should consult 3.3.4. The Handle Server section of Installing DSpace.

5.2.8. Stackable Authentication Method(s)

(formally Custom Authentication)

Since many institutions and organizations have existing authentication systems, DSpace has been designed to allow these to be easily integrated into an existing authentication infrastructure. It keeps a series, or "stack", of authentication methods, so each one can be tried in turn. This makes it easy to add new authentication methods or rearrange the order without changing any existing code. You can also share authentication code with other sites.

Property: plugin.sequence.org.dspace.authenticate.AuthenticationMethod
Example Value:
plugin.sequence.org.dspace.authenticate.AuthenticationMethod = \ 
          org.dspace.authenticate.PasswordAuthentication

The configuration property plugin.sequence.org.dspace.authenticate.AuthenticationMethod defines the authentication stack. It is a comma-separated list of class names. Each of these classes implements a different authentication method, or way of determining the identity of the user. They are invoked in the order specified until one succeeds.

An authentication method is a class that implements the interface org.dspace.authenticate.AuthenticationMethod. It authenticates a user by evaluating the credentials (e.g. username and password) he or she presents and checking that they are valid.

The basic authentication procedure in the DSpace Web UI is this:

  1. A request is received from an end-user's browser that, if fulfilled, would lead to an action requiring authorization taking place.

  2. If the end-user is already authenticated:

    • If the end-user is allowed to perform the action, the action proceeds

    • If the end-user is NOT allowed to perform the action, an authorization error is displayed.

    • If the end-user is NOT authenticated, i.e. is accessing DSpace anonymously:

  3. The parameters etc. of the request are stored

  4. The Web UI's startAuthentication method is invoked.

  5. First it tries all the authentication methods which do implicit authentication (i.e. they work with just the information already in the Web request, such as an X.509 client certificate). If one of these succeeds, it proceeds from Step 2 above.

  6. If none of the implicit methods succeed, the UI responds by putting up a "login" page to collect credentials for one of the explicit authentication methods in the stack. The servlet processing that page then gives the proffered credentials to each authentication method in turn until one succeeds, at which point it retries the original operation from Step 2 above.

Please see the source files AuthenticationManager.java and AuthenticationMethod.java for more details about this mechanism.

5.2.8.1. Authentication by Password

The default method org.dspace.authenticate.PasswordAuthentication has the following properties:

  • Use of inbuilt e-mail address/password-based log-in. This is achieved by forwarding a request that is attempting an action requiring authorization to the password log-in servlet, /password-login. The password log-in servlet (org.dspace.app.webui.servlet.PasswordServlet contains code that will resume the original request if authentication is successful, as per step 3. described above.

  • Users can register themselves (i.e. add themselves as e-people without needing approval from the administrators), and can set their own passwords when they do this

  • Users are not members of any special (dynamic) e-person groups

  • You can restrict the domains from which new users are able to register. To enable this feature, uncomment the following line from dspace.cfg: authentication.password.domain.valid = example.com Example options might be '@example.com' to restrict registration to users with addresses ending in @example.com, or '@example.com, .ac.uk' to restrict registration to users with addresses ending in @example.com or with addresses in the .ac.uk domain.

5.2.8.2. X.509 Certificate Authentication

The X.509 authentication method uses an X.509 certificate sent by the client to establish his/her identity. It requires the client to have a personal Web certificate installed on their browser (or other client software) which is issued by a Certifying Authority (CA) recognized by the web server.

  1. See the HTTPS installation instructions to configure your Web server. If you are using HTTPS with Tomcat, note that the <Connector> tag must include the attribute clientAuth="true" so the server requests a personal Web certificate from the client.

  2. Add the org.dspace.authenticate.X509Authentication plugin first to the list of stackable authentication methods in the value of the configuration key plugin.sequence.org.dspace.authenticate.AuthenticationMethod e.g.:

        
                                plugin.sequence.org.dspace.authenticate.AuthenticationMethod = \
                                 org.dspace.authenticate.X509Authentication, \
                                 org.dspace.authenticate.PasswordAuthentication
                            
  3. You must also configure DSpace with the same CA certificates as the web server, so it can accept and interpret the clients' certificates. It can share the same keystore file as the web server, or a separate one, or a CA certificate in a file by itself. Configure it by one of these methods, either the Java keystore

                 authentication.x509.keystore.path =  path to Java keystore file
                 authentication.x509.keystore.password =  password to access the keystore
                            

    ...or the separate CA certificate file (in PEM or DER format):

    authentication.x509.ca.cert =  path to certificate file for CA
                                              whose client certs to accept. 
  4. Choose whether to enable auto-registration: If you want users who authenticate successfully to be automatically registered as new E-Persons if they are not already, set the authentication.x509.autoregister configuration property to true. This lets you automatically accept all users with valid personal certificates. The default is false.

5.2.8.3. Example of a Custom Authentication Method

Also included in the source is an implementation of an authentication method used at MIT, edu.mit.dspace.MITSpecialGroup. This does not actually authenticate a user, it only adds the current user to a special (dynamic) group called 'MIT Users' (which must be present in the system!). This allows us to create authorization policies for MIT users without having to manually maintain membership of the MIT users group.

By keeping this code in a separate method, we can customize the authentication process for MIT by simply adding it to the stack in the DSpace configuration. None of the code has to be touched.

You can create your own custom authentication method and add it to the stack. Use the most similar existing method as a model, e.g. org.dspace.authenticate.PasswordAuthentication for an "explicit" method (with credentials entered interactively) or org.dspace.authenticate.X509Authentication for an implicit method.

5.2.8.4. Configuring IP Authentication

You can enable IP authentication by adding its method to the stack in the DSpace configuration, e.g.:

 plugin.sequence.org.dspace.authenticate.AuthenticationMethod =
                              org.dspace.authenticate.IPAuthentication 

You are than able to map DSpace groups to IP's in dspace.cfg by setting authentication.ip.GROUPNAME = iprange[, iprange ...], e.g:

 authentication.ip.MY_UNIVERSITY = 10.1.2.3, \            # Full IP
                                   13.5, \                # Partial IP
                                   11.3.4.5/24, \         # with CIDR
                                   12.7.8.9/255.255.128.0 # with netmask
 

Negative matches can be set by prepending the entry with a '-'. For example if you want to include all of a class B network except for users of a contained class c network, you could use: 111.222,-111.222.333.

Note: if the Groupname contains blanks you must escape the, e.g. Department\ of\ Statistics

5.2.8.5. Configuring LDAP Authentication

You can enable LDAP authentication by adding its method to the stack in the DSpace configuration, e.g.

                plugin.sequence.org.dspace.authenticate.AuthenticationMethod =
                     org.dspace.authenticate.LDAPAuthentication
                          

If LDAP is enabled in the dspace.cfg file, then new users will be able to register by entering their username and password without being sent the registration token. If users do not have a username and password, then they can still register and login with just their email address the same way they do now.

If you want to give any special privileges to LDAP users, create a stackable authentication method to automatically put people who have a netid into a special group. You might also want to give certain email addresses special privileges. Refer to the Custom Authentication Code section above for more information about how to do this.

Here is an explanation of what each of the different configuration parameters are for:

Standard LDAP Configuration
Property: ldap.enable
Example Value: ldap.enable = false
Informational Note:This setting will enable or disable LDAP authentication in DSpace. With the setting off, users will be required to register and login with their email address. With this setting on, users will be able to login and register with their LDAP user ids and passwords.
 
Property: ldap.provider_url
Example Value: ldap.provider_url = ldap://ldap.myu.edu/o=myu.edu
Informational Note:This is the url to your institution's LDAP server. You may or may not need the /o=myu.edu part at the end. Your server may also require the ldaps:// protocol.
 
Property: ldap.id_field
Example Value: ldap.id_field = uid
Explanation:This is the unique identifier field in the LDAP directory where the username is stored.
 
Property: ldap.object_context
Example Value: ldap.object_context = ou=people, o=myu.edu
Informational Note:This is the object context used when authenticating the user. It is appended to the ldap.id_field and username. For example uid=username,ou=people,o=myu.edu. You will need to modify this to match your LDAP configuration.
 
Property: ldap.search_context
Example Value: ldap.search_context = ou=people
Informational Note:This is the search context used when looking up a user's LDAP object to retrieve their data for autoregistering. With ldap.autoregister turned on, when a user authenticates without an EPerson object we search the LDAP directory to get their name and email address so that we can create one for them. So after we have authenticated against uid=username,ou=people,o=byu.edu we now search in ou=people for filtering on [uid=username]. Often the ldap.search_context is the same as the ldap.object_context parameter. But again this depends on your LDAP server configuration.
 
Property: ldap.email_field
Example Value: ldap.email_field = mail
Informational Note:This is the LDAP object field where the user's email address is stored. "mail" is the default and the most common for ldap servers. If the mail field is not found the username will be used as the email address when creating the eperson object.
 
Property: ldap.surname_field
Example Value: ldap.surname_field = sn
Informational Note:This is the LDAP object field where the user's last name is stored. "sn" is the default and is the most common for LDAP servers. If the field is not found the field will be left blank in the new eperson object.
 
Property: ldap.givenname_field
Example Value: ldap.givenname_field = givenName
Informational Note:This is the LDAP object field where the user's given names are stored. I'm not sure how common the givenName field is in different LDAP instances. If the field is not found the field will be left blank in the new eperson object.
 
Property: ldap.phone_field
Example Value: ldap.phone_field = telephoneNumber
Informational Note:This is the field where the user's phone number is stored in the LDAP directory. If the field is not found the field will be left blank in the new eperson object.
 
Property: webui.ldap.autoregister
Example Value: webui.ldap.autoregister = true
Informational Note:This will turn LDAP autoregistration on or off. With this on, a new EPerson object will be created for any user who successfully authenticates against the LDAP server when they first login. With this setting off, the user must first register to get an EPerson object by entering their ldap username and password and filling out the forms.
LDAP Users Group
Property: ldap.login.specialgroup
Example Value: ldap.login.specialgroup = group-name
Informational Note:If required, a group name can be given here, and all users who log into LDAP will automatically become members of this group. This is useful if you want a group made up of all internal authenticated users. (Remember to log on as the administrator, add this to the "Groups" with read rights).

Hierarchical LDAP Settings.# If your users are spread out across a hierarchical tree on your LDAP server, you will need to use the following stackable authentication class:

plugin.sequence.org.dspace.authenticate.AuthenticationMethod = \
       org.dspace.authenticate.LDAPHierarchicalAuthentication

You can optionally specify the search scope. If anonymous access is not enabled on your LDAP server, you will need to specify the full DN and password of a user that is allowed to bind in order to search for the users.

Property: ldap.search_scope
Example Value: ldap.search_scope = 2
Informational Note: This is the search scope value for the LDAP search during autoregistering. This will depend on your LDAP server setup. This value must be one of the following integers corresponding to the following values:
object scope : 0
one level scope : 1
subtree scope : 2
 
Property:
ldap.search.user
ldap.search.password
Example Value:
ldap.search.user = cn=admin,ou=people,o=myu.edu
ldap.search.password = password
Informational Note:The full DN and password of a user allowed to connect to the LDAP server and search for the DN of the user trying to log in. If these are not specified, the initial bind will be performed anonymously.
 
Property: ldap.netid_email_domain =
Example Value: ldap.netid_email_domain = @example.com
Informational Note:If your LDAP server does not hold an email address for a user, you can use the following field to specify your email domain. This value is appended to the netid in order to make an email address. E.g. a netid of 'user' and ldap.netid_email_domain as @example.com would set the email of the user to be user@example.com

5.2.9. Shibboleth Authentication Configuration Settings

Detailed instructions for installing Shibboleth on DSpace may be found athttps://mams.melcoe.mq.edu.au/zope/mams/pub/Installation/dspace15.

DSpace requires email as the user's credentials. There are two ways of providing email to DSpace:

  1. By explicitly specifying to the user which attribute (header) carries the email address.

  2. By turning on the user-email-using-tomcat=true which means the software will attempt to acquire the user's email from Tomcat.

The first option takes Precedence when specified. both options can be enabled to allow for fallback.

Property: authentication.shib.email-header
Example Value: authentication.shib.email-header = MAIL
Informational Note:The option specifies that the email comes from the mentioned header. This value is CASE-Sensitive.
 
Property: authentication.shib.firstname-header
Example Value: authentication.shib.firstname-header = SHIB-EP-GIVENNAME
Informational Note:Optional. Specify the header that carries the user's first name. This is going to be used for the creation of new-user.
 
Property: authentication.shib.lastname-header
Example Value: authentication.shib.lastname-header = SHIB-EP-SURNAME
Informational Note:Optional. Specify the header that carries user's last name. This is used for creation of new user.
 
Property: authentication.shib.email-use-tomcat-remote-user
Example Value: authentication.shib.email-use-tomcat-remote-user = true
Informational Note:This option forces the software to acquire the email from Tomcat.
 
Property: authentication.shib.autoregister
Example Value: authentication.shib.autoregister = true
Informational Note:Option will allow new users to be registered automatically if the IdP provides sufficient information (and the user does not exist in DSpace
 
Property:
authentication.shib.role-header
authentication.shib-role.header.ignore-scope
Example Value:
authentication.shib.role-header = Shib-EP-ScopedAffiliation
authentication.shib-role.header.ignore-scope = true
or
authentication.shib.role-header = Shib-EP-UnscopedAffiliation
 authentication.shib-role.header.ignore-scope = false
Informational Note:These two options specify which attribute that is responsible for providing user's roles to DSpace and unscope the attributes if needed. When not specified, it is defaulted to 'Shib-EP-UnscopedAffiliation', and ignore-scope is defaulted to 'false'. The value is specified in AAP.xml (Shib 1.3.x) or attribute-filter.xml (Shib 2.x). The value is CASE-Sensitive. The values provided in this header are separated by semi-colon or comma. If your sp only provides scoped role header, you need to set authentication.shib.role-header.ignore-Scope as true. For example if you only get Shib-EP-ScopedAffiliation instead of Shib-EP-ScopedAffiliation, you name to make your settings as in the example value above.
 
Property: authentication.shib.default-roles
Example Value: authentication.shib.default-roles = Staff, Walk-ins
Informational Note:When user is fully authN or IdP but would not like to release his/her roles to DSpace (for privacy reasons?), what should the default roles be given to such user. The values are separated by semi-colon or comma.
 
Property:
authentication.shib.role.Senior\ Researcher
authentication.shib.role.Librarian
Example Value:
authentication.shib.role.Senior\ Researcher = Researcher, Staff
authentication.shib.role.Librarian = Administrator
Informational Note:The following mappings specify role mapping between IdP and Dspace. The left side of the entry is IdP's role (prefixed with "authentication.shib.role.") which will be mapped to the right entry from DSpace. DSpace's group as indicated on the right entry has to EXIST in DSpace, otherwise user will be identified as 'anonymous'. Multiple values on the right entry should be separated by comma. The values are CASE-Sensitive. Heuristic one-to-one mapping will be done when the IdP groups entry are not listed below (i.e. if "X" group in IdP is not specified here, then it will be mapped to "X" group in DSpace if it exists, otherwise it will be mapped to simply 'anonymous'). Given sufficient demand, future release could support regex for the mapping special characters need to be escaped by '\'
 

5.2.10. Logging Configuration

The following settings are currently not used by XMLUI. XMLUI writes its logs to [dspace-xmlui]/WEB-INF-logs/ in the actual XMLUI web application.

Property: log.init.config
Example Value: log.init.config = ${dspace.dir}/config/log4j.properties
Informational Note:This is where your configure file is located. You may override default log4j configuration by provided your own configurations. Existing alternatives are:
log.init.config = ${dspace.dir}/config/log4j.xml
log.init.config = ${dspace.dir}/config/log4j-console.properties
 
Property: log.dir
Example value: log.dir = ${dspace.dir}/log
Informational Note:This is where to put the logs. (This is used for initial configuration only)

5.2.11. Configuring Lucene Search Indexes

Search indexes can be configured and customized easily in the dspace.cfg file. This allows institutions to choose which DSpace metadata fields are indexed by Lucene.

Property: search.dir
Example Value: search.dir = ${dspace.dir}/search
Informational NoteWhere to put the search index files
 
Property: search.max-clauses
Example Value: search.max-clauses = 2048
Informational NoteBy setting higher values of search.max-clauses will enable prefix searches to work on larger repositories.
 
Property: search.analyzer
Example Value: search.analyzer = org.dspace.search.DSAnalyzer
Informational NoteWhich Lucene Analyzer implementation to use. If this is omitted or commented out, the standard DSpace analyzer (designed for English) is used by default.
 
Property: search.analyzer
Example Value:
search.analyzer = \
     org.apache.lucene.analysis.cn.ChineseAnalyzer
Informational NoteInstead of the standard English analyzer, the Chinese analyzer is used.
 
Property: search.operator
Example Value: search.operator = OR
Informational NoteBoolean search operator to use. The currently supported values are OR and AND. If this configuration item is missing or commented out, OR is used. AND requires all the search terms to be present. OR requires one or more search terms to be present.
 
Property: search.maxfieldlength
Example Value: search.maxfieldlength = 10000
Informational NoteThis is the maximum number of terms indexed for a single field in Lucene. The default is 10,000 words—often not enough for full-text indexing. If you change this, you will need to re-index for the change to take effect on previously added items. -1 = unlimited (Integer.MAG_VALUE)
 
Property: search.index. n
Example Value: search.index.1 = author:dc.contributor.*
Informational NoteSee the details of this particular entry below.

For example, the following entries appear in the default DSpace installation:

search.index.1 = author:dc.contributor.*
search.index.2 = author:dc.creator.*
search.index.3 = title:dc.title.*
search.index.4 = keyword:dc.subject.*
search.index.5 = abstract:dc.description.abstract
search.index.6 = author:dc.description.statementofresponsibility
search.index.7 = series:dc.relation.ispartofseries
search.index.8 = abstract:dc.description.tableofcontents
search.index.9 = mime:dc.format.mimetype
search.index.10 = sponsor:dc.description.sponsorship
search.index.11 = id:dc.identifier.*
search.index.11 = language:dc.language.iso

The format of each entry is search.index.<id> = <search label> : <schema> . <metadata field> where:

<id> is an incremental number to distinguish each search index entry
<search label> is the identifier for the search field this index will correspond to
<schema> is the schema used. Dublin Core (DC) is the default. Others are possible.
<metadata field> is the DSpace metadata field to be indexed.

In the example above, search.index.1 and search.index.2 and search.index.3 are configured as the author search field. The author index is created by Lucene indexing all dc.contributor.*, dc.creator.* and description.statementofresponsibility metadata fields.

After changing the configuration run /[dspace]/bin/index-init to regenerate the indexes.

[Note]

While the indexes are created, this only affects the search results and has no effect on the search components of the user interface. One will need to customize the user interface to reflect the changes, for example, to add the a new search category to the Advanced Search.

[Tip]

In the above examples, notice the asterisk (*). The metadata field (at least for Dublin Core) is made up of the "element" and the "qualifier". The asterisk is used as the "wildcard". So, for example, keyword.dc.subject.* will index all subjects regardless if the term resides in a qualified field. (subject versus subject.lcsh). One could customize the search and only index LCSH (Library of Congress Subject Headings) with the following entry keyword:dc.subject.lcsh instead of keyword:dc.subject.*

5.2.12. Proxy Settings

These settings for proxy are commented out by default. Uncomment and specify both properties if proxy server is required for external http requests. Use regular host name without port number.

Property: http.proxy.host
Example Value http.proxy.host = proxy.myu.edu
Informational NoteEnter the host name without the port number.
 
Property: http.proxy.port
Example Value http.proxy.port = 2048
Informational NoteEnter the port number for the proxy server.

5.2.13. Configuring Media Filters

Media or Format Filters are classes used to generate derivative or alternative versions of content or bitstreams within DSpace. For example, the PDF Media Filter will extract textual content from PDF bitstreams, the JPEG Media Filter can create thumbnails from image bitstreams.

Media Filters are configured as Named Plugins, with each filter also having a separate configuration setting (in dspace.cfg) indicating which formats it can process. The default configuration is shown below.

Property: filter.plugins
Example Value:(See example below)
filter.plugins = PDF Text Extractor, Html Text Extractor, \
                         Word Text Extractor, JPEG Thumbnail
Informational Note:Place the names of the enabled MediaFilter or FormatFilter plugins. To enable Branded Preview, comment out the previous one line and then uncomment the two lines in found in dspace.cfg:
Word Text Extractor, JPEG Thumbnail,\
     Branded Preview JPEG
 
Property: plugin.named.org.dspace.app.mediafilter.FormatFilter
Example Value:(See example below)
plugin.named.org.dspace.app.mediafilter.FormatFilter = \
         org.dspace.app.mediafilter.PDFFilter = PDF Text Extractor, \
         org.dspace.app.mediafilter.HTMLFilter = HTML Text Extractor, \
         org.dspace.app.mediafilter.WordFilter = Word Text Extractor, \
         org.dspace.app.mediafilter.JPEGFilter = JPEG Thumbnail, \
         org.dspace.app.mediafilter.BrandedPreviewJPEGFilter = Branded Preview JPEG
Informational Note:Assign "human-understandable" names to each filter
 
Property:(See key below)
filter.org.dspace.app.mediafilter.PDFFilter.inputFormats
filter.org.dspace.app.mediafilter.HTMLFilter.inputFormats
filter.org.dspace.app.mediafilter.WordFilter.inputFormats
filter.org.dspace.app.mediafilter.JPEGFilter.inputFormats
filter.org.dspace.app.mediafilter.BrandedPreviewJPEGFilter.inputFormats
Example Value:(See example below)
filter.org.dspace.app.mediafilter.PDFFilter.inputFormats = Adobe PDF
filter.org.dspace.app.mediafilter.HTMLFilter.inputFormats = HTML, Text
filter.org.dspace.app.mediafilter.WordFilter.inputFormats = Microsoft Word
filter.org.dspace.app.mediafilter.JPEGFilter.inputFormats = BMP, GIF, JPEG, \ 
                                                            image/png
filter.org.dspace.app.mediafilter.BrandedPreviewJPEGFilter.inputFormats = BMP, \
                                                   GIF, JPEG, image/png
Informational Note:Configure each filter's input format(s)
 
Property: pdffilter.largepdfs
Example Value: pdffilter.largepdfs = true
Informational Note:It this value is set for "true", all PDF extractions are written to temp files as they are indexed. This is slower, but helps to ensure that PDFBox software DSpace uses does not eat up all your memory.
Property: pdffilter.skiponmemoryexception
Example Value: pdffilter.skiponmemoryexception = true
Informational Note:If this value is set for "true", PDFs which still result in an "Out of Memory" error from PDFBox are skipped over. These problematic PDFs will never be indexed until memory usage can be decreased in the PDFBox software.

Names are assigned to each filter using the plugin.named.org.dspace.app.mediafilter.FormatFilter field (e.g. by default the PDFilter is named "PDF Text Extractor".

Finally, the appropriate filter.<class path>.inputFormats defines the valid input formats which each filter can be applied. These format names must match the short description field of the Bitstream Format Registry.

You can also implement more dynamic or configurable Media/Format Filters which extend SelfNamedPlugin

5.2.14. Configurable MODS Dissemination Crosswalk

The MODS crosswalk is a self-named plugin. To configure an instance of the MODS crosswalk, add a property to the DSpace configuration starting with "crosswalk.mods.properties."; the final word of the property name becomes the plugin's name. For example, a property name crosswalk.mods.properties.MODS defines a crosswalk plugin named "MODS".

The value of this property is a path to a separate properties file containing the configuration for this crosswalk. The pathname is relative to the DSpace configuration directory, i.e. the config subdirectory of the DSpace install directory. Example from the dspace.cfgfile:

Properties:
crosswalk.mods.properties.MODS
crosswalk.mods.properties.mods
Example Values:
crosswalk.mods.properties.MODS = crosswalks/mods.properties
crosswalk.mods.properties.mods = crosswalks/mods.properties
Informational Note:This defines a crosswalk named MODS whose configuration comes from the file [dspace]/config/crosswalks/mods.properties. (In the above example, the lower-case name was added for OAI-PMH)

The MODS crosswalk properties file is a list of properties describing how DSpace metadata elements are to be turned into elements of the MODS XML output document. The property name is a concatenation of the metadata schema, element name, and optionally the qualifier. For example, the contributor.author element in the native Dublin Core schema would be: dc.contributor.author. The value of the property is a line containing two segments separated by the vertical bar ("|"): The first part is an XML fragment which is copied into the output document. The second is an XPath expression describing where in that fragment to put the value of the metadata element. For example, in this property:

dc.contributor.author = <mods:name><mods:role><mods:roleTerm
	                    type="text">author</mods:roleTerm>
                        </mods:role><mods:namePart>%s</mods:;
                        <mods:namePart>%s</mods:namePart></mods:name> |
                        mods:namePart/text() 

Some of the examples include the string "%s" in the prototype XML where the text value is to be inserted, but don't pay any attention to it, it is an artifact that the crosswalk ignores. For example, given an author named Jack Florey, the crosswalk will insert

<mods:name>
   <mods:role>
     <mods:roleTerm type="text">author</mods:roleTerm>
   </mods:role>
   <mods:namePart>  Jack Florey</mods:namePart>
</mods:name>

into the output document. Read the example configuration file for more details.

5.2.15. XSLT-based Crosswalks

The XSLT crosswalks use XSL stylesheet transformation (XSLT) to transform an XML-based external metadata format to or from DSpace's internal metadata. XSLT crosswalks are much more powerful and flexible than the configurable MODS and QDC crosswalks, but they demand some esoteric knowledge (XSL stylesheets). Given that, you can create all the crosswalks you need just by adding stylesheets and configuration lines, without touching any of the Java code.

The default settings in the dspace.cfg file for submission crosswalk:

Properties: crosswalk.submission.MODS.stylesheet
Example Value: crosswalk.submission.MODS.stylesheet = crosswalks/mods-submission.xsl
Informational Note:Configuration XSLT-driven submission crosswalk for MODS

As shown above, there are three (3) parts that make up the properties "key":

     crosswalk.submissionPluginName.stylesheet =
          1        2        3               4 

crosswalk first part of the property key.
submission second part of the property key.
PluginName is the name of the plugin. The path value is the path to the file containing the crosswalk stylesheet (relative to /[dspace]/config).

Here is an example that configures a crosswalk named "LOM" using a stylesheet in [dspace]/config/crosswalks/d-lom.xsl:

 crosswalk.submission.LOM.stylesheet = crosswalks/d-lom.xsl

A dissemination crosswalk can be configured by starting with the property key crosswalk.dissemination. Example:

     crosswalk.dissemination.PluginName.stylesheet = path

The PluginName is the name of the plugin (!). The path value is the path to the file containing the crosswalk stylesheet (relative to /[dspace]/config).

You can make two different plugin names point to the same crosswalk, by adding two configuration entries with the same path:

     crosswalk.submission.MyFormat.stylesheet = crosswalks/myformat.xslt
     crosswalk.submission.almost_DC.stylesheet = crosswalks/myformat.xslt

The dissemination crosswalk must also be configured with an XML Namespace (including prefix and URI) and an XML schema for its output format. This is configured on additional properties in the DSpace configuration:

     crosswalk.dissemination.PluginName.namespace.Prefix = namespace-URI
     crosswalk.dissemination.PluginName.schemaLocation = schemaLocation value 

For example:

     crosswalk.dissemination.qdc.namespace.dc = http://purl.org/dc/elements/1.1/
     crosswalk.dissemination.qdc.namespace.dcterms = http://purl.org/dc/terms/
     crosswalk.dissemination.qdc.schemalocation = http://purl.org/dc/elements/1.1/ \
     http://dublincore.org/schemas/xmls/qdc/2003/04/02/qualifieddc.xsd

5.2.15.1. Testing XSLT Crosswalks

The XSLT crosswalks will automatically reload an XSL stylesheet that has been modified, so you can edit and test stylesheets without restarting DSpace. You can test a dissemination crosswalk by hooking it up to an OAI-PMH crosswalk and using an OAI request to get the metadata for a known item.

Testing the submission crosswalk is more difficult, so we have supplied a command-line utility to help. It calls the crosswalk plugin to translate an XML document you submit, and displays the resulting intermediate XML (DIM). Invoke it with:

[dspace]/bin/dsrun
     org.dspace.content.crosswalk.XSLTIngestionCrosswalk [-l] plugin input-file

where plugin is the name of the crosswalk plugin to test (e.g. "LOM"), and input-file is a file containing an XML document of metadata in the appropriate format.

Add the -l option to pass the ingestion crosswalk a list of elements instead of a whole document, as if the List form of the ingest() method had been called. This is needed to test ingesters for formats like DC that get called with lists of elements instead of a root element.

5.2.16. Configurable Qualified Dublin Core (QDC) dissemination crosswalk

The QDC crosswalk is a self-named plugin. To configure an instance of the QDC crosswalk, add a property to the DSpace configuration starting with "crosswalk.qdc.properties."; the final word of the property name becomes the plugin's name. For example, a property name crosswalk.qdc.properties.QDC defines a crosswalk plugin named "QDC".

The following is from dspace.cfg file:

Properties: crosswalk.qdc.namspace.qdc.dc
Example Value: crosswalk.qdc.namspace.qdc.dc = http://purl.org/dc/elements/1.1
Properties: crosswalk.qdc.namspace.qdc.dcterms
Example Value: crosswalk.qdc.namspace.qdc.dc = http://purl.org/dc/terms/
Properties: crosswalk.qdc.schemaLocation.QDC
Example Value:
crosswalk.qdc.schemaLocation.QDC = http://www.purl.org/dc/terms \
      http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd \
      http://purl.org/dc/elements/1.1 \
      http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd
Properties: crosswalk.qdc.properties.QDC
Example Value: crosswalk.qdc.properties.QDC = crosswalks/QDC.properties
Informational Note:Configuration of the QDC Crosswalk dissemination plugin for Qualified DC. (Add lower-case name for OAI-PMH. That is, change QDC to qdc.)

In the property key "crosswalk.qdc.properties.QDC" the value of this property is a path to a separate properties file containing the configuration for this crosswalk. The pathname is relative to the DSpace configuration directory /[dspace]/config . Referring back to the "Example Value" for this property key, one has crosswalks/qdc.properties which defines a crosswalk named QDC whose configuration comes from the file [dspace]/config/crosswalks/qdc.properties.

You will also need to configure the namespaces and schema location strings for the XML output generated by this crosswalk. The namespaces properties names are formatted:

crosswalk.qdc.namespace.prefix = uri

where prefix is the namespace prefix and uri is the namespace URI. See the above Property and Example Value keys as the default dspace.cfg has been configured.

The QDC crosswalk properties file is a list of properties describing how DSpace metadata elements are to be turned into elements of the Qualified DC XML output document. The property name is a concatenation of the metadata schema, element name, and optionally the qualifier. For example, the contributor.author element in the native Dublin Core schema would be: dc.contributor.author. The value of the property is an XML fragment, the element whose value will be set to the value of the metadata field in the property key.

For example, in this property:

     dc.coverage.temporal = <dcterms:temporal />

the generated XML in the output document would look like, e.g.:

     <dcterms:temporal>Fall, 2005</dcterms:temporal>

5.2.17. Configuring Crosswalk Plugins

Ingestion crosswalk plugins are configured as named or self-named plugins for the interface org.dspace.content.crosswalk.IngestionCrosswalk. Dissemination crosswalk plugins are configured as named or self-named plugins for the interface org.dspace.content.crosswalk.DisseminationCrosswalk.

You can add names for existing crosswalks, add new plugin classes, and add new configurations for the configurable crosswalks as noted below.

5.2.18. Configuring Packager Plugins

Package ingester plugins are configured as named or self-named plugins for the interface org.dspace.content.packager.PackageIngester. Package disseminator plugins are configured as named or self-named plugins for the interface org.dspace.content.packager.PackageDisseminator.

You can add names for the existing plugins, and add new plugins, by altering these configuration properties. See the Plugin Manager architecture for more information about plugins.

5.2.19. Event System Configuration

Properties:
event.dispatcher.default.class
 event.dispatcher.default.consumers
Example Value:
event.dispatcher.default.class = org.dspace.event.BasicDispatcher
event.dispatcher.default.consumers = search, browse, eperson
Informational Note:This is the default synchronous dispatcher (Same behavior as traditional DSpace).
 
Properties:
event.dispatcher.noindex.class
event.dispatcher.noindex.consumers
Example Value:
event.dispatcher.noindex.class = org.dspace.event.BasicDispatcher
event.dispatcher.noindex.consumers = eperson
Informational Note:The noindex dispatcher will not create search or browse indexes (useful for batch item imports).
 
Properties:
event.consumer.search.class
event.consumer.search.filters
Example Value:(See example below)
event.consumer.search.class = org.dspace.search.SearchConsumer
event.consumer.search.filters =  \
Community|Collection|Item|Bundle+Add|Create|Modify|Modify_Metadata|Delete|Remove
Informational Note:Consumer to maintain the search index.
 
Properties:
event.consumer.browse.class
event.consumer.browse.filters
Example Value:(See example below)
event.consumer.browse.class = org.dspace.browse.BrowseConsumer
event.consumer.browse.filters = \
Community|Collection|Item|Bundle+Add|Create|Modify|Modify_Metadata|Delete|Remove
Informational Note:Consumer to maintain the browse index.
 
Properties:
event.consumer.eperson.class
event.consumer.eperson.filters
Example Value:
event.consumer.eperson.class = org.dspace.eperson.EPersonConsumer
event.consumer.eperson.filters = EPerson+Create
Informational Note:Consumer related to EPerson changes
 
Properties:
event.consumer.test.class
event.consumer.test.filters
Example Value:
event.consumer.test.class = org.dspace.event.TestConsumer
event.consumer.test.filters = All+All
Informational Note:Test consumer for debugging and monitoring. Commented out by default.
 
Properties: testConsumer.verbose
Example Value: testConsumer.verbose = true
Informational Note:Set this to true to enable testConsumer messages to standard output. Commented out by default.

5.2.20. Checksum Checker Settings

DSpace now comes with a Checksum Checker script ([dspace]/bin/checker) which can be scheduled to verify the checksum of every item within DSpace. Since DSpace calculates and records the checksum of every file submitted to it, this script is able to determine whether or not a file has been changed (either manually or by some sort of corruption or virus). The idea being that the earlier you can identify a file has changed, the more likely you'd be able to recover it (assuming it was not a wanted change).

Property: plugin.single.org.dspace.checker.BitstreamDispatcher
Example Value: plugin.single.org.dspace.checker.BitstreamDispatcher = org.dspace.checker.SimpleDispatcher
Informational Note:The Default dispatcher is case non is specified.
 
Property: checker.retention.default
Example Value: checker.retention.default = 10y
Informational Note:This option specifies the default time frame after which all checksum checks are removed from the database (defaults to 10 years). This means that after 10 years, all successful or unsuccessful matches are removed from the database.
 
Property: checker.retention.CHECKSUM_MATCH
Example Value: checker.retention.CHECKSUM_MATCH = 8w
Informational Note:This option specifies the time frame after which a successful “match” will be removed from your DSpace database (defaults to 8 weeks). This means that after 8 weeks, all successful matches are automatically deleted from your database (in order to keep that database table from growing too large).

5.2.21. Item Export and Download Settings

It is possible for an authorized user to request a complete export and download of a DSpace item in a compressed zip file. This zip file may contain the following:

dublin_core.xml
license.txt
contents (listing of the contents)
handle
file itself and the extract file if available

The configuration settings control several aspects of this feature:

Property: org.dspace.app.itemexport.work.dir
Example Value: org.dspace.app.itemexport.work.dir = ${dspace.dir}/exports
Informational Note:The directory where the exports will be done and compressed.
 
Property: org.dspace.app.itemexport.download.dir
Example Value: org.dspace.app.itemexport.download.dir = ${dspace.dir}/exports/download
Informational NoteThe directory where the compressed files will reside and be read by the downloader.
 
Property: org.dspace.app.itemexport.life.span.hours
Example Value: org.dspace.app.itemexport.life.span.hours = 48
Informational NoteThe length of time in hours each archive should live for. When new archives are created this entry is used to delete old ones.
 
Property: org.dspace.app.itemexport.max.size
Example Value: org.dspace.app.itemexport.max.size = 200
Informational NoteThe maximum size in Megabytes (Mb) that the export should be. This is enforced before the compression. Each bitstream's size in each item being exported is added up, if their cumulative sizes are more than this entry the export is not kicked off.

5.2.22. Subscription Emails

DSpace, through some advanced installation and setup, is able to send out an email to collections that a user has subscribed. The user who is subscribed to a collection is emailed each time an item id added or modified. The following property key controls whether or not a user should be notified of a modification.

Property: eperson.subscription.onlynew
Example Value: eperson.subscription.onlynew = true
Informational Note:For backwards compatibility, the subscription emails by default include any modified items. The property key is COMMENTED OUT by default.

5.2.23. Settings for the Submission Process

These settings control two aspects of the submission process: thesis submission permission and whether or not a bitstream file is required when submitting to a collection.

Property: webui.submit.blocktheses
Example Value: webui.submit.blocktheses = false
Informational Note:Controls whether or not that the submission should be marked as a thesis.
 
Property: webui.submit.upload.required
Example Value: webui.submit.upload.required = true
Informational Note:Whether or not a file is required to be uploaded during the "Upload" step in the submission process. The default is true. If set to "false", then the submitter (human being) has the option to skip the uploading of a file.

5.2.24. Configuring Creative Commons License

This enables the Creative Commons license step in the submission process of the JSP User Interface (JSPUI). Submitters are given an opportunity to select a Creative Common license to accompany the item. Creative Commons license govern the use of the content. For further details, refer to the Creative Commons website at http://creativecommons.org .

Property: webui.submit.enable-cc
Example Value: webui.submit.enable-cc = false
Informational Note: Set key to "false" if you are not using CC License. Set key to "true" if you are using CC License.

5.2.25. WEB User Interface Configurations

General Web User Interface Configurations

In this section of Configuration, we address the agnostic WEB User Interface that is used for JSP UI and XML UI. Some of the configurations will give information towards customization or refer you to the appropriate documentation.

Property: webui.browse.thubnail.show
Example Value: webui.browse.thubnail.show = true
Informational Note:Controls whether to display thumbnails on browse and search result pages. If you have customized the Browse columnlist, then you must also include a "thumbnail" column in your configuration. (This configuration property key is not used by XMLUI. To show thumbnails using XMLUI, you need to create a theme which displays them).
 
Property:
webui.browse.thumbnail.maxheight
webui.browse.thumbnail.maxwidth
Example Value:
webui.browse.thumbnail.maxheight = 80
webui.browse.thumbnail.maxwidth = 80
Informational Note:This determines the maximum height and maximum width of the browse/search thumbnails in pixels (px). This only needs to be set if the thumbnails are required to be smaller than the dimensions of thumbnails generated by MediaFilter.
 
Property: webui.itme.thumbnail.show
Example Value: webui.itme.thumbnail.show = true
Informational Note:This determines whether or not to display the thumbnail against each bitstream. (This configuration property key is not used by XMLUI. To show thumbnails using XMLUI, you need to create a theme which displays them).
 
Property: webui.browse.thumbnail.linkbehavior
Example Value: webui.browse.thumbnail.linkbehavior = item
Informational Note:This determines where should the clicking on the thumbnail from browse/search screens take the user. The only values currently supported are "item" or "bitstream".
 
Property:
thumbnail.maxwidth
thumbnail.maxheight
Example Value:
thumbnail.maxwidth 80
thumbnail.maxheight 80
Informational Note:This is the where one sets the maximum height and width of generated thumbnails.
 
Property: webui.preview.enabled
Example Value: webui.preview.enabled = false
Informational Note:Whether or not the user can "preview" the image.
 
Property:
webui.preview.maxwidth
webui.preview.maxheight
Example Value:
webui.preview.maxwidth = 600
webui.preview.maxheight = 600
Informational Note:This sets the maximum dimensions for the preview image.
 
Property: webui.preview.brand
Example Value: webui.preview.brand = My Institution Name
Informational Note:This is the brand text that will appear with the image.
 
Property: webui.preview.brand.abbrev
Example Value: webui.preview.brand.abbrev = MyOrg
Informational Note:An abbreviated form of the full Branded Name. This will be used when the preview image cannot fit the normal text.
 
Property:webui.preview.brand.height
Example Value:webui.preview.brand.height = 20
Informational Note:The height (in px) of the brand
 
Property:
webui.preview.brand.font
webui.preview.brand.fontpoint
Example Value:
webui.preview.brand.font = SanSerif
webui.preview.brand.fontpoint = 12
Informational Note:The font settings for the brand text.
 
Property: webui.preview.dc
Example Value: webui.preview.dc = rights
Informational Note:The Dublin Core field that will display along with the preview.
 
Property: webui.strengths.show
Example Value: webui.strengths.show = false
Informational Note:Determines if communities and collections should display item counts when listed. The default behavior if omitted, is true. (This configuration property key is not used by XMLUI. To show thumbnails using XMLUI, you need to create a theme which displays them).
 
Property: webui.strengths.cache
Example Value: webui.strengths.cache = false
Informational Note:When showing the strengths, should they be counted in real time, or fetched from the cache. Counts fetched in real time will perform an actual count of the database contents every time a page with this feature is requested, which will not scale. If you set the property key is set to cache ("true") you must run the following command periodically to update the count: /[dspace]/bin/dsrun org.dspace.browse.ItemCounter. The default is to count in real time (set to "false").

5.2.26. Browse Index Configuration

The browse indexes for DSpace can be extensively configured. This section of the configuration allows you to take control of the indexes you wish to browse, and how you wish to present the results. The configuration is broken into several parts: defining the indexes, defining the fields upon which users can sort results, defining truncation for potentially long fields (e.g. authors), setting cross-links between different browse contexts (e.g. from an author's name to a complete list of their items), how many recent submissions to display, and configuration for item mapping browse.

Property: webui.browse.index.<n>
Example Value: webui.browse.index.1 = dateissued:metadata:dc.date.issued:date:full
Informational Note:This is an example of how one "Defines the Indexes". SeeDefining the Indexes in the next sub-section.
 
Property: webui.itemlist.sort-option.<n>
Example Value: webui.itemlist.sort-option.1 = title:dc.title:title
Informational Note:This is an example of how one "Defines the Sort Options". See Defining Sort Options in the following sub-section.

5.2.26.1. Defining the Indexes.

DSpace arrives with four default indexes already defined: author, title, date issued, and subjects. Users may also define additional indexes or re-configure the current indexes for different levels of specificity. For example, the default entries that appear in the dspace.cfg as default installation:

webui.browse.index.1 = dateissued:metadata:dc.date.issued:date:full
webui.browse.index.2 = author:metadata:dc.contributor.*:text
webui.browse.index.3 = title:metadata:dc.title:title:full
webui.browse.index.4 = subject:metadata:dc.subject.*:text
#webui.browse.index.5 = dateaccessioned:item:dateaccessioned

The format of each entry is webui.browse.index.<n> = <index name>:<metadata>:<schema prefix>.<element>.<qualifier>:<data-type field>:<sort option>. Please notice that the punctuation is paramount in typing this property key in the dspace.cfg file. The following table explains each element:

Element Definition and Options (if available)
webui.browse.index. n n is the index number. The index numbers must start from 1 and increment continuously by 1 thereafter. Deviation form this will cause an error during install or a configuration update. So anytime you add a new browse index, remember to increase the number. (Commented out index numbers may be used over again).
<index name> The name by which the index will be identified. You will need to update your Messages.properties file to match this field. (The form used in the Messages.properties file is: browse.type.metadata.<index name> .
<metadata> Only two options are available: "metadata" or "item"
<schema prefix> The schema used for the field to be index. The default is dc (for Dublin Core).
<element> The schema element. In Dublin Core, for example, the author element is referred to as "Contributor". The user should consult the default Dublin Core Metadata Registry table in Appendix A.
<qualifier> This is the qualifier to the <element> component. The user has two choices: an asterisk "*" or a proper qualifier of the element. The asterisk is a wildcard and causes DSpace to index all types of the schema element. For example, if you have the element "contributor" and the qualifier "*" then you would index all contributor data regardless of the qualifier. Another example, you have the element "subject" and the qualifier "lcsh" would cause the indexing of only those fields that have the qualifier "lcsh". (This means you would only index Library of Congress Subject Headings and not all data elements that are subjects.
<datatype field> This refers to the datatype of the field:
date the index type will be treated as a date object
title the index type will be treated like a title, which will include a link to the item page
text the index type will be treated as plain text. If single mode is specified then this will link to the full mode list
<index display> Choose full or single. This refers to the way that the index will be displayed in the browse listing. "Full" will be the full item list as specified by webui.itemlist.columns ; "single" will be a single list of only the indexed term.

If you are customizing this list beyond the default, you will need to insert the text you wish to appear in the navigation and on link and buttons. You need to edit the Messages.properties file. The form of the parameter(s) in the file:

     browse.type.<index name>

5.2.26.2. Defining Sort Options

Sort options will be available when browsing a list of items (i.e. only in "full" mode, not "single" mode). You can define an arbitrary number of fields to sort on, irrespective of which fields you display using web.itemlist.columns. For example, the default entries that appear in the dspace.cfg as default installation:

webui.itemlist.sort-option.1 = title:dc.title:title
webui.itemlist.sort-option.2 = dateissued:dc.date.issued:date
webui.itemlist.sort-option.3 = dateaccessioned:dc.date.accessioned:date

The format of each entry isweb.browse.sort-option.<n> = <option name>:<schema prefix>.<element>.<qualifier>:<datatype>. Please notice the punctuation used between the different elements. The following table explains the each element:

Element Definition and Options (if available)
webui.browse.index. n n is an arbitrary number you choose.
<option name> The name by which the sort option will be identified. This may be used in later configuration or to locate the message key (found in Messages.properties file) for this index.
<schema prefix> The schema used for the field to be index. The default is dc (for Dublin Core).
<element> The schema element. In Dublin Core, for example, the author element is referred to as "Contributor". The user should consult the default Dublin Core Metadata Registry table in Appendix A.
<qualifier> This is the qualifier to the <element> component. The user has two choices: an asterisk "*" or a proper qualifier of the element.
<datatype field> This refers to the datatype of the field:
date the sort type will be treated as a date object
text the sort type will be treated as plain text.

5.2.26.3. Browse Index Normalization Rule Configuration

Normalization Rules are those rules that make it possible for the indexes to intermix entries without regard to case sensitivity. By default, the display of metadata in the browse indexes are case-sensitive. In the example below, you retrieve separate entries:

Twain, Mark
twain, mark
TWAIN, MARK

However, clicking through from either of these will result in the same set of items (i.e., any item that contains either representation in the correct field).

Property: webui.browse.metadata.case-insensitive
Example Value: webui.browse.metadata.case-insensitive = true
Informational Note:This controls the normalization of the index entry. Uncommenting the option (which is commented out by default) will make the metadata items case-insensitive. This will result in a single entry in the example above. However, the value displayed may be any one of the above—depending on what representation was present in the first item indexed.

At the present time, you would need to edit your metadata to clean up the index presentation.

5.2.26.4. Other Browse Options

We set other browse values in the following section.

Property:webui.browse.value_columns.max
Example Value:webui.browse.value_columns.max = 500
Informational Note:This sets the options for the size (number of characters) of the fields stored in the database. The default is 0, which is unlimited size for fields holding indexed data. Some database implementations (e.g. Oracle) will enforce their own limit on this field size. Reducing the field size will decrease the potential size of your database and increase the speed of the browse, but it will also increase the chance of mis-ordering of similar fields. The values are commented out, but proposed values for reasonably performance versus result quality. This affects the size of field for the browse value (this will affect display, and value sorting )
 
Property:webui.browse.sort_columns.max
Example Value:webui.browse.sort_columns.max = 200
Informational Note:Size of field for hidden sort columns (this will affect only sorting, not display). Commented out as default.
 
Property:webui.browse.value_columns.omission_mark
Example Value:webui.browse.value_columns.omission_mark = ...
Informational Note:Omission mark to be placed after truncated strings in display. The default is "...".
 
Property:plugin.named.org.dspace.sort.OrderFormatDelegate
Example Value:
plugin.named.org.dspace.sort.OrderFormatDelegate = \
org.dspace.sort.OrderFormatTitleMarc21=title
Informational Note:

This sets the option for how the indexes are sorted. All sort normalizations are carried out by the OrderFormatDelegate. The plugin manager can be used to specify your own delegates for each datatype. The default datatypes (and delegates) are:

author = org.dspace.sort.OrderFormatAuthor
title  = org.dspace.sort.OrderFormatTitle
text   = org.dspace.sort.OrderFormatText

If you redefine a default datatype here, the configuration will be used in preferences to the default. However, if you do not explicitly redefine a datatype, then the default will still be used in addition to the datatypes you do specify.

As of DSpace release 1.5.2, the multi-lingual MARC21 title ordering is configured as default, as shown in the example above. To use the previous title ordering (before release 1.5.2), comment out the configuration in your dspace.cfg file.

5.2.27. Author (Multiple metadata value) Display

[Note]

This section actually applies to any field with multiple values, but authors are the define case and example here.

Property:webui.browse.author-field
Example Value:webui.browse.author-field = dc.contributor.*
Informational Note:This defines which field is the author/editor, etc. listing.

Replace dc.contributor.* with another field if appropriate. The field should be listed in the configuration for webui.itemlist.columns, otherwise you will not see its effect. It must also be defined in webui.itemlist.columns as being of the datatype text otherwise the functionality will be overridden by the specific data type feature. (This setting is not used by the XMLUI as it is controlled by your theme).

Now that we know which field is our author or other multiple metadata value field we can provide the option to truncate the number of values displayed by default. We replace the remaining list of values with "et al" or the language pack specific alternative. Note that this is just for the default, and users will have the option of changing the number displayed when they browse the results. See the following table:

Property:webui.browse.author-limit
Example Value:webui.browse.author-limit = <n>
Informational Note:Where <n> is an integer number of values to be displayed. Use -1 for unlimited (the default value).

5.2.28. Links to Other Browse Contexts

We can define which fields link to other browse listings. This is useful, for example, to link an author's name to a list of just that author's items. The effect this has is to create links to browse views for the item clicked on. If it is a "single" type, it will link to a view of all the items which share that metadata element in common (i.e. all the papers by a single author). If it is a "full" type, it will link to a view of the standard full browse page, starting with the value of the link clicked on.

Property:webui.browse.link.<n>
Example Value:webui.browse.link.1 = author:dc.contributor.*
Informational Note:This is used to configure which fields should link to other browse listings. This should be associated with the name of one of the browse indexes (webui.browse.index.n) with a metadata field listed in webui.itemlist.columns above. If this condition is not fulfilled, cross-linking will not work. Note also that crosslinking only works for metadata fields not tagged as title in webui.itemlist.columns.

The format of the property key is webui.browse.link.<n> = <index name>:<display column metadata> Please notice the punctuation used between the elements.

Element Definition and Options (if available)
webui.browse.link.nn is an arbitrary number you choose
<index name>This need to match your entry for the index name from webui.browse.index property key.
<display column metadata>Use the DC element (and qualifier)

Examples of some browse links used in a real DSpace installation instance:

webui.browse.link.1 = author:dc.contributor.* Creates a link for all types of contributors (authors, editors, illustrators, others, etc.)webui.browse.link.2 = subject:dc.subject.lcsh Creates a link to subjects that are Library of Congress only. In this case, you have a browse index that contains only LC Subject Headings
webui.browse.link.3 = series:dc.relation.ispartofseries Creates a link for the browse index "Series". Please note this is again, a customized browse index and not part of the DSpace distributed release. 

5.2.29. Recent Submissions

This allows us to define which index to base Recent Submission display on, and how many we should show at any one time. This uses the PluginManager to automatically load the relevant plugin for the Community and Collection home pages. Values given in examples are the defaults supplied in dspace.cfg

Property:recent.submission.sort-option
Example Value:recent.submission.sort-option = dateaccessioned
Informational Note:First is to define the sort name (from webui.browse.sort-options) to use for displaying recent submissions.
 
Property:recent.submissions.count
Example Value:recent.submissions.count = 5
Informational Note:Defines how many recent submissions should be displayed at any one time.

There will be the need to set up the processors that the PluginManager will load to actually perform the recent submissions query on the relevant pages. This is already configured by default dspace.cfg so there should be no need for the administrator/programmer to worry about this.

plugin.sequence.org.dspace.plugin.CommunityHomeProcessor = \
        org.dspace.app.webui.components.RecentCommunitySubmissions

plugin.sequence.org.dspace.plugin.CollectionHomeProcessor = \
        org.dspace.app.webui.components.RecentCollectionSubmissions

5.2.30. Syndication Feed (RSS) Settings

This will enable syndication feeds—links display on community and collection home pages. This setting is not used by the XMLUI, as you enable feeds in your theme.

Property:webui.feed.enable
Example Value:webui.feed.enable = false
Informational Note:By default, RSS feeds are set to false (off). Change key to "true" to enable.
 
Property:webui.feed.items
Example Value:webui.feed.items = 4
Informational Note:Defines the number of DSpace items per feed (the most recent submissions)
 
Property:webui.feed.cache.size
Example Value:webui.feed.cache.size = 100
Informational Note:Defines the maximum number of feeds in memory cache. Value of "0" will disable caching.
 
Property:webui.feed.cache.age
Example Value:webui.feed.cache.age = 48
Informational Note:Defines the number of hours to keep cached feeds before checking currency. The value of "0" will force a check with each request.
 
Property:webui.feed.formats
Example Value:webui.feed.formats = rss_1.0,rss_2.0,atom_1.0
Informational Note:Defines which syndication formats to offer. You can use more than one; use a comma-separated list. The following list are the available values: rss_0.90, rss_0.91, rss_0.92, rss_0.93, rss_0.94, rss_1.0, rss_2.0, atom_1.0.
 
Property:webui.feed.localresolve
Example Value:webui.feed.localresolve = false
Informational Note:By default, (set to false), URLs returned by the feed will point at the global handle resolver (e.g. http://hdl.handle.net/123456789/1). If set to true the local server URLs are used (e.g. http://myserver.myorg/handle/123456789/1).
 
Property:
webui.feed.item.title
webui.feed.item.date
Example Value:
webui.feed.item.title = dc.title
webui.feed.item.date = dc.date.issued
Informational Note:This set of keys customize each single-value field displayed in the feed information for each item. Each of the fields takes a *single* metadata field. The form of the key is <scheme prefix>.<element>.<qualifier> In place of the qualifier, one may leave it blank to exclude any qualifiers or use the wildcard "*" to include all qualifiers for a particular element.
 
Property:webui.feed.item.description
Example Value:(See example below)
webui.feed.item.description = dc.title, dc.contributor.author, \
           dc.contributor.editor, dc.description.abstract, \
           dc.description
Informational Note:

One can customize the metadata fields to show in the feed for each item's description. Elements are displayed in the order they are specified in dspace.cfg.

Like other property keys, the format of this property key is: webui.feed.item.description = <scheme prefix>.<element>.<qualifier>. In place of the qualifier, one may leave it blank to exclude any qualifiers or use the wildcard "*" to include all qualifiers for a particular element.

 
Property:webui.feed.item.author
Example Value:webui.feed.item.author = dc.contributor.author
Informational Note:The name of field to use for authors (Atom only); repeatable.
 
Property:webui.feed.logo.url
Example Value:webui.feed.logo.url = ${dspace.url}/themes/mysite/images/mysite-logo.png
Informational Note:Customize the image icon included with the site-wide feeds. This must be an absolute URL
 
Property:
webui.feed.item.dc.creator
webui.feed.item.dc.date
webui.feed.item.dc.description
						
Example Value:
webui.feed.item.dc.creator = dc.contributor.author
webui.feed.item.dc.date = dc.date.issued
webui.feed.item.dc.description = dc.description.abstract
Informational Note:These optional properties add structured DC elements as XML elements to the feed description. They are not the same thing as, for example, webui.feed.item.description. Useful when a program or stylesheet will be transforming a feed and wants separate author, description, date, etc.

5.2.31. Content Inline Disposition Threshold

The following configuration is used to change the disposition behavior of the browser. That is, when the browser will attempt to open the file or download it to the user's specified location. For example, the default size is 8Mb. When an item being viewed is larger than 8MB, the browser will download the file to the desktop (or wherever you have it set to download) and the user will have to open it manually.

Property:webui.content_disposition_threshold
Example value:webui.content_disposition_threshold = 8388608
Informational Note:The default value is set to 8Mb. This property key applies to the JSPUI interface.
 
Property:xmlui.content_disposition_threshold
Example Value:xmlui.content_disposition_threshold = 8388608
Informational Note:The default value is set to 8Mb. This property key applies to the XMLUI (Manakin) interface.

Other values are possible:

4 MB = 4194304
8 MB = 8388608
16 MB = 16777216

5.2.32. Multi-file HTML Document/Site Settings

The setting is used to configure the "depth" of request for html documents bearing the same name.

Property:
webui.html.max-depth-guess
xmlui.html.max-depth-guess
Example Value:
webui.html.max-depth-guess = 3
xmlui.html.max-depth-guess = 3
Informational Note:When serving up composite HTML items, how deep can the request be for us to serve up a file with the same name? For example, if one receives a request for "foo/bar/index.html" and one has a bitstream called just "index.html", DSpace will serve up the former bitstream (foo/bar/index.html) for the request if webui.html.max-depth-guess is 2 or greater. If webui.html.max-depth-guess is 1 or less, then DSpace would not serve that bitstream, as the depth of the file is greater. If webui.html.max-depth-guess is zero, the request filename and path must always exactly match the bitstream name. The default is set to 3.

5.2.33. Sitemap Settings

To aid web crawlers index the content within your repository, you can make use of sitemaps.

Property:sitemap.dir
Example Value:sitemap.dir = ${dspace.dir}/sitemaps
Informational Note:The directory where the generate sitemaps are stored.
 
Property:sitemap.engineurls
Example Value:sitemap.engineurls = http://www.google.com/webmasters/sitemaps/ping?sitemap=
Informational Note:

Comma-separated list of search engine URLs to 'ping' when a new Sitemap has been created. Include everything except the Sitemap UL itself (which will be URL-encoded and appended to form the actual URL 'pinged').

Add the following to the above parameter if you have an application ID with Yahoo: http://search.yahooapis.com/SiteExplorererService/V1/updateNotification?appid=REPLACE_ME?url=. (Replace the component REPLACE_ME with your application ID). There is no known 'ping' URL for MSN/Live search.

5.2.34. Upload File Settings

Property:upload.temp.dir
Example Value:upload.temp.dir = ${dspace.dir}/upload
Informational Note:Where to temporarily store uploaded files.
 
Property:upload.max
Example Value:upload.max = 536870912
Informational Note:Maximum size of uploaded files in bytes. A negative setting will result in no limit being set. The default is set for 512Mb.

5.2.35. Statistical Report Configuration Setting

Property:report.public
Example Value:report.public = false
Informational Note:Controls whether or not the stats can be publicly available. Set it to false (the default) if you only want administrators to access the stats, or you do not intended to generate any statistics.
 
Property:report.dir
Example Value:report.dir = ${dspace.dir}/reports
Informational Note:Directory where the live reports are stored.

5.2.36. JSP Web Interface (JSPUI) Settings

The following section is limited to JSPUI. If the user wishes to use XMLUI settings, please refer to Chapter 7: XMLUI Configuration and Customization.

Property:webui.licence_bundle.show
Example Value:webui.licence_bundle.show = false
Informational Note:Sets whether to display the contents of the license bundle (often just the deposit license in the standard DSpace installation).
 
Property:webui.itemdisplay.default
Example Value:(See example below)
webui.itemdisplay.default = dc.title, dc.title.alternative, \
           dc.contributor.*, dc.subject, dc.data.issued(date), \
           dc.publisher, dc.identifier.citation, \
           dc.relation.ispartofseries, dc.description.abstract, \
           dc.description, dc.identifier.govdoc, \
           dc.identifier.uri(link), dc.identifier.isbn, \
           dc.identifier.issn, dc.identifier.ismn, dc.identifier
Informational Note:

This is used to customize the DC metadata fields that display in the itemdisplay (the brief display) when pulling up a record. The format is: <schema>.<element>.<optional qualifier> In place of the qualifier, one can use the wildcard "*" to include all fields of the same element, or, leave it blank for unqualified elements. Additionally, two additional options are available for behavior/rendering: (date) and (link). See the following examples:
dc.title = Dublin Core element 'title' (unqualified)
dc.title.alternative = DC element 'title', qualifier 'alternative'
dc.title.* = All fields with Dublin Core element 'title' (any or no qualifier)
dc.identifier.uri(link) = DC identifier.uri, rendered as a link
dc.date.issued(date) = DC date.issued, rendered as a date

The Messages.properties file controls how the fields defined above will display to the user. If the field is missing from the Messages.propertiesfile, it will not be display. Look in Messages.properties under metadata.dc.<field>. Example:

metadata.dc.contributor.other       = Authors
metadata.dc.contributor.author      = Authors
metadata.dc.title.*                 = Title

Please note: The order in which you place the values to the property key control the order in which they will display to the user on the outside world. (See the Example Value above).

 
Property:
webui.resolver.1.urn
webui.resolver.1.baseurl
webui.resolver.2.urn
webui.resolver.2.baseurl
Example Value:
webui.resolver.1.urn = doi
webui.resolver.1.baseurl = http://dx.doi.org/
webui.resolver.2.urn = hdl
webui.resolver.2.baseurl = http://hdl.handle.net/
Informational Note:

When using "resolver" in webui.itemdisplay to render identifiers as resolvable links, the base URL is take from <code>webui.resolver.<n>.baseurl<code> where <code>webui.resolver.<n>.baseurl<code> matches the urn specified in the metadata value. The value is appended to the "baseurl" as is, so the baseurl needs to end with the forward slash almost in any case. If no urn is specified in the value it will be displayed as simple text.

For the doi and hdl urn defaults values are provided, respectively http://dc.doi.org and http://hdl.handle.net are used.

If a metadata value with style "doi", "handle" or "resolver" matches a URL already, it is simply rendered as a link with no other manipulation.

 
Property:plugin.single.org.dspace.app.webui.util.StyleSelection
Example Value:
plugin.single.org.dspace.app.webui.util.StyleSelection = \
  org.dspace.app.web.util.CollectionStyleSelection
  #org.dspace.app.web.util.MetadataStyleSelection
Informational Note:Specify which strategy to use for select the style for an item.
 
Property:webui.itemdisplay.thesis.collections
Example Value:webui.itemdisplay.thesis.collections = 123456789/24, 123456789/35
Informational Note:Specify which collections use which views by Handle.
 
Property:
webui.itemdisplay.metadata-style
webui.itemdisplay.metadata-syle
Example Value:
webui.itemdisplay.metadata-style = schema.element[.qualifier|.*]
webui.itemdisplay.metadata-syle = dc.type
Informational Note:Specify which metadata to use as name of the style
 
Property:webui.itemlist.columns
Example Value:
webui.itemlist.columns = thumbnail, dc.date.issued(date), dc.title, \
          dc.contributor.*
Informational Note:

Customize the DC fields to use in the item listing page. Elements will be displayed left to right in the order they are specified here. The form is <schema prefix>.<element>[.<qualifier>|.*][(date)], ...

Although not a requirement, it would make sense to include among the listed fields at least the date and title fields as specified by the webui.browse.index.* configuration options in the next section mentioned. (cf.)

If you have enabled thumbnails (webui.browse.thumbnail.show), you must also include a 'thumbnail' entry in your columns—this is where the thumbnail will be displayed.

 
Property:webui.itemlist.width
Example Value:webui.itemlist.width = *, 130, 60%, 40%
Informational Note:You can customize the width of each column with the following line—you can have numbers (pixels) or percentages. For the 'thumbnail' column, a setting of '*' will use the max width specified for browse thumbnails (cf. webui.browse.thumbnail.maxwidth, thumbnail.maxwidth)
 
Property:
webui.itemlist.browse.<index name>.sort.<sort name>.columns
webui.itemlist.sort.<sort name>.columns
webui.itemlist.browse.<browse name>.columns
webui.itemlist.<sort or index name>.columns
Example Value:
Informational Note:

You can override the DC fields used on the listing page for a given browse index and/or sort option. As a sort option or index may be defined on a field that isn't normally included in the list, this allows you to display the fields that have been indexed/sorted on.

There are a number of forms the configuration can take, and the order in which they are listed below is the priority in which they will be used (so a combination of an index name and sort name will take precedence over just the browse name).

In the last case, a sort option name will always take precedence over a browse index name. Note also, that for any additional columns you list, you will need to ensure there is an itemlist.<field name> entry in the messages file.

 
Property:webui.itemlist.dateaccessioned.columns
Example Value:webui.itemlist.dateaccessioned.columns = thumbnail, dc.date.accessioned(date), dc.title, dc.contributor.*
Informational Note:This would display the date of the accession in place f the issue date whenever the dateaccessioned browsed index or sort option is selected. Just like webui.itemlist.columns, you will need to include a 'thumbnail' entry to display the thumbnails in the item list.
 
Property:webui.itemlist.dateaccessioned.widths
Example Value:webui.itemlist.dateaccessioned.widths = *, 130, 60%, 40%
Informational Note:As in the aforementioned property key, you can customize the width of the columns for each configured column list, substituting '.widths' for '.columns' in the property name. See the setting for webui.itemlist.widthsfor more information.
 
Property:webui.itemlist.tablewidth
Example Value:webui.itemlist.tablewidth = 100%
Informational Note:You can also set the overall size of the item list table with the following setting. It can lead to faster table rendering when used with the column widths above, but not generally recommended.

5.2.37. Configuring Multilingual Support

5.2.37.1. Setting the Default Language for the Application

Property:default.locale
Example Value:default.locale = en
Informational Note:

The default language for the application is set with this property key. This is a locale according to i18n and might consist of country, country_language or country_language_variant. If no default locale is defined, then the server default locale will be used.

The format of a local specifier is described here: http://java.sun.com/j2se/1.4.2/docs/api/java/util/Locale.html

5.2.37.2.  Supporting More Than One Language

Changes in dspace.cfg
Property:webui.supported.locale
Example Value:webui.supported.locale = en, de or perhapswebui.supported.locals = en, en_ca, de
Informational Note:All the locales that are supported by this instance of DSpace. Comma separated list.

The table above, if needed and is used will result in:

  • a language switch in the default header

  • the user will be enabled to choose his/her preferred language, this will be part of his/her profile

  • wording of emails

    • mails to registered users, e.g. alerting service will use the preferred language of the user

    • mails to unregistered users, e.g. suggest an item will use the language of the session

  • according to the language selected for the session, using dspace-admin Edit News will edit the news file of the language according to session

Related Files

If you set webui.supported.locales make sure that all the related additional files for each language are available. LOCALE should correspond to the locale set in webui.supported.locales, e. g.: for webui.supported.locales = en, de, fr, there should be:

  • [dspace-source]/dspace/modules/jspui/src/main/resources/Messages.properties

  • [dspace-source]/dspace/modules/jspui/src/main/resources/Messages_en.properties

  • [dspace-source]/dspace/modules/jspui/src/main/resources/Messages_de.properties

  • [dspace-source]/dspace/modules/jspui/src/main/resources/Messages_fr.properties

Files to be localized:

  • [dspace-source]/dspace/modules/jspui/src/main/resources/Messages_LOCALE.properties

  • [dspace-source]/dspace/config/input-forms_LOCALE.xml

  • [dspace-source]/dspace/config/default_LOCALE.license should be pure ascii

  • [dspace-source]/dspace/config/news-top_LOCALE.html

  • [dspace-source]/dspace/config/news-side_LOCALE.html

  • [dspace-source]/dspace/config/emails/change_password_LOCALE

  • [dspace-source]/dspace/config/emails/feedback_LOCALE

  • [dspace-source]/dspace/config/emails/internal_error_LOCALE

  • [dspace-source]/dspace/config/emails/register_LOCALE

  • [dspace-source]/dspace/config/emails/submit_archive_LOCALE

  • [dspace-source]/dspace/config/emails/submit_reject_LOCALE

  • [dspace-source]/dspace/config/emails/submit_task_LOCALE

  • [dspace-source]/dspace/config/emails/subscription_LOCALE

  • [dspace-source]/dspace/config/emails/suggest_LOCALE

  • [dspace]/webapps/jspui/help/collection-admin_LOCALE.html in html keep the jump link as original; must be copied to [dspace-source]/dspace/modules/jspui/src/main/webapp/help

  • [dspace]/webapps/jspui/help/index_LOCALE.html must be copied to [dspace-source]/dspace/modules/jspui/src/main/webapp/help

  • [dspace]/webapps/jspui/help/site-admin_LOCALE.html must be copied to [dspace-source]/dspace/modules/jspui/src/main/webapp/help

5.2.38. Item Mapper

Because the item mapper requires a primitive implementation of the browse system to be present, we simply need to tell that system which of our indexes defines the author browse (or equivalent) so that the mapper can list authors' items for mapping

Define the index name (from webui.browse.index) to use for displaying items by author.

Property:itemmap.author.index
Example Value:itemmap.author.index = author
Informational Note:If you change the name of your author browse field, you will also need to update this property key.

5.2.39. Display of Group Membership

Property:webui.mydspace.showgroupmembership
Example Value:webui.mydspace.showgroupmembership = false
Informational Note:To display group membership set to "true". If omitted, the default behavior is false.

5.2.40. SFX Server

SFX Server is an OpenURL Resolver.

Property:sfx.server.url
Example Value:sfx.server.url = http://sfx.myu.edu:8888/sfx?
Informational Note:SFX query is appended to this URL. If this property is commented out or omitted, SFX support is switched off.

5.2.41. Item Recommendation Setting

Property:webui.suggest.enable
Example Value:webui.suggest.enable = true
Informational Note:Show a link to the item recommendation page from item display page.
 
Property:webui.suggest.loggedinusers.only
Example Value:webui.suggest.loggedinusers.only = true
Informational Note:Enable only if the user is logged in. If this key commented out, the default value is false.

5.2.42. Controlled Vocabulary Settings

DSpace now supports controlled vocabularies to confine the set of keywords that users can use while describing items.

Property:webui.controlledvocabulary.enable
Example Value:webui.controlledvocabulary.enable = true
Informational Note:Enable or disable the controlled vocabulary add-on. WARNING: This feature is not compatible with WAI (it requires javascript to function).

The need for a limited set of keywords is important since it eliminates the ambiguity of a free description system, consequently simplifying the task of finding specific items of information.

The controlled vocabulary add-on allows the user to choose from a defined set of keywords organized in an tree (taxonomy) and then use these keywords to describe items while they are being submitted.

We have also developed a small search engine that displays the classification tree (or taxonomy) allowing the user to select the branches that best describe the information that he/she seeks.

The taxonomies are described in XML following this (very simple) structure:

<node id="acmccs98" label="ACMCCS98">
    <isComposedBy> 
        <node id="A." label="General Literature">
            <isComposedBy>
                <node id="A.0" label="GENERAL"/>
                <node id="A.1" label="INTRODUCTORY AND SURVEY"/>
            </isComposedBy>
        </node>
    </isComposedBy>
</node>

Your are free to use any application you want to create your controlled vocabularies. A simple text editor should be enough for small projects. Bigger projects will require more complex tools. You may use Protegé to create your taxonomies, save them as OWL and then use a XML Stylesheet (XSLT) to transform your documents to the appropriate format. Future enhancements to this add-on should make it compatible with standard schemas such as OWL or RDF.

In order to make DSpace compatible with WAI 2.0, the add-on is turned off by default (the add-on relies strongly on Javascript to function). It can be activated by setting the following property in dspace.cfg:

webui.controlledvocabulary.enable = true

New vocabularies should be placed in [dspace]/config/controlled-vocabularies/ and must be according to the structure described. A validation XML Schema can be downloaded here.

Vocabularies need to be associated with the correspondent DC metadata fields. Edit the file [dspace]/config/input-forms.xml and place a "vocabulary" tag under the "field" element that you want to control. Set value of the "vocabulary" element to the name of the file that contains the vocabulary, leaving out the extension (the add-on will only load files with extension "*.xml"). For example:

<field>
    <dc-schema>dc</dc-schema>
    <dc-element>subject</dc-element>
    <dc-qualifier></dc-qualifier>
    <!-- An input-type of twobox MUST be marked as repeatable -->
    <repeatable>true</repeatable>
    <label>Subject Keywords</label>
    <input-type>twobox</input-type>
    <hint> Enter appropriate subject keywords or phrases below. </hint>
    <required></required>
    <vocabulary [closed="false"]>nsi</vocabulary>
</field>

The vocabulary element has an optional boolean attribute closed that can be used to force input only with the javascript of controlled-vocabulary add-on. The default behavior (i.e. without this attribute) is as set closed="false". This allow the user also to enter the value in free way.

The following vocabularies are currently available by default:

  • nsi - nsi.xml - The Norwegian Science Index

  • srsc - srsc.xml - Swedish Research Subject Categories

5.2.43. XMLUI Specific Configuration

The DSpace digital repository supports two user interfaces: one based upon JSP technologies and the other based upon the Apache Cocoon framework. This section describes those configurations settings which are specific to the XMLUI interface based upon the Cocoon framework. (Prior to DSpace Release 1.5.1 XMLUI was referred to Manakin. You may still see references to "Manakin")

Property:xmlui.supported.locales
Example Value:xmlui.supported.locales = en, de
Informational Note:A list of supported locales for Manakin. Manakin will look at a user's browser configuration for the first language that appears in this list to make available to in the interface. This parameter is a comma separated list of Locales. All types of Locales country, country_language, country_language_variant. Note that if the appropriate files are not present (i.e. Messages_XX_XX.xml) then Manakin will fall back through to a more general language.
 
Property:xmlui.force.ssl
Example Value:xmlui.force.ssl = true
Informational Note:Force all authenticated connections to use SSL, only non-authenticated connections are allowed over plain http. If set to true, then you need to ensure that the 'dspace.hostname' parameter is set to the correctly.
 
Property:xmlui.user.registration
Example Value:xmlui.user.registration = true
Informational Note:Determine if new users should be allowed to register. This parameter is useful in conjunction with Shibboleth where you want to disallow registration because Shibboleth will automatically register the user. Default value is true.
 
Property:xmlui.user.editmetadata
Example Value:xmlui.user.editmetadata = true
Informational Note:Determines if users should be able to edit their own metadata. This parameter is useful in conjunction with Shibboleth where you want to disable the user's ability to edit their metadata because it came from Shibboleth. Default value is true.
 
Property:xmlui.user.assumelogon
Example Value:xmlui.user.assumelogon = true
Informational Note:Determine if super administrators (those whom are in the Administrators group) can login as another user from the "edit eperson" page. This is useful for debugging problems in a running dspace instance, especially in the workflow process. The default value is false, i.e., no one may assume the login of another user.
 
Property:xmlui.user.loginredirect
Example Value:xmlui.user.loginredirect = /profile
Informational Note:After a user has logged into the system, which url should they be directed? Leave this parameter blank or undefined to direct users to the homepage, or /profile for the user's profile, or another reasonable choice is /submissions to see if the user has any tasks awaiting their attention. The default is the repository home page.
 
Property:xmlui.theme.allowoverrides
Example Value:xmlui.theme.allowoverrides = false
Informational Note:Allow the user to override which theme is used to display a particular page. When submitting a request add the HTTP parameter "themepath" which corresponds to a particular theme, that specified theme will be used instead of the any other configured theme. Note that this is a potential security hole allowing execution of unintended code on the server, this option is only for development and debugging it should be turned off for any production repository. The default value unless otherwise specified is "false".
 
Property:xmlui.bundle.upload
Example Value:xmlui.bundle.upload = ORIGINAL, METADATA, THUMBNAIL, LICENSE, CC_LICENSE
Informational Note:Determine which bundles administrators and collection administrators may upload into an existing item through the administrative interface. If the user does not have the appropriate privileges (add and write) on the bundle then that bundle will not be shown to the user as an option.
 
Property:xmlui.community-list.render.full
Example Value:xmlui.community-list.render.full = true
Informational Note:On the community-list page should all the metadata about a community/collection be available to the theme. This parameter defaults to true, but if you are experiencing performance problems on the community-list page you should experiment with turning this option off.
 
Property:xmlui.community-list.cache
Example Value:xmlui.community-list.cache = 12 hours
Informational Note:Normally, Manakin will fully verify any cache pages before using a cache copy. This means that when the community-list page is viewed the database is queried for each community/collection to see if their metadata has been modified. This can be expensive for repositories with a large community tree. To help solve this problem you can set the cache to be assumed valued for a specific set of time. The downside of this is that new or editing communities/collections may not show up the website for a period of time.
 
Property:xmlui.bistream.mods
Example Value:xmlui.bistream.mods = true
Informational Note:Optionally, you may configure Manakin to take advantage of metadata stored as a bitstream. The MODS metadata file must be inside the "METADATA" bundle and named MODS.xml. If this option is set to 'true' and the bitstream is present then it is made available to the theme for display.
 
Property:xmlui.bitstream.mets
Example Value:xmlui.bitstream.mets = true
Informational Note:Optionally, you may configure Manakin to take advantage of metadata stored as a bitstream. The METS metadata file must be inside the "METADATA" bundle and named METS.xml. If this optino is set to "true" and the bitstream is present then it is made available to the theme for display.
 
Property:xmlui.google.analytics.key
Example Value:xmlui.google.analytics.key = UA-XXXXXX-X
Informational Note:If you would like to use google analytics to track general website statistics then use the following parameter to provide your analytics key. First sign up for an account at http://analytics.google.com, then create an entry for your repositories website. Google Analytics will give you a snipit of javascript code to place on your site, inside that snip it is your google analytics key usually found in the line: _uacct = "UA-XXXXXXX-X" Take this key (just the UA-XXXXXX-X part) and place it here in this parameter.
 
Property:xmlui.controlpanel.activity.max
Example Value:xmlui.controlpanel.activity.max = 250
Informational Note:Assign how many page views will be recorded and displayed in the control panel's activity viewer. The activity tab allows an administrator to debug problems in a running DSpace by understanding who and how their dspace is currently being used. The default value is 250.
 
Property:xmlui.controlpanel.activity.ipheader
Example Value:xmlui.controlpanel.activity.ipheader = X-Forward-For
Informational Note:Determine where the control panel's activity viewer recieves an events IP address from. If your DSpace is in a load balanced enviornment or otherwise behind a context-switch then you will need to set the paramater to the HTTP parameter that records the original IP address.

5.2.44. OAI-PMH Configuration and Activation

In the following sections, you will learn how to configure OAI-PMH and activate additional OAI-PMH crosswalks. The user is also referred to 9.2OAI-PMH Data Provider for greater depth details of the program.

5.2.44.1. OAI-PMH Configuration

Property:oai.didl.maxresponse
Example Value:oai.didle.maxresponse = 0
Informational Note:

Max response size for DIDL. This is the maximum size in bytes of the files you wish to enclose Base64 encoded in your responses, remember that the base64 encoding process uses a lot of memory. We recommend at most 200000 for answers of 30 records each on a 1 Gigabyte machine. Ultimately this will change to a streaming model and remove this restriction. Also please remember to allocate plenty of memory, at least 512 MB to your Tomcat.

Optional: DSpace uses 100 records as the limit for the oai responses. You can alter this by changing /[dspace-source]/dspace-oai/dspace-oai-api/src/main/java/org/dspace/app/oai/DSpaceOAICatalog.java to codify the declaration: private final int MAX_RECORDS = 100 to private final int MAX_RECORDS = 30

5.2.44.2. Activating Additional OAI-PMH Crosswalks

DSpace comes with an unqualified DC Crosswalk used in the default OAI-PMH data provider. There are also other Crosswalks bundled with the DSpace distribution which can be activated by editing one or more configuration files. How to do this for each available Crosswalk is described below. The DSpace source includes the following crosswalk plugins available for use with OAI-PMH:

  • mets - The manifest document from a DSpace METS SIP.

  • mods - MODS metadata, produced by the table-driven MODS dissemination crosswalk.

  • qdc - Qualfied Dublin Core, produced by the configurable QDC crosswalk. Note that this QDC does not include all of the DSpace "dublin core" metadata fields, since the XML standard for QDC is defined for a different set of elements and qualifiers.

OAI-PMH crosswalks based on Crosswalk Plugins are activated as follows:

  1. Ensure the crosswalk plugin has a lower-case name (possibly in addition to its upper-case name) in the plugin configuration.

  2. Add a line to the file config/templates/oaicat.properties of the form:

    Crosswalks. plugin_name =org.dspace.app.oai.PluginCrosswalk

    substituting the plugin's name, e.g. "mets" or "qdc"for plugin_name.

  3. Run the bin/install-configs script

  4. Restart your servlet container, e.g. Tomcat, for the change to take effect.

DIDL

By activating the DIDL provider, DSpace items are represented as MPEG-21 DIDL objects. These DIDL objects are XML documents that wrap both the Dublin Core metadata that describes the DSpace item and its actual bitstreams. A bitstream is provided inline in the DIDL object in a base64 encoded manner, and/or by means of a pointer to the bitstream. The data provider exposes DIDL objects via the metadataPrefix didl.

The crosswalk does not deal with special characters and purposely skips dissemination of the license.txt file awaiting a better understanding on how to map DSpace rights information to MPEG21-DIDL.

The DIDL Crosswalk can be activated as follows:

  1. Uncomment the oai.didl.maxresponse item in dspace.cfg

  2. Uncomment the DIDL Crosswalk entry from the config/templates/oaicat.properties file

  3. Run the bin/install-configs script

  4. Restart Tomcat

  5. Verify the Crosswalk is activated by accessing a URL such as http://mydspace/oai/request?verb=ListRecords&metadataPrefix=didl

5.2.45. Delegation Administration

(Authorization System Configuration)

Property:core.authorization.community-admin.create-subelement
Example Value:core.authorization.community-admin.create-subelement = true
Informational Note:

It is now possible to delegate the administration of Communities and Collections without the need of the Administrator Superuser. The delegation uses an "inherited" technique. A community admin will be a collection admin for all the collections within the community and a collection admin will be always an item admin for all the item owned by the collection. All the functions that are allowed to user with WRITE permission on an object will always allowed to be the ADMIN of the object (e.g. community/collection/admin will be always allowed to edit metadata of the object). The default will be "on" for all the configurations.

Community Administration: subcommunities and collections
core.authorization.community-admin.create-subelement
core.authorization.community-admin.delete-subelement
then administers the following keys:
core.authorization.community-admin.policies
core.authorization.community-admin.admin-group
Collections in the above community:
core.authorization.community-admin.collection.policies
core.authorization.community-admin.collection.template-item
core.authorization.community-admin.collection.submitters
core.authorization.community-admin.collection.workflows
core.authorization.community-admin.collection.admin-group
Item owned by Collections in the above Community:
core.authorization.community-admin.item.delete
core.authorization.community-admin.item.withdraw
core.authorization.community-admin.item.reinstatiate
core.authorization.community-admin.item.policies
And also these bundles:
core.authorization.community-admin.item.create-bitstream
core.authorization.community-admin.item.delete-bitstream
core.authorization.community-admin.item-admin.cc-license
Collection Administration:
core.authorization.collection-admin.policies
core.authorization.collection-admin.template-item
core.authorization.collection-admin.submitters
core.authorization.collection-admin.workflows
core.authorization.collection-admin.admin-group
Item owned by the above Collection
core.authorization.collection-admin.item.delete
core.authorization.collection-admin.item.withdraw
core.authorization.collection-admin.item.reinstatiate
core.authorization.collection-admin.item.policies
And also these bundles:
core.authorization.collection-admin.item.create-bitstream
core.authorization.collection-admin.item.delete-bitstream
core.authorization.collection-admin.item-admin.cc-license
Item Administration:
core.authorization.item-admin.policies
And also these bundles:
core.authorization.item-admin.create-bitstream
core.authorization.item-admin.delete-bitstream
core.authorization.item-admin.cc-license
[Note]

Oracle users should consult Chapter 4 Updating a DSpace Installation regarding the necessary database changes that need to take place.

5.2.46. Batch Metadata Editing

The following configurations allow the adminstrator extract from the DSpace database a set of records for editing by a metadata export. It provides an easier way of editing large collections.

Property:bulkedit.valueseparator
Example Value:bulkedit.valueseparator = ||
Informational noteThe delimiter used to separate values within a single field. For example, this will place the double pipe between multiple authors appearing in one record (Smith, William || Johannsen, Susan). This applies to any metadata field that appears more than once in a record. The user can change this to another character.
 
Property:bulkedit.fieldseparator
Example Value:bulkedit.fieldseparator = ,
Informational noteThe delimiter used to serarate fields (defaults to a comma for CSV). Again, the user could change it something like '$'. If you wish to use a tab, semicolon, or hash (#) sign as the delimiter, set the value to be tab, semicolon or hash.
bulkedit.fieldseparator = tab
 
Property:bulkedit.gui-item-limit
Example Value:bulkedit.gui-item-limit = 20
Informational noteWhen using the WEBUI, this sets the limit of the number of items allowed to be edited in one processing. There is no limit when using the CLI.
 
Property:bulkedit.ignore-on-export
Example Value:
bulkedit.ignore-on-export = dc.date.accessioned, \
                            dc.date.available, \
                            dc.date.updated, dc.description.provenance
Informational noteMetadata elements to exclude when exporting via the user interfaces, or when using the command line version and not using the -a (all) option.

5.2.47. Hiding Metadata

It is now possible to hide metadata from public consumption that is only avaialable to the Administrator.

Property:metadata.hide.dc.description.provenance
Example Value:metadata.hide.dc.description.provenance = true
Informational Note:

Hides the metadata in the property key above except to the administrator.

Fields named here are hidden in the following places UNLESS the logged-in user is an Administrator:

  1. XMLUI metadata XML view, and Item splash pages (long and short views).

  2. JSPUI Item splash pages

  3. OAI-PMH server, "oai_dc" format. (Note: Other formats are **not** affected.)

To designate a field as hidden, add a property here in the form:

metadata.hide.SCHEMA.ELEMENT.QUALIFIER = true

This default configuration hides the dc.description.provenance field, since that usually contains email addresses which ought to be kept private and is mainly of interest to administrators.

5.3. Optional or Advanced Configuration Settings

The following section explains how to configure either optional features or advanced features that are not necessary to make DSpace "out-of-the-box"

5.3.1. The Metadata Format and Bitstream Format Registries

The [dspace]/config/registries directory contains three XML files. These are used to load the initial contents of the Dublin Core Metadata registry and Bitstream Format registry and SWORD metadata registry. After the initial loading (performed by ant fresh_install above), the registries reside in the database; the XML files are not updated.

In order to change the registries, you may adjust the XML files before the first installation of DSpace. On an already running instance it is recommended to change bitstream registries via DSpace admin UI, but the metadata registries can be loaded again at any time from the XML files without difficult. The changes made via admin UI are not reflected in the XML files.

5.3.1.1. Metadata Format Registries

The default metadata schema is Dublin Core, so DSpace is distributed with a default Dublin Core Metadata Registry. Currently, the system requires that every item have a Dublin Core record.

There is a set of Dublin Core Elements, which is used by the system and should not be removed or moved to another schema, see Appendix: Default Dublin Core Metadata registry.

Note: altering a Metadata Registry has no effect on corresponding parts, e.g. item submission interface, item display, item import and vice versa. Every metadata element used in submission interface or item import must be registered before using it.

Note also that deleting a metadata element will delete all its corresponding values.

If you wish to add more metadata elements, you can do this in one of two ways. Via the DSpace admin UI you may define new metadata elements in the different available schemas. But you may also modify the XML file (or provide an additional one), and re-import the data as follows:

[dspace]/bin/dsrun org.dspace.administer.MetadataImporter -f [xml file]

The XML file should be structured as follows:

<dspace-dc-types>
    <dc-type>
        <schema>dc</schema>
        <element>contributor</element>
        <qualifier>advisor</qualifier>
        <scope_note>Use primarily for thesis advisor.</scope_note>
    </dc-type>
</dspace-dc-types>

5.3.1.2. Bitstream Format Registry

The bitstream formats recognized by the system and levels of support are similarly stored in the bitstream format registry. This can also be edited at install-time via [dspace]/config/registries/bitstream-formats.xml or by the administation Web UI. The contents of the bitstream format registry are entirely up to you, though the system requires that the following two formats are present:

  • Unknown

  • License

Deleting a format will cause any existing bitstreams of this format to be reverted to the unknown bitstream format.

5.3.2. XPDF Filter

These filters handle PDF resources with more sophisticated tools that can produce thumbnail images of PDF and 3D PDF files, and do much faster (and more complete) text extraction as well.

5.3.2.1. Installation

The following instructions are the "quick and dirty" method for installing the XPDF filter. It does not address any issues that may be affectively needed for Maven Modules or and POMs. This method below does not take into account the maintaining the DSpace installation or upgrading.

  1. Obtain a source distribution of DSpace 1.6, configure, and build it.

  2. Edit configuration lines in dspace.cfg

  3. Add -Pxpdf-mediafilter-support to maven build

  4. Build and install.

5.3.2.2. Install XPDF Tools

First, download the XPDF suite found at:http://www.foolabs.com/xpdf and install it on your server under /usr/local/bin.

The only tools you really need are:

  • pdftoppm

  • pdfinfo

  • pdftotext

5.3.2.3. Installation within DSpace

The user will be using Java™ Advanced Imaging Image I/O Tools.

[Caution]

For AIX, Sun support has the following: "JAI has native acceleration for the above but it also works in pure Java mode. So as long as you have an appropriate JDK for AIX (1.3 or later, I believe), you should be able to use it. You can download any of them, extract just the jars, and put those in your $CLASSPATH."

Fetch and install jai_imageio JAR

Download the jai_imageio library version 1.0_01 or 1.1 found athttps://jai-imageio.dev.java.net/binary-builds.html#Stable_builds .

Install it in your local Maven repository with the command (assuming the library is installed locally at /opt/facade/lib/jai_imageio.jar):

        mvn install:install-file  \
         -Dfile=/opt/facade/lib/jai_imageio.jar               \
         -DgroupId=com.sun.media                              \
         -DartifactId=jai_imageio                             \
         -Dversion=1.0_01                                     \
         -Dpackaging=jar                                      \
         -DgeneratePom=true
	
Edit DSpace Configuration

First, be sure there is a value for thumbnail.maxwidth and that it corresponds to the size you want for preview images for the UI, e.g.: (NOTE: this code doesn't pay any attention to thumbnail.maxheight but it's best to set it too so the other thumbnail filters make square images.)

        # maximum width and height of generated thumbnails
        thumbnail.maxwidth  300
        thumbnail.maxheight 300
        

Now, add the absolute paths of the XPDF tools you installed:

        xpdf.path.pdftotext = /var/local/bin/pdftotext
        xpdf.path.pdftoppm = /var/local/bin/pdftoppm
        xpdf.path.pdfinfo = /var/local/bin/pdfinfo
	

Also be sure the mediafilter configuration includes the new filters, e.g: (New sections are in bold)

        filter.plugins = \
         PDF Text Extractor, \
         PDF Thumbnail, \
         HTML Text Extractor, \
         Word Text Extractor, \
         JPEG Thumbnail
         plugin.named.org.dspace.app.mediafilter.FormatFilter = \
         org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor, \
         org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail, \
         org.dspace.app.mediafilter.HTMLFilter = HTML Text Extractor, \
         org.dspace.app.mediafilter.WordFilter = Word Text Extractor, \
         org.dspace.app.mediafilter.JPEGFilter = JPEG Thumbnail, \
         org.dspace.app.mediafilter.BrandedPreviewJPEGFilter = Branded Preview JPEG
	

#Configure each filter's input format(s)

          filter.org.dspace.app.mediafilter.XPDF2Thumbnail.inputFormats = Adobe PDF
          filter.org.dspace.app.mediafilter.XPDF2Text.inputFormats = Adobe PDF
          Add -Pxpdf-mediafilter-support to maven build
        

Edit the POM for the dspace-api module. Within the <dependencies> element, add this new element:

	%mvn -Pxpdf-mediafilter-support package
	
Build and Install

Follow the usual DSpace installation/update procedure (mvn package and then ant -Dconfig=etc. ...)

These instructions were retrieved from "http://libstaff.mit.edu/facade/index.php/DSpace_PDF_Media_Filters"

5.3.3. Creating a new Media/Format Filter

5.3.3.1. Creating a simple Media Filter

New Media Filters must implement the org.dspace.app.mediafilter.FormatFilter interface. More information on the methods you need to implement is provided in the FormatFilter.java source file. For example:


             public class MySimpleMediaFilter implements
	FormatFilter

          

Alternatively, you could extend the org.dspace.app.mediafilter.MediaFilter class, which just defaults to performing no pre/post-processing of bitstreams before or after filtering.


             public class MySimpleMediaFilter extends
	MediaFilter

          

You must give your new filter a "name", by adding it and its name to the plugin.named.org.dspace.app.mediafilter.FormatFilter field in dspace.cfg. In addition to naming your filter, make sure to specify its input formats in the filter.<class path>.inputFormats config item. Note the input formats must match the short description field in the Bitstream Format Registry (i.e. bitstreamformatregistry table).


             plugin.named.org.dspace.app.mediafilter.FormatFilter = \
	org.dspace.app.mediafilter.MySimpleMediaFilter = My Simple Text
	Filter, \ ...
	filter.org.dspace.app.mediafilter.MySimpleMediaFilter.inputFormats =
	Text

          

WARNING: If you neglect to define the inputFormats for a particular filter, the MediaFilterManager will never call that filter, since it will never find a bitstream which has a format matching that filter's input format(s).

If you have a complex Media Filter class, which actually performs different filtering for different formats (e.g. conversion from Word to PDF and conversion from Excel to CSV), you should define this as a Dynamic / Self-Named Format Filter.

5.3.3.2. Creating a Dynamic or "Self-Named" Format Filter

If you have a more complex Media/Format Filter, which actually performs multiple filtering or conversions for different formats (e.g. conversion from Word to PDF and conversion from Excel to CSV), you should have define a class which implements the FormatFilter interface, while also extending the SelfNamedPlugin class. For example:


             public class MyComplexMediaFilter extends
	SelfNamedPlugin implements FormatFilter

          

Since SelfNamedPlugins are self-named (as stated), they must provide the various names the plugin uses by defining a getPluginNames() method. Generally speaking, each "name" the plugin uses should correspond to a different type of filter it implements (e.g. "Word2PDF" and "Excel2CSV" are two good names for a complex media filter which performs both Word to PDF and Excel to CSV conversions).

Self-Named Media/Format Filters are also configured differently in dspace.cfg. Below is a general template for a Self Named Filter (defined by an imaginary MyComplexMediaFilter class, which can perform both Word to PDF and Excel to CSV conversions):


             #Add to a list of all Self Named filters
	plugin.selfnamed.org.dspace.app.mediafilter.FormatFilter = \
	org.dspace.app.mediafilter.MyComplexMediaFilter #Define input formats
	for each "named" plugin this filter implements
	filter.org.dspace.app.mediafilter.MyComplexMediaFilter.Word2PDF.inputF
ormats = Microsoft Word
	filter.org.dspace.app.mediafilter.MyComplexMediaFilter.Excel2CSV.input
Formats = Microsoft Excel

          

As shown above, each Self-Named Filter class must be listed in the plugin.selfnamed.org.dspace.app.mediafilter.FormatFilter item in dspace.cfg. In addition, each Self-Named Filter must define the input formats for each named plugin defined by that filter. In the above example the MyComplexMediaFilter class is assumed to have defined two named plugins, Word2PDF and Excel2CSV. So, these two valid plugin names ("Word2PDF" and "Excel2CSV") must be returned by the getPluginNames() method of the MyComplexMediaFilter class.

These named plugins take different input formats as defined above (see the corresponding inputFormats setting). WARNING: If you neglect to define the inputFormats for a particular named plugin, the MediaFilterManager will never call that plugin, since it will never find a bitstream which has a format matching that plugin's input format(s).

For a particular Self-Named Filter, you are also welcome to define additional configuration settings in dspace.cfg. To continue with our current example, each of our imaginary plugins actually results in a different output format (Word2PDF creates "Adobe PDF", while Excel2CSV creates "Comma Separated Values"). To allow this complex Media Filter to be even more configurable (especially across institutions, with potential different "Bitstream Format Registries"), you may wish to allow for the output format to be customizable for each named plugin. For example:


             #Define output formats for each named plugin
	filter.org.dspace.app.mediafilter.MyComplexMediaFilter.Word2PDF.output
Format = Adobe PDF
	filter.org.dspace.app.mediafilter.MyComplexMediaFilter.Excel2CSV.outpu
tFormat = Comma Separated Values

          

Any custom configuration fields in dspace.cfg defined by your filter are ignored by the MediaFilterManager, so it is up to your custom media filter class to read those configurations and apply them as necessary. For example, you could use the following sample Java code in your MyComplexMediaFilter class to read these custom outputFormat configurations from dspace.cfg :


             //get "outputFormat" configuration from dspace.cfg
	String outputFormat =
	ConfigurationManager.getProperty(MediaFilterManager.FILTER_PREFIX +
	"." + MyComplexMediaFilter.class.getName() + "." +
	this.getPluginInstanceName() + ".outputFormat");

          

5.3.4. Configuration Files for Other Applications

To ease the hassle of keeping configuration files for other applications involved in running a DSpace site, for example Apache, in sync, the DSpace system can automatically update them for you when the main DSpace configuration is changed. This feature of the DSpace system is entirely optional, but we found it useful.

The way this is done is by placing the configuration files for those applications in [dspace]/config/templates, and inserting special values in the configuration file that will be filled out with appropriate DSpace configuration properties. Then, tell DSpace where to put filled-out, 'live' version of the configuration by adding an appropriate property to dspace.cfg, and run [dspace]/bin/install-configs.

Take the apache13.conf file as an example. This contains plenty of Apache-specific stuff, but where it uses a value that should be kept in sync across DSpace and associated applications, a 'placeholder' value is written. For example, the host name:

ServerName @@dspace.hostname@@

The text @@dspace.hostname@@ will be filled out with the value of the dspace.hostname property in dspace.cfg. Then we decide where we want the 'live' version, that is, the version actually read in by Apache when it starts up, will go.

Let's say we want the live version to be located at /opt/apache/conf/dspace-httpd.conf. To do this, we add the following property to dspace.cfg so DSpace knows where to put it:

config.template.apache13.conf = /opt/apache/conf/dspace-httpd.conf

Now, we run [dspace]/bin/install-configs. This reads in [dspace]/config/templates/apache13.conf, and places a copy at /opt/apache/conf/dspace-httpd.conf with the placeholders filled out.

So, in /opt/apache/conf/dspace-httpd.conf, there will be a line like:

ServerName dspace.myu.edu

The advantage of this approach is that if a property like the hostname changes, you can just change it in dspace.cfg and run install-configs, and all of your tools' configuration files will be updated.

However, take care to make all your edits to the versions in [dspace]/config/templates! It's a wise idea to put a big reminder at the top of each file, since someone might unwittingly edit a 'live' configuration file which would later be overwritten.

5.3.5.  Configuring Usage Instrumentation Plugins

A usage instrumentation plugin is configured as a singleton plugin for the abstract class org.dspace.app.statistics.AbstractUsageEvent.

5.3.5.1.  The Passive Plugin

The Passive plugin is provided as the class org.dspace.app.statistics.PassiveUsageEvent. It absorbs events without effect. Use the Passive plugin when you have no use for usage event postings. This is the default if no plugin is configured.

5.3.5.2.  The Tab File Logger Plugin

The Tab File Logger plugin is provided as the class org.dspace.app.statistics.UsageEventTabFileLogger. It writes event records to a file in tab-separated column format. If left unconfigured, an error will be noted in the DSpace log and no file will be produced. To specify the file path, provide an absolute path as the value for usageEvent.tabFileLogger.file in dspace.cfg.

5.3.5.3.  The XML Logger Plugin

The XML Logger plugin is provided as the class org.dspace.app.statistics.UsageEventXMLLogger. It writes event records to a file in a simple XML-like format. If left unconfigured, an error will be noted in the DSpace log and no file will be produced. To specify the file path, provide an absolute path as the value for usageEvent.xmlLogger.file in dspace.cfg.

5.3.6. SWORD Configuration

SWORD (Simple Web-service Offering Repository Deposit) is a protocol that allows the remote deposit of items into repositories. DSpace implements the SWORD protocol via the 'sword' web application. The version of SWORD currently supported by DSpace is 1.3. The specification and further information can be downloaded fromhttp://swordapp.org.

SWORD is based on the Atom Publish Protocol and allows service documents to be requested which describe the structure of the repository, and packages to be deposited.

Properties:sword.mets-ingester.package-ingester
Example Value:sword.mets-ingester.package-ingester = METS
Informational Note:

The property key tell the SWORD METS implementation which package ingester to use to install deposited content. This should refer to one of the classes configured for:

                        plugin.named.org.dspace.content.packager.PackageIngester

The value of sword.mets-ingester.package-ingester tells the system which named plugin for this interface should be used to ingest SWORD METS packages.

 
Properties:mets.submission.crosswalk.EPDCX
Example Value:mets.submission.crosswalk.EPDCX = SWORD
Informational Note:Define the metadata type EPDCX (EPrints DC XML)to be handled by the SWORD crosswalk configuration.
 
Properties:crosswalk.submission.SWORD.stylesheet
Example Value:crosswalk.submission.SWORD.stylesheet = crosswalks/sword-swap-ingest.xsl
Informational Note:Define the stylesheet which will be used by the self-named XSLTIngestionCrosswalk class when asked to load the SWORD configuration (as specified above). This will use the specified stylesheet to crosswalk the incoming SWAP metadata to the DIM format for ingestion.
 
Properties:sword.deposit.url
Example Value:sword.deposit.url = http://www.myu.ac.uk/sword/deposit
Informational Note:The base URL of the SWORD deposit. This is the URL from which DSpace will construct the deposit location urls for collections. The default is {dspace.url}/sword/deposit. In the event that you are not deploying DSpace as the ROOT application in the servlet container, this will generate incorrect URLs, and you should override the functionality by specifying in full as shown in the example value.
 
Properties:sword.servicedocument.url
Example Value:sword.servicedocument.url = http://www.myu.ac.uk/sword/servicedocument
Informational Note:The base URL of the SWORD service document. This is the URL from which DSpace will construct the service document location urls for the site, and for individual collections. The default is {dspace.url}/sword/servicedocument . In the event that you are not deploying DSpace as the ROOT application in the servlet container, this will generate incorrect URLs, and you should override the functionality by specifying in full as shown in the example value.
 
Properties:sword.media-link.url
Example Value:sword.media-link.url = http://www.myu.ac.uk/sword/media-link
Informational Note:The base URL of the SWORD media links. This is the URL which DSpace will use to construct the media link urls for items which are deposited via sword. The default is {dspace.url}/sword/media-link. In the event that you are not deploying DSpace as the ROOT application in the servlet container, this will generate incorrect URLs, and you should override the functionality by specifying in full as shown in the example value.
 
Properties:sword.generator.url
Example Value:sword.generator.url = http://www.dspace.org/ns/sword/1.3.1
Informational Note:The URL which identifies the sword software which provides the sword interface. This is the URL which DSpace will use to fill out the atom:generator element of its atom documents. The default is: http://www.dspace.org/ns/sword/1.3.1. If you have modified your sword software, you should change this URI to identify your own version. If you are using the standard dspace-sword module you will not, in general, need to change this setting.
 
Properties:sword.updated.field
Example Value:sword.updated.field = dc.date.updated
Informational Note:The metadata field in which to store the updated date for items deposited via SWORD.
 
Properties:sword.slug.field
Example Value:sword.slug.field = dc.identifier.slug
Informational Note:The metadata field in which to store the value of the slug header if it is supplied.
 
Properties:
sword.accept-packaging.METSDSpaceSIP.identifier
sword.accept-packaging.METSDSpaceSIP.q
Example Value:(See example below)
sword.accept-packaging.METSDSpaceSIP.identifier = http://purl.org/net/sword-types/METSDSpaceSIP
sword.accept-packaging.METSDSpaceSIP.q = 1.0
Informational Note:The accept packaging properties, along with their associated quality values where appropriate. This is a Global Setting; these will be used on all DSpace collections
 
Properties:
sword.accept-packaging.[handle].METSDSpaceSIP.identifier
sword.accept-packaging.[handle].METSDSpaceSIP.q
Example Value:(See example below)
sword.accept-packaging.[handle].METSDSpaceSIP.identifier = http://purl.org/net/sword-types/METSDSpaceSIP
sword.accept-packaging.[handle].METSDSpaceSIP.q = 1.0
Informational Note:Collection Specific settings: these will be used on the collections with the given handles.
 
Properties:sword.expose-items
Example Value:sword.expose-items = false
Informational Note:Should the server offer up items in collections as sword deposit targets. This will be effected by placing a URI in the collection description which will list all the allowed items for the depositing user in that collection on request. NOTE: this will require an implementation of deposit onto items, which will not be forthcoming for a short while.
 
Properties:sword.expose-communities
Example Value:sword.expose-communities = false
Informational Note:Should the server offer as the default the list of all Communities to a Service Document request. If false, the server will offer the list of all collections, which is the default and recommended behavior at this stage. NOTE: a service document for Communities will not offer any viable deposit targets, and the client will need to request the list of Collections in the target before deposit can continue.
 
Properties:sword.max-upload-size
Example Value:sword.max-upload-size = 0
Informational Note:The maximum upload size of a package through the sword interface, in bytes. This will be the combined size of all the files, the metadata and any manifest data. It is NOT the same as the maximum size set for an individual file upload through the user interface. If not set, or set to 0, the sword service will default to no limit.
 
Properties:sword.keep-original-package
Example Value:sword.keep-original-package = true
Informational Note:Whether or not DSpace should store a copy of the original sword deposit package. NOTE: this will cause the deposit process to run slightly slower, and will accelerate the rate at which the repository consumes disk space. BUT, it will also mean that the deposited packages are recoverable in their original form. It is strongly recommended, therefore, to leave this option turned on. When set to "true", this requires that the configuration option upload.temp.dir above is set to a valid location.
 
Properties:sword.bundle.name
Example Value:sword.bundle.name = SWORD
Informational Note:The bundle name that SWORD should store incoming packages under if sword.keep-original-package is set to true. The default is "SWORD" if not value is set
 
Properties:sword.identify-version
Example Value:sword.identify-version = true
Informational Note:Should the server identify the sword version in a deposti response. It is recommended to leave this unchanged.
 
Properties:sword.on-behalf-of.enable
Example Value:sword.on-behalf-of.enable = true
Informational Note:Should mediated deposit via sowrd be supported. If enabled, this will allow users to deposit content packages on behalf of other users.
 
Properties:
plugin.named.org.dspace.sword.SWORDingester
Example Value:(See example below)
plugin.named.org.dspace.sword.SWORDIngester = \
  org.dspace.sword.SWORDMETSIngester = http://purl.org/net/sword-types/METSDSpaceSIP \
  org.dspace.sword.SimpleFileIngester = SimpleFileIngester
Informational Note:Configure the plugins to process incoming packages. The form of this configuration is as per the Plugin Manager's Named Plugin documentation: plugin.named.[interface] = [implementation] = [package format identifier] \ . Package ingesters should implement the SWORDIngester interface, and will be loaded when a package of the format specified above in: sword.accept-packaging.[package format].identifier = [package format identifier] is received. In the event that this is a simple file deposit, with no package format, then the class named by "SimpleFileIngester" will be loaded and executed where appropriate. This case will only occur when a single file is being deposited into an existing DSpace Item.
 
Properties:sword.accepts
Example Value:sword.accepts = application/zip, foo/bar
Informational Note:A comma separated list of MIME types that SWORD will accept.

5.3.7. OpenSearch Support

OpenSearch is a small set of conventions and documents for describing and using "serach enginges", meaning any service that returns a set of results for a query. See extensive description in the Business Layer section of the documentation.

Please note that for result data formatting, OpenSearch uses Syndication Feed Settings (RSS). So, even if Syndication Feeds are not enable, they must be configured to enable OpenSearch. OpenSearch uses all the configuration properties for DSpace RSS to determine the mapping of metadata fields to feed fields. Note that a new field for authors has been added (used in Atom format only).

Property:websvc.opensearch.enable
Example Value:websvc.opensearch.enable = false
Informational Note:Whether or not OpenSearch is enabled. By default, the feature is disabled. Change the property key to 'ture' to enable.
 
Property:websvc.opensearch.uicontext
Example Value:websvc.opensearch.uicontext = simple-search
Informational Note:Context for HTML request URLs. Change only for non-standard servlet mapping.
 
Property:websvc.opensearch.svccontext
Example Value:websvc.opensearch.svccontext = open-search/
Informational Note:Context for RSS/Atom request URLs. Change only for non-standard servlet mapping.
 
Property:websvc.opensearch.autolink
Example Value:websvc.opensearch.autolink = true
Informational Note:Present autodiscovery link in every page head.
 
Property:websvc.opensearch.validity
Example Value:websvc.opensearch.validity = 48
Informational Note:Number of hours to retain results before recalculating. This applies to the Manakin interface only.
 
Property:websvc.opensearch.shortname
Example Value:websvc.opensearch.shortname = DSpace
Informational Note:A short name used in browsers for search service. It should be sixteen (16) or fewer characters.
 
Property:websvc.opensearch.longname
Example Value:websvc.opensearch.longname = ${dspace.name}
Informational Note:A longer name up to 48 characters.
 
Property:websvc.opensearch.description
Example Value:websvc.opensearch.description = ${dspace.name} DSpace repository
Informational Note:Brief service description
 
Property:websvc.opensearch.faviconurl
Example Value:websvc.opensearch.faviconurl = http://www.dspace.org/images/favicon.ico
Informational Note:Location of favicon for service, if any. They must by 16 x 16 pixels. You can profide your own local favicon instead of the default.
 
Property:websvc.opensearch.samplequery
Example Value:websvc.opensearch.samplequery = photosynthesis
Informational Note:Sample query. This should return results. You can replace the sample query with search terms that should actually yield results in your repository.
 
Property:websvc.opensearch.tags
Example Value:websc.opensearch.tags = IR DSpace
Informational Note:Tags used to describe search service.
 
Property:websvc.opensearch.formats
Example Value:websvc.opensearch.formats = html,atom,rss
Informational Note:Result formats offered. Use one or more comma-separated from the list: html, atom, rss. Please note that html is requred for autodiscovery in browsers to function, and must be the first in the list if present.

5.3.8. Embargo

It is possible now to configure a DSpace instance to have an "Embargo" feature uses for thesis and dissertations.

Property:embargo.field.terms
Example Value:embargo.field.terms = SCHEMA.ELEMENT.QUALIFIER
Informational Note:DC metadata field to hold the user-supplied embargo terms
 
Property:embargo.field.lift
Example Value:embargo.field.lift = SCHEMA.ELEMENT.QUALIFIER
Informational Note:DC metadata field to hold computed "lift date" of embargo
 
Property:embargo.terms.open
Example Value:embargo.terms.open = forever
Informational Note:The string in terms field to indicate indefinite embargo
 
Property:plugin.single.org.dspace.embargo.EmbargoSetter
Example Value:plugin.single.org.dspace.embargo.EmbargoSetter = CLASSNAME
Informational Note:Implementation of embargo setter plugin
Property:plugin.single.org.dspace.embargo.EmbargoLifter
Example Value:plugin.single.org.dspace.embargo.EmbargoLifter = org.dspace.embargo.DefaultEm
Informational Note:Implementation of embargo lifter plugin

Remember that you need to replace SCHEMA.ELEMENT.QUALIFIER with a real metadata field. Additionally, you need to replace the CLASSNAME with a properly impleented plugin.


Licensed under a Creative Commons Attribution 3.0 United States License