Not logged in
PANGAEA.
Data Publisher for Earth & Environmental Science

Scherer, Maximilian; Bernard, Jürgen; Schreck, Tobias (2011): Reference list of sources used for two experimental data files dataBSRN and dataMixed. PANGAEA, https://doi.org/10.1594/PANGAEA.756307

Always quote above citation when using data! You can download the citation in several formats below.

RIS CitationBibTeX Citation

Abstract:
Increasing amounts of data is collected in most areas of research and application. The degree to which this data can be accessed, analyzed, and retrieved, is a decisive in obtaining progress in fields such as scientific research or industrial production. We present a novel methodology supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, our methods are able to describe two-dimensional functional dependencies in research data, e.g. the relationship between ination and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models to describe the data mathematically. We denote this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method on considerable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.
Related to:
Scherer, Maximilian; Bernard, Jürgen; Schreck, Tobias (2011): Retrieval and exploratory search in multivariate research data repositories using regressional features. ACM/IEEE Joint Conference on Digital Libraries, https://doi.org/10.1145/1998076.1998144
Parameter(s):
#NameShort NameUnitPrincipal InvestigatorMethod/DeviceComment
1ExperimentExp
2Author(s)Author(s)
3Year of publicationYear
4TitleTitle
5Persistent IdentifierPersistent Identifier
Size:
39400 data points

Download Data

Download dataset as tab-delimited text (use the following character encoding: )

View dataset as HTML (shows only first 2000 rows)