Building a Biodiversity Content Management System for Science, Education, and Outreach

We describe the system architecture and data template design for the Animal Diversity Web (http://www.animaldiversity.org), an online natural history resource serving three audiences: 1) the scientific community, 2) educators and learners, and 3) the general public. Our architecture supports highly scalable, flexible resource building by combining relational and object-oriented databases. Content resources are managed separately from identifiers that relate and display them. Websites targeting different audiences from the same database handle large volumes of traffic. Content contribution and legacy data are robust to changes in data models. XML and OWL versions of our data template set the stage for making ADW data accessible to other systems.
Parr, C.S., R. Espinosa, T. Dewey, G. Hammond and P. Myers. Data Science Journal (2005). Articles>Content Management>Scientific Communication
The Design and Development of a Project-Oriented Information System

This paper discusses the idea of building a project-oriented information system on top of a specialized information database. The system aims to give researchers tools for using Internet resources effectively in the course of their research. Based on this idea, a web-based project-oriented information system was constructed. The paper systematically describes the design and development process of the system and gives examples of using it to obtain useful information and suggestions for specific projects. Based on our discussion and use of the system, we believe that building a project-oriented information system can help researchers with their research projects.
Zhao, Yuehong, Chao Liu, Hao Wen, Hezhen Zhang and Zhihong Xu. Data Science Journal (2003). Articles>Project Management>User Centered Design
Designating User Communities for Scientific Data: Challenges and Solutions

Defining a 'designated user community' for a data collection is essential to good scientific data stewardship. It enables data managers to determine what information is necessary to ensure the usability of the data now and into the future. It helps managers present and enable access to the data and may determine the format of the data. However, defining a community is difficult, and it is impossible to predict how the use of a data collection may change over time. This creates a series of data management problems for data stewards that may be mitigated by a set of best practices.
Parsons, M.A. and R. Duerr. Data Science Journal (2005). Articles>Communication>Scientific Communication>Collaboration
Digital Object Identifiers for Scientific Data

The Digital Object Identifier (DOI) is a system for identifying content objects in the digital environment. DOIs are names assigned to any entity for use on Internet digital networks. Scientific data sets may be identified by DOIs, and several efforts are now underway in this area. This paper outlines the underlying architecture of the DOI system and describes two such efforts, which apply DOIs to content objects of scientific data.
Paskin, Norman. Data Science Journal (2005). Articles>Publishing>Standards>Online
The Essentials of a Database Quality Process

Many steps are involved in the process of turning an initial concept for a database into a finished product that meets the needs of its user community. In this paper, we describe those steps in the context of a four-phase process with particular emphasis on the quality-related issues that need to be addressed in each phase to ensure that the final product is a high quality database. The basic requirements for a successful database quality process are presented with specific examples drawn from experience gained in the Standard Reference Data Program at the National Institute of Standards and Technology.
Blakeslee, Dorothy M. and John Rumble, Jr. Data Science Journal (2003). Articles>Information Design>Databases>Usability
Knowledge Management and Life Long Education in Science

In 1998, ENEA, the Italian National Agency for New Technologies, Energy and the Environment, launched an e-learning platform with the mission of sharing scientific knowledge with everyone, not just workers but also students and the unemployed, in order to use its research results to support competitiveness and sustainable development. In 6 years, more than 20,000 users have followed one or more of the 46 online courses. Many agreements with schools, universities, and private and public training organisations are now under way to improve the dissemination of scientific knowledge and to build an open database of scientific learning objects that anyone can use.
Moreno, Anna and Sergio Grande. Data Science Journal (2005). Articles>Knowledge Management>Education>Scientific Communication
Materials Data on the Internet

The availability of the Internet has provided unprecedented opportunities for both data compilers and users. With respect to materials data, this paper explores the following questions: How do we know what is available? How can data be accessed, interpreted, and exchanged? What novel modes of presentation are now available? What organizations are active in this field, and what are their programs? What improvements are needed? Where do we go from here, and how? Examples are given of specific materials databases available on the Internet from a variety of materials data fields, e.g. fundamental data, engineering design properties, environmental data, and materials safety data. While there is no question that large and widely varied bodies of data are accessible on the Internet, significant improvements are needed promptly. The paper concludes by summarizing these problems and possible means for their alleviation.
Westbrook, J.H. Data Science Journal (2003). Articles>Scientific Communication>Online>Engineering
Overview of the IMSA Project, A Patient-Oriented Information System

This paper presents an overview of the IMSA application, a patient-oriented medical information system. IMSA stands for Interactive Multimedia System for Auto-medication and aims to provide a health-care Internet tool for the end-user. The system provides an environment that integrates on-line health information, medical and pharmaceutical databases, and a knowledge-based system for medical diagnosis. The implementation process focuses on cognitive science, knowledge representation, and human-computer interaction.
Curé, Oliver. Data Science Journal (2002). Articles>Scientific Communication>User Centered Design
Heterogeneous research environments, interests, and locations do not necessarily coincide; thus, email has hitherto been the primary method of communication among researchers. This article proposes a novel unified, polythematic, real-time, synergistic data telecommunication system with peer-reviewed, bidirectional fuzzy feedback for research scientists. The system is intended to facilitate scientific information exchange via the extensible markup language (XML) on multiple scientific topics, e.g. in mathematics, physics, biology, and chemistry.
Petratos, Panagiotis. Data Science Journal (2003). Articles>Knowledge Management>Scientific Communication>Collaboration
Preserving the Positive Functions of the Public Domain in Science

Science has advanced in part because data and scientific methodologies have traditionally not been subject to intellectual property protection. In recent years, intellectual property has played a greater role in scientific work. While intellectual property rights may have a positive role to play in some fields of science, so does the public domain. This paper will discuss some of the positive functions of the public domain and ways in which certain legal developments may negatively impact the public domain. It suggests some steps that scientists can take to preserve the positive functions of the public domain for science.
Samuelson, Pamela. Data Science Journal (2003). Articles>Intellectual Property>Copyright>Scientific Communication
Promoting Access to Public Research Data for Scientific, Economic, and Social Development

Access to and sharing of data are essential for the conduct and advancement of science. This article argues that publicly funded research data should be openly available to the maximum extent possible. To seize upon advancements in cyberinfrastructure and the explosion of data in a range of scientific disciplines, this access to and sharing of publicly funded data must be advanced within an international framework, beyond technological solutions. The authors, members of an OECD Follow-up Group, present their research findings, based closely on their report to OECD, on key issues in data access, as well as the operating principles and management aspects necessary for successful data access regimes.
Arzberger, P., P. Schroeder, A. Beaulieu, G. Bowker, K. Casey, L. Laaksonen, D. Moorman, P. Uhlir and P. Wouters. Data Science Journal (2004). Articles>Publishing>Research>Scientific Communication
Scholarly Information Architecture, 1989-2015

If we were to start from scratch today to design a quality-controlled archive and distribution system for scientific and technical information, it could take a very different form from what has evolved in the past decade from pre-existing print infrastructure. Ultimately, we might expect some form of global knowledge network for research communications. Over the next decade, there are many technical and non-technical issues to address along the way, everything from identifying optimal formats and protocols for rendering, indexing, linking, querying, accessing, mining, and transmitting the information, to identifying sociological, legal, financial, and political obstacles to realization of ideal systems. What near-term advances can we expect in automated classification systems, authoring tools, and next-generation document formats to facilitate efficient data mining and long-term archival stability? How will the information be authenticated and quality controlled? What differences should be expected in the realization of these systems for different scientific research fields? Can recent technological advances provide not only more efficient means of accessing and navigating the information, but also more cost-effective means of authentication and quality control? Relevant experiences from open electronic distribution of research materials in physics and related disciplines during the past decade are used to illuminate these questions, and some of their implications for proposals to improve the implementation of peer review are then discussed.
Ginsparg, P. Data Science Journal (2004). Articles>Information Design>Research
The fact that many decisions need a combination of information sources makes easy integration of geospatial data an important data usability issue. Our vision is to achieve automated just-in-time integration. As a foundation, we present a system architecture with distributed data and services. Existing and evolving standards and technologies fitting into this architecture are presented along with their scope and shortcomings. A major point is the appropriate definition of data and operation semantics. Further research is needed here to make the automatic formation of service chains for data integration possible.
Riedemann, Catharina and Christian Timm. Data Science Journal (2003). Articles>Information Design>XML>Geography