Search Results

Advanced search parameters have been applied.
open access

Nuclear data and measurements series: Some comments on the effects of long-range correlations in covariance matrices for nuclear data

Description: Attention is called to the considerable sensitivity of many uncertainty calculations to the magnitude of the long-ranged correlations which appear in covariance matrices. If such correlations do exist, they must be included in order to properly assess the impact of the uncertainties in the data. If, however, certain assumed long-range correlations are unrealistic, then analyses involving such correlation information are almost certain to produce misleading results. The issue is discussed in general terms, and its importance is illustrated by examples based in part on recent work from this laboratory. Some practical suggestions are offered for dealing with the matter of correlations in instances where the available information is incomplete. 23 refs., 2 figs., 1 tab.
Date: March 1, 1987
Creator: Smith, D. L.
Partner: UNT Libraries Government Documents Department
open access

Understanding the Knowledge, Skills, and Abilities (KSAs) of Data Professionals in United States Academic Libraries

Description: This study applies the knowledge, skills, and abilities (KSA) framework for eScience professionals to data service positions in academic libraries. Understanding the KSAs needed to provide data services is of crucial concern. The current study looks at KSAs of data professionals working in the United States academic libraries. An exploratory sequential mixed method design was adopted to discover the KSAs. The study was divided into two phases, a qualitative content analysis of 260 job advertisements for data professionals for Phase 1, and distribution of a self-administered online survey to data professionals working in academic libraries research data services (RDS) for Phase 2. The discovery of the KSAs from the content analysis of 260 job ads and the survey results from 167 data professionals were analyzed separately, and then Spearman rank order correlation was conducted in order to triangulate the data and compare results. The results from the study provide evidence on what hiring managers seek through job advertisements in terms of KSAs and which KSAs data professionals find to be important for working in RDS. The Spearman rank order correlation found strong agreement between job advertisement KSAs and data professionals perceptions of the KSAs.
Date: December 2021
Creator: Khan, Hammad Rauf
Partner: UNT Libraries

The Life-Changing Magic of OpenRefine: The Open-Source Art of Data Decluttering and Organizing

Description: Presentation for the 2017 UNT Open Access Symposium. This presentation provides an overview of a workshop on using OpenRefine to make sense of data, standardize, correct, and rearrange data elements, automate tedious editing, and prepare data for export to other platforms for a variety of uses.
Date: May 18, 2017
Creator: Feustle, Maristella
Partner: UNT Music Library
open access

An Introduction to Data Science

Description: This book provides non-technical readers with a gentle introduction to essential concepts and activities of data science. For more technical readers, the book provides explanations and code for a range of interesting applications using the open source R language for statistical computing and graphics"--Resource home page.
Date: 2012
Creator: Stanton, Jeffrey M.
Partner: UNT Libraries
open access

RADIX 95n: Binary-to-Text Data Conversion

Description: This paper presents Radix 95n, a binary to text data conversion algorithm. Radix 95n (base 95) is a variable length encoding scheme that offers slightly better efficiency than is available with conventional fixed length encoding procedures. Radix 95n advances previous techniques by allowing a greater pool of 7-bit combinations to be made available for 8-bit data translation. Since 8-bit data (i.e. binary files) can prove to be difficult to transfer over 7-bit networks, the Radix 95n conversion technique provides a way to convert data such as compiled programs or graphic images to printable ASCII characters and allows for their transfer over 7-bit networks.
Date: August 1991
Creator: Jones, Greg, 1963-2017.
Partner: UNT Libraries
open access

Gemini Series Experiment Data Reduction and Storage Techniques

Description: The presentation covers data formats expected from Gemini experiments; data quick look vs. in-depth analysis; iPDV object-oriented data storage; iPDV's traceability of analysis results; optimizing object memory usage in iPDV; and long-term archival of data objects by iPDV.
Date: November 1, 2011
Creator: Berglin, R. A.
Partner: UNT Libraries Government Documents Department
open access

Research Data Management Practices of Academic Researchers in Turkey

Description: Presentation paper for the 2017 International Conference on Knowledge Management. This paper examines the level of awareness and understanding of the various issues, challenges, and training needs associated with research data management for Hacettepe University researchers
Date: October 25, 2017
Creator: Ünal, Yurdagül & Kurbanoğlu, Serap
Partner: UNT College of Information
open access

High Performance Multivariate Visual Data Exploration for Extremely Large Data

Description: One of the central challenges in modern science is the need to quickly derive knowledge and understanding from large, complex collections of data. We present a new approach that deals with this challenge by combining and extending techniques from high performance visual data analysis and scientific data management. This approach is demonstrated within the context of gaining insight from complex, time-varying datasets produced by a laser wakefield accelerator simulation. Our approach leverages histogram-based parallel coordinates for both visual information display as well as a vehicle for guiding a data mining operation. Data extraction and subsetting are implemented with state-of-the-art index/query technology. This approach, while applied here to accelerator science, is generally applicable to a broad set of science applications, and is implemented in a production-quality visual data analysis infrastructure. We conduct a detailed performance analysis and demonstrate good scalability on a distributed memory Cray XT4 system.
Date: August 22, 2008
Creator: Rubel, Oliver; Wu, Kesheng; Childs, Hank; Meredith, Jeremy; Geddes, Cameron G.R.; Cormier-Michel, Estelle et al.
Partner: UNT Libraries Government Documents Department

Unfolding Research Data Services: An Information Architecture Perspective

Description: Poster presented at the 2018 ACM/IEEE Joint Conference on Digital Libraries. This poster describes the use of a content analysis with the lens of information architecture to better understand how research data services are organize in North American academic library websites, and to what extent the research data lifecycle is supported within these services.
Date: June 4, 2018
Creator: Khan, Hammad; Chang, Hsia-Ching & Kim, Jeonghyun
Partner: UNT College of Information
open access

Unfolding Research Data Services: An Information Architecture Perspective

Description: This paper describes the use of a content analysis with the lens of information architecture to better understand how research data services are organize in North American academic library websites, and to what extent the research data lifecycle is supported within these services.
Date: June 4, 2018
Creator: Khan, Hammad; Chang, Hsia-Ching & Kim, Jeonghyun
Partner: UNT College of Engineering
open access

FastBit: Interactively Searching Massive Data

Description: As scientific instruments and computer simulations produce more and more data, the task of locating the essential information to gain insight becomes increasingly difficult. FastBit is an efficient software tool to address this challenge. In this article, we present a summary of the key underlying technologies, namely bitmap compression, encoding, and binning. Together these techniques enable FastBit to answer structured (SQL) queries orders of magnitude faster than popular database systems. To illustrate how FastBit is used in applications, we present three examples involving a high-energy physics experiment, a combustion simulation, and an accelerator simulation. In each case, FastBit significantly reduces the response time and enables interactive exploration on terabytes of data.
Date: June 23, 2009
Creator: Wu, Kesheng; Ahern, Sean; Bethel, E. Wes; Chen, Jacqueline; Childs, Hank; Cormier-Michel, Estelle et al.
Partner: UNT Libraries Government Documents Department
open access

Efficient Bulk Data Replication for the Earth System Grid

Description: The Earth System Grid (ESG) community faces the difficult challenge of managing the distribution of massive data sets to thousands of scientists around the world. To move data replicas efficiently, the ESG has developed a data transfer management tool called the Bulk Data Mover (BDM). We describe the performance results of the current system and plans towards extending the techniques developed so far for the up- coming project, in which the ESG will employ advanced networks to move multi-TB datasets with the ulti- mate goal of helping researchers understand climate change and its potential impacts on world ecology and society.
Date: March 10, 2010
Creator: Sim, Alex; Gunter, Dan; Natarajan, Vijaya; Shoshani, Arie; Williams, Dean; Long, Jeff et al.
Partner: UNT Libraries Government Documents Department
open access

The Target Data Breach: Frequently Asked Questions

Description: This report answers some frequently asked questions about the Target (store) data breach, including what is known to have happened in the breach, and what costs may result. It also examines some of the broader issues common to data breaches, including how the payment system works, how cybersecurity costs are shared and allocated within the payment system, who bears the losses in such breaches more generally, what emerging cybersecurity technologies may help prevent them, and what role the government could play in encouraging their adoption, as well as some of the legislation that the 113th Congress has introduced to deal with these issues.
Date: April 22, 2014
Creator: Weiss, N. E. & Miller, Rena S.
Partner: UNT Libraries Government Documents Department
open access

Building Agency within the Agency and in the Community: Improving Web Access to Public Health Data in New York City

Description: This paper catalogues the New York City Department of Health and Mental Hygiene's redevelopment efforts to understand users and reimagine uses for its public use data.
Date: November 9, 2018
Creator: Marcial, Laura; Quattrone, Wes; Affrunti, Megan; Hinterland, Kinjia; Laufenberg, Jeff; Duparc, Martin et al.
Partner: UNT College of Information
open access

Application of High-performance Visual Analysis Methods to Laser Wakefield Particle Acceleration Data

Description: Our work combines and extends techniques from high-performance scientific data management and visualization to enable scientific researchers to gain insight from extremely large, complex, time-varying laser wakefield particle accelerator simulation data. We extend histogram-based parallel coordinates for use in visual information display as well as an interface for guiding and performing data mining operations, which are based upon multi-dimensional and temporal thresholding and data subsetting operations. To achieve very high performance on parallel computing platforms, we leverage FastBit, a state-of-the-art index/query technology, to accelerate data mining and multi-dimensional histogram computation. We show how these techniques are used in practice by scientific researchers to identify, visualize and analyze a particle beam in a large, time-varying dataset.
Date: August 28, 2008
Creator: Rubel, Oliver; Prabhat, Mr.; Wu, Kesheng; Childs, Hank; Meredith, Jeremy; Geddes, Cameron G.R. et al.
Partner: UNT Libraries Government Documents Department
open access

Application specific compression : final report.

Description: With the continuing development of more capable data gathering sensors, comes an increased demand on the bandwidth for transmitting larger quantities of data. To help counteract that trend, a study was undertaken to determine appropriate lossy data compression strategies for minimizing their impact on target detection and characterization. The survey of current compression techniques led us to the conclusion that wavelet compression was well suited for this purpose. Wavelet analysis essentially applies a low-pass and high-pass filter to the data, converting the data into the related coefficients that maintain spatial information as well as frequency information. Wavelet compression is achieved by zeroing the coefficients that pertain to the noise in the signal, i.e. the high frequency, low amplitude portion. This approach is well suited for our goal because it reduces the noise in the signal with only minimal impact on the larger, lower frequency target signatures. The resulting coefficients can then be encoded using lossless techniques with higher compression levels because of the lower entropy and significant number of zeros. No significant signal degradation or difficulties in target characterization or detection were observed or measured when wavelet compression was applied to simulated and real data, even when over 80% of the coefficients were zeroed. While the exact level of compression will be data set dependent, for the data sets we studied, compression factors over 10 were found to be satisfactory where conventional lossless techniques achieved levels of less than 3.
Date: December 1, 2008
Creator: Melgaard, David Kennett; Byrne, Raymond Harry; Myers, Daniel S.; Harrison, Carol D.; Lee, David S.; Lewis, Phillip J. et al.
Partner: UNT Libraries Government Documents Department
open access

Big Data and Government Information

Description: Panel paper for the 2017 International Conference on Knowledge Management. This panel explores the current landscape of big data applications for government information.
Date: October 25, 2017
Creator: Alemneh, Daniel Gelaw; Phillips, Mark Edward; Smith, Lisa & Shimabuku, Hope
Partner: UNT Libraries
open access

Note on a simple tool for the comparison of several estimates of the same parameter

Description: One basic step towards improving the quality of data produced by a data collection agency is to identify problem areas that require special attention. Comparisons with other comparible data sources, when possible, serve as a simple technique that can explore existing problems for one or both data collection systems. In this paper, a simple tool is suggested which can be used in the comparison of several estimates of the same parameter and can provide some statements about the accuracy of the estimates vs. the target parameter. This tool, called the Maximum Ratio, gives a measure of closeness among the estimates and also serves as the basis of a test for determining when at least one of the estimates under comparison will be more than (100)(..cap alpha..) percent away from the true parameter where 0 < ..cap alpha.. < 1.
Date: December 1, 1981
Creator: Tsao, H. & Wright, T.
Partner: UNT Libraries Government Documents Department
open access

Review of particle properties. 25th anniversary edition

Description: This review is a reprint of Physics Letters, Vol. IIIB, April 22, 1982, and is an updating through December 1981 of our previous review of particle properties (Particle Data Group (1980)). As in previous editions we have attempted to make the text as complete and self-contained as possible. The results of our compilation are presented in two sections, the Tables of Particle Properties and the Data Card Listings. The Tables summarize the properties of only those particles whose existence is in our judgment experimentally well founded and which have a high probability of standing the test of time. The Data Card Listings give up-to-date information, with references, on all reported particles, whether considered well established or not. The Listings also contain mini-reviews on questions of interest. As in previous editions, we include a section of miscellaneous tables, figures, and formulae. These are aimed at the practicing high energy physics experimentalist. (WHK)
Date: April 1, 1982
Partner: UNT Libraries Government Documents Department
open access

Data Formats and Procedures for the ENDF Neutron Cross Section Library

Description: Report issued by the Brookhaven National Laboratory discussing the formats and procedures used at the ENDF Neutron Cross Section Library. As stated in the introduction, "this report describes the philosophy, data formats, and procedures that have been developed for the Evaluated Nuclear Data File (ENDF)" (p. 1-1). This report includes tables, and illustrations.
Date: March 1971
Creator: Drake, M. K. & Honeck, Henry C.
Partner: UNT Libraries Government Documents Department
open access

A software perspective of environmental data quality

Description: Because of the large amount of complex data in environmental projects, particularly large decontamination and decommissioning projects, the quality of the data has a profound impact on the success and cost of the mission. In every phase of the life cycle of the project, including regulatory intervention and legal proceedings, maintaining the quality of data and presenting data in a timely and meaningful manner are critical. In this paper, a systemic view of data quality management from a software engineering perspective is presented. A method of evaluation evolves from this view. This method complements the principles of the data quality objective. When graded adequately, the method of evaluation establishes a paradigm for ensuring data quality for new and renewed projects. This paper also demonstrates that incorporating good practices of software engineering into the data management process leads to continuous improvement of data quality.
Date: July 1, 1995
Creator: Banerjee, B.
Partner: UNT Libraries Government Documents Department
Back to Top of Screen