Search Results

Advanced search parameters have been applied.

The Life-Changing Magic of OpenRefine: The Open-Source Art of Data Decluttering and Organizing

Description: Presentation for the 2017 UNT Open Access Symposium. This presentation provides an overview of a workshop on using OpenRefine to make sense of data, standardize, correct, and rearrange data elements, automate tedious editing, and prepare data for export to other platforms for a variety of uses.
Date: May 18, 2017
Creator: Feustle, Maristella
Partner: UNT Music Library
open access

Nuclear data and measurements series: Some comments on the effects of long-range correlations in covariance matrices for nuclear data

Description: Attention is called to the considerable sensitivity of many uncertainty calculations to the magnitude of the long-ranged correlations which appear in covariance matrices. If such correlations do exist, they must be included in order to properly assess the impact of the uncertainties in the data. If, however, certain assumed long-range correlations are unrealistic, then analyses involving such correlation information are almost certain to produce misleading results. The issue is discussed in general terms, and its importance is illustrated by examples based in part on recent work from this laboratory. Some practical suggestions are offered for dealing with the matter of correlations in instances where the available information is incomplete. 23 refs., 2 figs., 1 tab.
Date: March 1, 1987
Creator: Smith, D. L.
Partner: UNT Libraries Government Documents Department
open access

Research Data Management Practices of Academic Researchers in Turkey

Description: Presentation paper for the 2017 International Conference on Knowledge Management. This paper examines the level of awareness and understanding of the various issues, challenges, and training needs associated with research data management for Hacettepe University researchers
Date: October 25, 2017
Creator: Ünal, Yurdagül & Kurbanoğlu, Serap
Partner: UNT College of Information
open access

An Introduction to Data Science

Description: This book provides non-technical readers with a gentle introduction to essential concepts and activities of data science. For more technical readers, the book provides explanations and code for a range of interesting applications using the open source R language for statistical computing and graphics"--Resource home page.
Date: 2012
Creator: Stanton, Jeffrey M.
Partner: UNT Libraries
open access

RADIX 95n: Binary-to-Text Data Conversion

Description: This paper presents Radix 95n, a binary to text data conversion algorithm. Radix 95n (base 95) is a variable length encoding scheme that offers slightly better efficiency than is available with conventional fixed length encoding procedures. Radix 95n advances previous techniques by allowing a greater pool of 7-bit combinations to be made available for 8-bit data translation. Since 8-bit data (i.e. binary files) can prove to be difficult to transfer over 7-bit networks, the Radix 95n conversion technique provides a way to convert data such as compiled programs or graphic images to printable ASCII characters and allows for their transfer over 7-bit networks.
Date: August 1991
Creator: Jones, Greg, 1963-2017.
Partner: UNT Libraries
open access

Gemini Series Experiment Data Reduction and Storage Techniques

Description: The presentation covers data formats expected from Gemini experiments; data quick look vs. in-depth analysis; iPDV object-oriented data storage; iPDV's traceability of analysis results; optimizing object memory usage in iPDV; and long-term archival of data objects by iPDV.
Date: November 1, 2011
Creator: Berglin, R. A.
Partner: UNT Libraries Government Documents Department
open access

Unfolding Research Data Services: An Information Architecture Perspective

Description: This paper describes the use of a content analysis with the lens of information architecture to better understand how research data services are organize in North American academic library websites, and to what extent the research data lifecycle is supported within these services.
Date: June 4, 2018
Creator: Khan, Hammad; Chang, Hsia-Ching & Kim, Jeonghyun
Partner: UNT College of Engineering

Unfolding Research Data Services: An Information Architecture Perspective

Description: Poster presented at the 2018 ACM/IEEE Joint Conference on Digital Libraries. This poster describes the use of a content analysis with the lens of information architecture to better understand how research data services are organize in North American academic library websites, and to what extent the research data lifecycle is supported within these services.
Date: June 4, 2018
Creator: Khan, Hammad; Chang, Hsia-Ching & Kim, Jeonghyun
Partner: UNT College of Information
open access

Building Agency within the Agency and in the Community: Improving Web Access to Public Health Data in New York City

Description: This paper catalogues the New York City Department of Health and Mental Hygiene's redevelopment efforts to understand users and reimagine uses for its public use data.
Date: November 9, 2018
Creator: Marcial, Laura; Quattrone, Wes; Affrunti, Megan; Hinterland, Kinjia; Laufenberg, Jeff; Duparc, Martin et al.
Partner: UNT College of Information
open access

The Target Data Breach: Frequently Asked Questions

Description: This report answers some frequently asked questions about the Target (store) data breach, including what is known to have happened in the breach, and what costs may result. It also examines some of the broader issues common to data breaches, including how the payment system works, how cybersecurity costs are shared and allocated within the payment system, who bears the losses in such breaches more generally, what emerging cybersecurity technologies may help prevent them, and what role the government could play in encouraging their adoption, as well as some of the legislation that the 113th Congress has introduced to deal with these issues.
Date: April 22, 2014
Creator: Weiss, N. E. & Miller, Rena S.
Partner: UNT Libraries Government Documents Department
open access

FastBit: Interactively Searching Massive Data

Description: As scientific instruments and computer simulations produce more and more data, the task of locating the essential information to gain insight becomes increasingly difficult. FastBit is an efficient software tool to address this challenge. In this article, we present a summary of the key underlying technologies, namely bitmap compression, encoding, and binning. Together these techniques enable FastBit to answer structured (SQL) queries orders of magnitude faster than popular database systems. To illustrate how FastBit is used in applications, we present three examples involving a high-energy physics experiment, a combustion simulation, and an accelerator simulation. In each case, FastBit significantly reduces the response time and enables interactive exploration on terabytes of data.
Date: June 23, 2009
Creator: Wu, Kesheng; Ahern, Sean; Bethel, E. Wes; Chen, Jacqueline; Childs, Hank; Cormier-Michel, Estelle et al.
Partner: UNT Libraries Government Documents Department
open access

High Performance Multivariate Visual Data Exploration for Extremely Large Data

Description: One of the central challenges in modern science is the need to quickly derive knowledge and understanding from large, complex collections of data. We present a new approach that deals with this challenge by combining and extending techniques from high performance visual data analysis and scientific data management. This approach is demonstrated within the context of gaining insight from complex, time-varying datasets produced by a laser wakefield accelerator simulation. Our approach leverages histogram-based parallel coordinates for both visual information display as well as a vehicle for guiding a data mining operation. Data extraction and subsetting are implemented with state-of-the-art index/query technology. This approach, while applied here to accelerator science, is generally applicable to a broad set of science applications, and is implemented in a production-quality visual data analysis infrastructure. We conduct a detailed performance analysis and demonstrate good scalability on a distributed memory Cray XT4 system.
Date: August 22, 2008
Creator: Rubel, Oliver; Wu, Kesheng; Childs, Hank; Meredith, Jeremy; Geddes, Cameron G.R.; Cormier-Michel, Estelle et al.
Partner: UNT Libraries Government Documents Department
open access

Big Data and Government Information

Description: Panel paper for the 2017 International Conference on Knowledge Management. This panel explores the current landscape of big data applications for government information.
Date: October 25, 2017
Creator: Alemneh, Daniel Gelaw; Phillips, Mark Edward; Smith, Lisa & Shimabuku, Hope
Partner: UNT Libraries
open access

An analysis of normal-acceleration and airspeed data from a four-engine type of transport airplane in commercial operation on an eastern united states route from November 1947 to February 1950

Description: Report presenting an analysis of 48,187 hours of normal-acceleration and airspeed data obtained on a four-engine type of transport airplane in commercial operation on an eastern United States route from November 1947 to February 1950 to determine gusts and gust loads for operations. Results regarding the presentation of data, effect of changes in operating conditions, effect of seasons, comparison with other results, reliability, acceleration, gusts, and airspeeds are provided.
Date: August 1953
Creator: Coleman, Thomas L. & Schumacher, Paul W. J.
Partner: UNT Libraries Government Documents Department
open access

An analysis of the normal accelerations and airspeeds of a four-engine airplane type in postwar commercial transport operations on trans-Pacific and Caribbean-South American routes

Description: Report presenting an analysis of normal-acceleration and airspeed data on several transport airplanes of a four-engine type during postwar commerical operations on trans-Pacific and Caribbean-South American routes of the same airlines. The results indicate that the acceleration increment corresponding to the limit-gust-load-factor increment may be exceed about twice in all of the flight miles for each route. Results regarding accelerations, flight loads, gust encounters, and speeds are provided.
Date: August 1950
Creator: Coleman, Thomas L. & Schumacher, Paul W. J.
Partner: UNT Libraries Government Documents Department
open access

Efficient Bulk Data Replication for the Earth System Grid

Description: The Earth System Grid (ESG) community faces the difficult challenge of managing the distribution of massive data sets to thousands of scientists around the world. To move data replicas efficiently, the ESG has developed a data transfer management tool called the Bulk Data Mover (BDM). We describe the performance results of the current system and plans towards extending the techniques developed so far for the up- coming project, in which the ESG will employ advanced networks to move multi-TB datasets with the ulti- mate goal of helping researchers understand climate change and its potential impacts on world ecology and society.
Date: March 10, 2010
Creator: Sim, Alex; Gunter, Dan; Natarajan, Vijaya; Shoshani, Arie; Williams, Dean; Long, Jeff et al.
Partner: UNT Libraries Government Documents Department
open access

Computerizing Materials Data -- A Workshop for the Nuclear Power Industry

Description: Abstract: This report summarizes the recommendations of a Workshop in Computerized Materials Data as related to engineers in the Nuclear Power Industry. Four areas of discussion are featured: the content of a proposed data system; its size and data sources; the user interfaces and system capabilities; and ways of making further progress. In addition, changes in the use of materials data in the Nuclear Power Industry and progress-to-date in computerizing these data are presented.
Date: January 1985
Creator: Rumble, John, Jr. & Westbrook, Jack H.
Partner: UNT Libraries Government Documents Department
open access

Data Formats and Procedures for the ENDF Neutron Cross Section Library

Description: Report issued by the Brookhaven National Laboratory discussing the formats and procedures used at the ENDF Neutron Cross Section Library. As stated in the introduction, "this report describes the philosophy, data formats, and procedures that have been developed for the Evaluated Nuclear Data File (ENDF)" (p. 1-1). This report includes tables, and illustrations.
Date: March 1971
Creator: Drake, M. K. & Honeck, Henry C.
Partner: UNT Libraries Government Documents Department
open access

Note on a simple tool for the comparison of several estimates of the same parameter

Description: One basic step towards improving the quality of data produced by a data collection agency is to identify problem areas that require special attention. Comparisons with other comparible data sources, when possible, serve as a simple technique that can explore existing problems for one or both data collection systems. In this paper, a simple tool is suggested which can be used in the comparison of several estimates of the same parameter and can provide some statements about the accuracy of the estimates vs. the target parameter. This tool, called the Maximum Ratio, gives a measure of closeness among the estimates and also serves as the basis of a test for determining when at least one of the estimates under comparison will be more than (100)(..cap alpha..) percent away from the true parameter where 0 < ..cap alpha.. < 1.
Date: December 1, 1981
Creator: Tsao, H. & Wright, T.
Partner: UNT Libraries Government Documents Department
open access

Review of particle properties. 25th anniversary edition

Description: This review is a reprint of Physics Letters, Vol. IIIB, April 22, 1982, and is an updating through December 1981 of our previous review of particle properties (Particle Data Group (1980)). As in previous editions we have attempted to make the text as complete and self-contained as possible. The results of our compilation are presented in two sections, the Tables of Particle Properties and the Data Card Listings. The Tables summarize the properties of only those particles whose existence is in our judgment experimentally well founded and which have a high probability of standing the test of time. The Data Card Listings give up-to-date information, with references, on all reported particles, whether considered well established or not. The Listings also contain mini-reviews on questions of interest. As in previous editions, we include a section of miscellaneous tables, figures, and formulae. These are aimed at the practicing high energy physics experimentalist. (WHK)
Date: April 1, 1982
Partner: UNT Libraries Government Documents Department
open access

Application specific compression : final report.

Description: With the continuing development of more capable data gathering sensors, comes an increased demand on the bandwidth for transmitting larger quantities of data. To help counteract that trend, a study was undertaken to determine appropriate lossy data compression strategies for minimizing their impact on target detection and characterization. The survey of current compression techniques led us to the conclusion that wavelet compression was well suited for this purpose. Wavelet analysis essentially applies a low-pass and high-pass filter to the data, converting the data into the related coefficients that maintain spatial information as well as frequency information. Wavelet compression is achieved by zeroing the coefficients that pertain to the noise in the signal, i.e. the high frequency, low amplitude portion. This approach is well suited for our goal because it reduces the noise in the signal with only minimal impact on the larger, lower frequency target signatures. The resulting coefficients can then be encoded using lossless techniques with higher compression levels because of the lower entropy and significant number of zeros. No significant signal degradation or difficulties in target characterization or detection were observed or measured when wavelet compression was applied to simulated and real data, even when over 80% of the coefficients were zeroed. While the exact level of compression will be data set dependent, for the data sets we studied, compression factors over 10 were found to be satisfactory where conventional lossless techniques achieved levels of less than 3.
Date: December 1, 2008
Creator: Melgaard, David Kennett; Byrne, Raymond Harry; Myers, Daniel S.; Harrison, Carol D.; Lee, David S.; Lewis, Phillip J. et al.
Partner: UNT Libraries Government Documents Department
Back to Top of Screen