Member Login

E-mail:    Password:  




 TitleDate AddedCompany
whitepaper Introducing the SAS Code Analyzer2008-03-13 SAS Institute
  This paper introduces the PROC SCAPROC procedure, the SAS Code Analyzer that is new in Release 9.2 of Base SAS Software. The paper examines the advantages of using the procedure, its syntax and phases of execution, and the output that the procedure can produce. If one is responsible for maintaining a large legacy SAS application, one is probably interested in finding ways to improve the performance of the application. This is especially true if the application makes many passes through the same data sets, and one knows that those passes could potentially be run in parallel. However, it can be difficult and tedious to convert an application to run parallel steps, even when one has a good understanding of the application.

Tags: Programming Languages, Application Development
  
whitepaper Getting Started With ODS Statistical Graphics in SAS 9.22008-03-13 SAS Institute
  ODS Statistical Graphics (or ODS Graphics for short) is major new functionality for creating statistical graphics that is available in a number of SAS software products, including SAS/STAT, SAS/ETS, SAS/QC, and SAS/GRAPH. With the production release of ODS Graphics in SAS 9.2, over sixty statistical procedures have been modified to use this functionality, and they now produce graphs as automatically as they produce tables. In addition, new procedures in SAS/GRAPH use this functionality to produce plots for exploratory data analysis and for customized statistical displays. SAS/GRAPH is required for ODS Graphics functionality in SAS 9.2. This paper presents the essential information one needs to get started with ODS Graphics in SAS 9.2.

Tags: Data Mining - Analysis, Data Visualization
  
whitepaper SAS Stat Studio: A Programming Environment for High-End Data Analysts2008-03-13 SAS Institute
  SAS Stat Studio 3.1 is new statistical software in SAS 9.2 that is designed to meet the needs of high-end data analysts - innovative problem solvers who are familiar with SAS/STAT and SAS/IML but need more versatility to try out new methods. Stat Studio provides a rich programming language, called IMLPlus, that blends an Interactive Matrix Language (IML) with the ability to call SAS procedures as functions and to create customized dynamic graphics. For standard tasks, Stat Studio provides the same interactive graphics and statistical capabilities available in SAS/INSIGHT, and so it serves as a programmable successor to SAS/INSIGHT.

Tags: Programming Languages, Data Mining - Analysis
  
whitepaper Updates to SAS Power and Sample Size Software in SAS/STAT 9.22008-03-13 SAS Institute
  Procedures for power analysis and sample size determination were introduced in SAS/STAT 9.1 along with a Web application for performing these analyses. SAS/STAT 9.2 adds analyses for several new designs, enhances some existing analyses, and replaces the Web application with a desktop application for your PC. The new analyses include logistic regression; confidence intervals, equivalence tests, and noninferiority tests for a binomial proportion; and the Wilcoxon Mann-Whitney test for two distributions. These new analyses join the ones already available with SAS/STAT 9.1: general linear univariate models, one-way ANOVA, multiple regression, t tests, confidence intervals and equivalence tests of means, tests of proportions, and survival rank tests.

Tags: Data Mining - Analysis, Data Visualization
  
whitepaper Try, Try Again: Replication-Based Variance Estimation Methods for Survey Data Analysis in SAS 9.22008-03-13 SAS Institute
  Complex survey samples are constructed with selection schemes that affect the usual random assumptions, so SAS/STAT software provides specialized procedures to analyze them: SURVEYMEANS, SURVEYFREQ, SURVEYREG, and SURVEYLOGISTIC for means, frequencies, regression, and logistic analysis, respectively. These procedures all use the Taylor series expansion method for variance estimation, which is usually considered to be the "Gold standard" when it is practical to compute. However, replication methods are also widely used in practice for variance estimation. Replication methods, such as the jackknife and Balanced Repeated Replication (BRR), replace complex algebra with simple repeated analysis. They enable the user to analyze the data without the original sample design, protecting survey security, and they ease the task of estimating variances for nonlinear quantities.

Tags: Data Mining - Analysis
  
whitepaper Five Steps to More Valuable Enterprise Data2008-03-12 DataFlux
 

Companies worldwide struggle with inconsistent, inaccurate or unreliable data - and often don't know how to build more useful corporate information. This white paper examines a five-step method for improving your data, using data profiling, data quality, data integration and other methods to find and fix bad data.



Tags: Data Quality, Business Intelligence - Data Warehousing
  
whitepaper Information Architecture Essentials, Part 4: Improving Usability of Information Systems2008-03-11 IBM
  The best-organized information in the world is useless without an effective way to present that information to the interested audience. Poorly designed interfaces, inadequate search and filtering support, and confusing visual displays can severely hamper the success of any information architecture. To make complex information accessible to end users, one must make a significant effort to research and design the way users will interact with the information management system.   
whitepaper Better Hashing in SAS 9.22008-03-11 SAS Institute
  The DATA step hash object is one of the most versatile new features of Base SAS programming. Since its introduction in SAS 9, the paper has numerous accounts from users where the hash object has drastically reduced data processing time for complex data join operations. Based on feedback, the paper has further enhanced the hash object. For SAS 9.2, the paper introduces the ability to store duplicate keys in a hash object and have added a find frequency counter. This paper leverages the duplicate key capability to implement true SQL-like joins as well as partial-key look-ups. In addition, the paper explores uses for the find frequency counter. Go beyond SAS 9.1 and see why hashing in SAS 9.2 improves how one processes data.

Tags: Programming Languages, Database Management
  
whitepaper Managing Large Data With SAS SPD Server2008-03-10 SAS Institute
  This paper provides the concepts behind demonstrations of how one can enhance query performance when one uses the SAS SPD Server to manage large data tables. This paper does not cover the main concepts for Dynamic Clusters or Parallel Join. This paper focuses on managing data within standard SAS Scalable Performance Data Server (SPD Server) tables, and therefore Dynamic Cluster tables, to enhance data querying. There are three common types of queries for which data within a table can be optimized: ordered processing, subsetting, and a hybrid of ordered processing and subsetting.

Tags: Application Servers, Parallel Processing
  
whitepaper Small Improvements Causing Substantial Savings - Forecasting Intermittent Demand Data Using SAS Forecast Server2008-03-10 SAS Institute
  Businesses require accurate forecasts of time series data that is not continuous. Often, time series data is intermittent (discontinuous or interrupted). Intermittent time series data points are mostly zero (the base value), with occasional departures from the base value. Intermittent time series are common in business and economic data. For example, at progressively lower levels of data disaggregation (larger frequency, smaller geography, or both), the time series data is often intermittent. The most commonly used forecasting techniques are continuous time series methods such as Exponential Smoothing Methods (ESM). Continuous methods are meant to forecast the future values with respect to future time periods.

Tags: Data Mining - Analysis