PARSING: Using SASĀ® When the Data Are Hiding in a Non-Standard Format

Andrew Kuligowski
independent


Abstract

Sequential files? Spreadsheets? Databases? There are numerous tutorials that instruct the SASĀ® user in techniques to extract data from standard sources. Sometimes, however, the desired data is hidden inside a non-standard source; information may be found within the flow of a text document, for example.

This presentation will address some techniques that can be used when not dealing with cleanly formatted data, through use of an example where data are found within a free-form text file. It will deal with identifying what can be considered useful data and what can be discarded, then tackle techniques to extract the data for further analysis, reporting, or whatever is the desired end result.