Use the Data Integrator window panels to:
See Also Data Integration Overview OpenVMS Data Integrator UNIX Data Integrator Performing Tasks in the Data Integrator Window
The Data Integrator window controls the merging of measured, sampled, and event data to produce a uniform system and network resource utilization file (.MERG or .merg) and a report file (.PROC or .proc).
The merged file is also reduced in size in preparation for the analysis phase. In the Workload Analysis component, you read in the merged data after completing the classification step.
The report file contains user profile information for the merged data.
The Data Integrator window has panels in which you can indicate the time period for data integration, the data files to integrate, and options specific to OpenVMS and UNIX systems.
See Also OpenVMS Data Integration Overview UNIX Data Integration Overview
For OpenVMS, data files that can be integrated include:
The Data Integrator component merges the .CPD, .EVM, and .DAT files to generate a single file .MERG file for each node.
For OpenVMS, you can also select different processing options.
See Also OpenVMS Data Integrator Performing Tasks in the Data Integrator Window
For UNIX systems using FullSail data, all the data for one UNIX system can be contained in a single file of type .arc. You must therefore use the Data Integrator to separate the data according to node name. The resulting .merg files contain the measured process and configuration data for each node in the system. You must repeat the integration process for each node to be included in the model.
For UNIX systems collecting system accounting plus snapshot data, use .acct and .snap files to create a .merg file for each node to be included in the model.
For UNIX systems using DECps data, use .CPD files.
For UNIX systems using OpenAviator data, you will have data files for each node on which you collected data.
See Also UNIX Data Integrator Performing Tasks in the Data Integrator Window
The OpenVMS Data Integrator has several panels and a menu item for selecting data and processing options:
The data push buttons display the File Selection dialog box, allowing you to specify DECps and Event Monitor data as input to the Data Integrator. The selected data files appear in the Data Files Selected fields.
See Also Specifying Data Types for OpenVMS
To specify PATHWORKS data as input to the Data Integrator, from the Options menu in the Data Integrator window, choose Pathworks.... The Pathworks Data dialog box appears.
See Also Pathworks Data Dialog Box
In the Processing options panel, toggle buttons and sliders allow you to select different processing options. You can specify data integration by process, and by process and image within the process. You can also distribute a selected percentage of the interrupt stack time and busy wait overhead time to processes.
See Also Specifying Processing Options (OpenVMS)
The UNIX Data Integrator has panels for selecting the time period for integration, the data source, and data files for input and output:
The push button in the Data Source panel allows you to select Accounting Data, Fullsail Data, DECps data, or OpenAviator data.
See Also Specifying Data Types for UNIX
Push buttons in the Data Input panel display the File Selection dialog box, allowing you to specify Accounting/Snapshot files, a Fullsail archive file, DECps data, or OpenAviator data as input to the Data Integrator. The selected data files appear in the Data Input fields.
For Fullsail data you must select a hostname by entering a name directly into the Hostname field.
See Also Specifying Data Types for UNIX Specifying Hostname (UNIX)
Use the Selection Time Period panel to select a particular time period from the data files that you want to integrate. For DECps and OpenAviator data, you can select the time period automatically from a graphical display generated by clicking the Graph Time Period push button.
See Also Specifying Time Periods for Collected Data Start Time Menu Buttons End Time Menu Buttons
The Data Output panel allows you to select a report file and MERG file for data output.
See Also Specifying Data Integrator Output Files
The tasks that you can perform in the Data Integrator window are
See Also Data Integrator Window OpenVMS Data Integrator UNIX Data Integrator
Your can specify the time period for collected data manually or, for DECps and OpenAviator data, automatically from a graph of the data versus time.
See Also Specifying Time Periods Manually Specifying Time Periods Automatically
To manually specify a time period for collected data, in the Selection Time Period panel:
All the start date and time settings are displayed.
All the end date and time settings are displayed.
See Also Performing Tasks in the Data Integrator Window
To automatically specify a time period for collected DECps or OpenAviator data:
The data is read in and the DI Time Period window displays a graph.
You can descend to four levels of detail when selecting the time period.
The start and end times for the period you selected are automatically set in the Data Integrator window.
See Also Performing Tasks in the Data Integrator Window
For OpenVMS, you can integrate a .CPD file, an .EVM file, and one or more optional PATHWORKS .DAT files.
Each push button displays the File Selection dialog box.
The directory path of the selected data file is displayed in the Data Files Selected text field in the Data Integrator window.
See Also Performing Tasks in the Data Integrator Window Pathworks Data Dialog Box
You can specify processing options for integrating OpenVMS data. You can also assign the amount of Interrupt Stack and Busy Wait overhead to processes.
To select a processing option, perform one of the following steps:
Collected data will be integrated by processes. This is the default. When you define classes during workload characterization, you will indicate specific processes.
or
Collected data will be integrated first by processes and then by images. When you define classes during workload characterization, you will indicate specific processes and images.
To assign overhead to processes:
The processes will be assigned the amount of specified overhead. The default for each is 100%.
See Also By Process Toggle Button (OpenVMS) By Process, By Image Toggle Button (OpenVMS) Percent I-Stack to Merge Slider (OpenVMS) Percent Busy Wait to Merge Slider (OpenVMS) Performing Tasks in the Data Integrator Window
To define the DECps database directory:
The PSDC$DATABASE definition dialog box is displayed.
See Also PSDC$DATABASE Definition Dialog Box Performing Tasks in the Data Integrator Window
For UNIX, you can integrate a Fullsail .arc file, an .acct file and .snap file, a DECps .cpd file, or an OpenAviator data file. If you specify a Fullsail .arc file, you must also specify a Hostname.
Each push button displays the File Selection dialog box.
The directory path of the selected data file is displayed in the Data Input text field in the Data Integrator window.
See Also Performing Tasks in the Data Integrator Window
For UNIX FullSail data, you must specify the name of the host for which you want to integrate the data. To select a hostname:
The field is highlighted.
See Also Performing Tasks in the Data Integrator Window
The Data Integrator outputs two files, a report (.PROC or .proc) and a merged data file (.MERG or .merg).
Each push button displays the File Selection dialog box.
The directory path of the selected data file is displayed in the Data Output text field in the Data Integrator window.
See Also Performing Tasks in the Data Integrator Window
To start the Data Integrator:
or
An informational message is displayed.
See Also Performing Tasks in the Data Integrator Window
Use the File menu to access these menu items:
Use the Start DI Menu item to start the Data Integrator.
See Also Starting the Data Integrator
Use the Options menu to access these menu items:
Use the PSDC$DATABASE definition.... menu item to display the PSDC$DATABASE definition dialog box. You can specify a directory path name for the DECps database.
See Also PSDC$DATABASE Definition Dialog Box
Use the Pathworks... menu item to display the Pathworks Data dialog box. You can specify different types of PATHWORKS data files to the Data Integrator.
See Also Pathworks Data Dialog Box
Use the Extract Date/Time from Data File menu item to automatically select the dates and times for data integration from the data files themselves. If this option is turned on, the software reads the data files to determine start and end dates and times.
Use these buttons to specify a starting time for a time period from DECps and Event Monitor files (OpenVMS) or from accounting files, Fullsail Archive files, DECps files, or OpenAviator files (UNIX).
See Also Selection Time Period Panel
Use these buttons to specify an ending time for a time period from DECps and Event Monitor files (OpenVMS) or from accounting files, Fullsail Archive files, DECps files, or OpenAviator files (UNIX).
See Also Selection Time Period Panel
Use the graph Time period push button to display the DI Time Period Window for a selected DECps or OpenAviator data file. The DI Time Period Window graphs the CPU Utilization and/or Disk I/O Rate against time, and allows you to automatically and visually select a time period for data integration.
See Also DI Time Period Window
The Report File... push button displays the File Selection dialog box. You must select a .PROC file for integration.
See Also Specifying Data Integrator Output Files
The Merg Data... push button displays the File Selection dialog box. You must select a .MERG file for integration.
See Also Specifying Data Integrator Output Files
Use the Start DI push button to start data integration.
See Also Starting the Data Integrator
The DECps Data... push button displays the File Selection dialog box. You must select a .CPD file for integration.
See Also Specifying Data Types for OpenVMS
The Event Monitor Data... push button displays the File Selection dialog box. You can select an .EVM file for integration.
See Also Specifying Data Types for OpenVMS
Use the By Process toggle button to specify data integration by processes. That is, the Data Integrator will record the utilization of system resources as they relate to processes. When you characterize the workload, you will then classify the processes from the .MERG file according to data such as PID, user name, and UIC. By Process is the default.
See Also By Process, By Image Toggle Button (OpenVMS)
Use the By Process, By Image button to specify data integration by process, by image. That is, the Data Integrator will record the utilization of system resources by process and then by image within the process. During workload characterization, you can then classify the images as subclasses within each process. If you do not use the By Process, By Image button, you will not be able to define subclasses for your process data.
See Also By Process Toggle Button (OpenVMS)
The interrupt stack time is the total time that the CPU spends in the interrupt stack during the current interval record.
This slider specifies the percentage of interrupt stack time that you want distributed to processes. You can distribute all, none, or a portion of the interrupt stack CPU overhead to all processes. The default is 100%. Any interrupt stack time that is not distributed to processes is sorted into the last class.
The overhead CPU is calculated for each process based on its percentage of the system-wide total disk I/O, which includes fault I/O.
If the overhead CPU time distributed to interactive processes causes the process total CPU time to exceed the process response time, then the overhead is not added and a warning is displayed.
See Also Percent Busy Wait to Merge Slider (OpenVMS)
This slider specifies the percentage of busy wait overhead time (also called multi-process synchronization time) that you want distributed to processes. You can distribute all, none, or a portion of the busy wait CPU overhead to all processes. The default is 100%. Any busy wait time that is not distributed to processes is sorted into the last class.
The overhead CPU is calculated for each process based on its percentage of the system-wide total disk I/O, which includes fault I/O.
If the overhead CPU time attributed to interactive processes causes the process total CPU time to exceed the process response time, then the overhead is not added and a warning is displayed.
See Also Percent I-Stack to Merge Slider (OpenVMS)
The Fullsail Archive push button displays the File Selection dialog box. You must select an .arc file for integration.
See Also Specifying Data Types for UNIX
The OpenAviator Archive push button displays the File Selection dialog box. You must select an OpenAviator data file for integration.
See Also Specifying Data Types for UNIX
This field allows you to enter the name of the host for Fullsail data integration. For DECps and OpenAviator data, the hostname appears here when the data file is read.
See Also Specifying Hostname (UNIX)
The Accounting push button displays the File Selection dialog box. You must select an .acct file for integration.
See Also Specifying Data Types for UNIX
The Snapshot push button displays the File Selection dialog box. You must select a .snap file for integration.
See Also Specifying Data Types for UNIX
The Select .cpd push button displays the File Selection dialog box. You must select a .cpd file for integration.
See Also Specifying Data Types for UNIX