Definition
In SPSS, “procedures” refer to all SPSS commands that read the data and are carried out immediately when run.
SPSS Procedures - What and Why
So what is meant by “SPSS procedures” and why does it matter if a command is a procedure, transformation or other command? This tutorial will briefly address these two questions, demonstrating some stuff on age_income.sav. The screenshot below shows what the data look like in data view after opening it.
data:image/s3,"s3://crabby-images/aadeb/aadebc2bb3e298cbd047f40f2e1032993c8dc685" alt="SPSS Age Income Data File"
Now, perhaps we'd like to see some basic DESCRIPTIVES for income. We can generate those by running the following syntax: descriptives income. The screenshot below shows the result in the output viewer window.
data:image/s3,"s3://crabby-images/2165a/2165a3cbf9dac5da55f64830860288db5bd69137" alt="SPSS DESCRIPTIVES Table"
Now, how can SPSS present us with these statistics? It does so by what's known as as “reading the data” or a data pass in SPSS.
SPSS Data Passes
In SPSS, a data pass refers to the process of SPSS going through all cases in the data (from top to bottom) in order to collect the values on one or more variables. The figure below illustrates the process.
data:image/s3,"s3://crabby-images/da272/da272a6b86c72c9cba3a7c50701bf25784b6392f" alt="SPSS Data Pass"
Now if we consult the command syntax reference on DESCRIPTIVES, we'll encounter the following statement: This command reads the active dataset and causes execution of any pending commands. The first part of the statement indicates that running DESCRIPTIVES triggers a data pass. The “pending commands” refer to any transformations that were run but not yet executed when a procedure is run. We'll demonstrate this below.
Executing Tranformations
In the data at hand, all incomes are stated in Euros. But what if we'd like to have them in Dollars? When the data were collected, 1 Euro corresponded to 1.1 Dollars. We'll therefore COMPUTE income in dollars by running
compute income_dollars = income * 1.1.
Now, since COMPUTE is a transformation command, it's not immediately carried out. The result is shown in the next figure, left half. However, running any procedure - even if unrelated to such “pending transformations” - will execute them. For instance, running
descriptives age.
executes our previous COMPUTE command.
SPSS Procedures - Practical Implications
At this point we have an idea of what's meant by “procedures” in SPSS. So why is it important to distinguish procedures from transformations and other commands? The reason is that procedures often behave differently in a number of ways:
- procedures cause all transformations to be executed;
- in contrast to transformations, procedures can't be used within a LOOP, DO REPEAT or DO IF command;
- procedures indicate the end of TEMPORARY and reverse temporary transformations;
- procedures indicate the end of any VECTOR.
- procedures delete all scratch variables.
SPSS Procedures - Processing Time
SPSS procedures involve data passes; they go through all cases in your data from top to bottom. As a consequence, their processing time depends on the number of cases in your data. If you have many (say 1,000,000 or over) cases in your data, running a procedure may take a few seconds on a modern computer. This does not hold for transformations and other commands.
If you work on a huge dataset and processing time is a real issue, minimizing the number of procedures will make your syntax run faster. For example
descriptives age income.
runs faster than two separate DESCRIPTIVES commands for these two variables. Finally, removing all unnecessary EXECUTE commands is another great way to speed things up.
THIS TUTORIAL HAS 4 COMMENTS:
By BHARATHANAND on January 16th, 2016
Hi ,
I am a research scholar working on conjoint analysis ( using SPSS software). I am stuck with syntax for connecting the demographics with the analysis and can mail u the files ( pls provide your mail id)
By Ruben Geert van den Berg on January 16th, 2016
I'm sorry, we're not currently taking on any assignments, we're fully booked for the next couple of weeks. If you can estimate the available budget for the assignment, we can contact some of our partners and ask whether they can provide you with assistance.
Kind regards,
Ruben
By R. Schneider on December 15th, 2016
Could you please tell us, where are EXECUTE commands not necessary "to speed up things" ?
By Ruben Geert van den Berg on December 15th, 2016
First off, using EXECUTE whenever you like will only slow things down on a slow computer when working on many cases (say n > 10,000 or so). EXECUTE is rarely necessary because all transformations are executed in one go whenever you run a procedure. The simplest rule of thumb is that EXECUTE is never needed but there's a couple of exceptions.
You often need EXECUTE right after a command that uses LAG.
Second, DELETE VARIABLES doesn't run if any transformations are pending.
Third, you may need EXECUTE when using $casenum.
Those are the first exceptions that come to mind. There may be more, though.
Hope that helps!