|
Frequently
Asked Questions
Index
Scorecard
North
American Industry Classification System
Survey
Effectiveness Index Construct Article
|
by
Robert A. Page, Jr., Ph.D.,
and Edward Tamson, Ph.D.
Contact
PDi
about these publications
(copyright 2002 by PDi. All
rights reserved. Please do not modify, copy or distribute this article
-- see website terms and conditions of use.)
Executive
Summary
Most organizations believe
that surveys will give them the data and information measurements necessary
to focus improvement efforts that serve to increase both employee and
customer loyalty, thus, generate the kind of products and services which
create a very positive financial ROI with surveys. For such organizations,
PDi has developed a powerful measurement tool -The PDi Survey Effectiveness
Index ©. This survey is valid and reliable, and provides a comprehensive
assessment of important performance indicators with national comparative
data on how organizational survey efforts compare with those of other
companies. Incorporating the latest research on survey effectiveness,
the PDi Survey Effectiveness Index © provides a measurement of the
key drivers to high quality data collection and information by assessing
the following essentials:
1. Needs Analysis
2. Sampling Strategy
3. Survey Design & Administration
4. Survey Analysis & Feedback
5. Action Planning
6. Survey Synchronization
7. Overall Survey Satisfaction
The procedures used to establish
reliability and validity (face, content and construct) of this index are
outlined in this paper, and well as the pretest findings. Results indicate
that most respondents are not satisfied with the survey efforts at their
organizations - the category mean of 3.15 indicates they are neutral in
their responses to positively worded questions. Survey satisfaction tends
to improve with effective action planning and follow through on survey
results. Respondents also tend to link survey satisfaction with survey
efforts that effectively identify, analyze, and communicate important
information and trends. Poor survey design and reporting issues not only
tend to undermine survey satisfaction, they also tend to undermine respondent
trust in management guarantees of confidentiality. Poor survey quality
and credibility issues may explain the written comments noting poor respondent
motivation and participation in survey efforts.
Pretest Sample.
In the fall of 2003 managers
and employees working in over 50 different companies were contacted to
participate in constructing a benchmark database for this Index. The list
of organizations includes:
|
1. Ansonia Copper & Brass
2. Anthem Blue Cross/Blue Shield
3. BAE Systems
4. Barclay's Capital
5. Bryan Edwards Euro-Clean
6. Bridgeport Hospital
7. Cablevision
8. Computer Innovations
9. Connecticut Post
10. Costco
11. Council of Foundations
12. CT Tax Fraud Bureau
13. Dee's Cleaners and Laundry
14. Dellacamora Co.
15. Dingere
16. Flame Productions
17. Fleet Bank
18. George Harte Nissan
19. Hamilton Connections
20. Health Net, Inc.
21. Holt, Wexter, and Farnham
22. Kennedy Center
23. Kompa Magazine
24. MacDermid Inc.
25. Metlife
26. Merrill Lynch
27. Milani Bakery
28. Millburn Ridgefield Corp.
|
29. NEC America
30. Nicolas Moving LLC
31. People's Bank
32. Phantoms Production
33. Praxis Institute
34. PriceWaterhouseCoopers LLP
35. Prudential Securities
36. PSEG
37. Red Robin
38. Ro-La-Lume Awning
39. Sabian
40. SBC SNET
41. Schler/Lesser Architects
42. Shop Rite
43. Sikorsky Aircraft
44. Southern Connecticut State Univ.
45. Stanadyne Corp.
46. State of Connecticut
47. TC Distribution Co.
48. Trader Joe's
49. Trefz Co. / McDonald's
50. Trilegiant
51. U.S. Postal Service
52. Vinny's Home & Garden
53. Walmart
54. World Gym
55. Yale University
56. Yale New Haven Hospital |
The Survey Effectiveness Index©
is populating its database using NAIC codes, which replaced SIC (Standard
Industry Classification) codes in 1997. For the complete listing of 2002
codes, go to: http://www.census.gov/epcd/naics02/naicod02.txt
Industry Cluster A: Natural Resources
(1) Resources - agriculture, forestry, fishing, hunting, mining, oil and
gas [codes 11-21]
(2) Utilities - electric, gas, water, sewer, steam [code 22]
Industry Cluster B: Heavy Manufacturing
(3) Construction [code 23]
(4) Metal manufacturing - machinery, tools, equipment, transportation
vehicles (land, sea, air and outer space), etc. [codes 331-333, 336]
(5) Chemical manufacturing - including drugs, chemicals, rubber and petroleum-based
products, etc. [codes 324-326]
Industry Cluster C: General Manufacturing
(6) Food and beverage manufacturing [code 311]
(7) Light manufacturing - textiles, apparel, wood, paper, furniture and
other non-metallic products; printing services [codes 313-323, 327, 337-339]
(8) Computer and electronics manufacturing [codes 334-335]
Industry Cluster D: Sales
(9) Wholesale trade - durable and non-durable goods [code 423-424]
(10) Retail trade - durable and non-durable goods [codes 44-45]
Industry Cluster E: Information Services
(11) Communication - publishing, broadcasting, telecommunications [code
51]
(12) Finance, insurance and real estate [codes 52-53]
(13) Professional services - consultants, lawyers, accountants, etc. [codes
54-55]
Industry Cluster F: Private Sector Services
(14) Transportation and warehousing - passenger services, freight shipment
and storage on land, sea or air [codes 48-49]
(15) Support services for business - temp agencies, cleaning services,
collection agencies, call centers, and other service contractors [code
56]
(16) Educational services [code 61]
(17) Health care and social assistance [code 62]
(18) Travel and food services - hotels, restaurants, caterers, museums,
entertainment, theatres, sports and amusement parks, casinos, etc. [codes
71-72]
(19) Personal services - home & auto repair and service, social orgs.,
professional
associations, unions, barber shops, etc. [code 55, 81]
Industry Cluster G: Public Sector Services
(16) Educational services [code 61]
(17) Health care and social assistance [code 62]
(20) Public administration - local, state and federal government [code
92]
Respondents were limited to
working professionals, and were asked to choose a survey effort they were
familiar with, both in terms of taking the survey, and learning of the
results. Most respondents rated employee surveys, and were in the information
and private sector industry clusters:
Validity
and Reliability
The
PDi Survey Effectiveness Index© has been extensively pre-tested and
meets all generally accepted statistical standards for reliability, and
for face, content, and construct validity.
The PDi Survey Effectiveness Assessment Index© has been developed
from reliable, valid, pre-tested questions from the extensive PDi library
of questions. The index is reliable to the extent that the measures are
accurate, dependable, stable, and consistent. The index is valid to the
extent that the survey questions actually measure what they claim to measure.
Validity addresses the basic questions of "Is this survey any good?"
and "Can I trust what the findings are telling me?" Specific
types of validity include (1) face validity, (2) content validity, (3)
normal distribution, and (4) construct validity.
1. Face validity assesses where the language, phrasing, and content of
the survey item is clear, unambiguous, unbiased, and easily understood.
Face validity problems make the survey data impossible to interpret.
2. Content validity assesses
whether the survey adequately explores all relevant aspects of performance,
so that the resultant data set will be comprehensive, maximizing systematic
variance. Content validity problems mean the survey findings are missing
or confusing important parts of the story.
3. Normal Distribution assesses
whether the data for each question is normally distributed around the
mean. If means are highly skewed, or the variance is restricted, the question
has two problems. First, the question can not be used for higher level
statistical testing, and second, you can predict the answer to these questions
before you ask them again. They are not providing useful data.
4. Construct validity assesses
whether the survey is performing as it should. Construct validation requires
that the questions and categories feature discriminant validity (they
are statistically distinct and independent from one another) and convergent
validity (they correlate with each other in patterns which make conceptual
sense).
Face and content validity were
established by an expert panel composed of academic researchers, consultants
and organizational practitioners. Once any problems identified by the
expert panel were corrected, the index was pre-tested.
Normal distribution around the mean was measured through coefficients
of skewness and kurtosis. Complying with generally accepted statistical
standards, all questions featured skewness or kurtosis coefficients between
+2.0 and -2.0.
Construct validity is assessed through measuring convergent and discriminant
validity. Discriminant validity assesses whether the questions load into
independent categories (called factors), or whether they are really measuring
the same concept, and can be collapsed or eliminated. To test discriminant
validity, the more rigorous exploratory factor analysis procedure was
used. This involved a principal components analysis using an orthogonal
(VARIMAX) rotation. Following generally accepted statistical standards,
factors are retained if they had Eigen values greater than 1.0. Questions
are retained with they featured factor loadings greater than .4, or if
double-loadings are separated by a differential field greater than .10.
While all of the questions successfully loaded on a factor, 5 of the 36
double-loaded. Of the 5, only one featured a sufficiently large differential
field, and the other 4 were edited accordingly. They were excluded from
subsequent calculations of category scores and Cronbach Alpha reliability
coefficients.
Convergent validity explores if questions and categories are statistically
related to each other in conceptually consistent ways. Convergent validity
was explored using Pearson correlation and intercorrelation matrices.
As expected, most of the categories established by factor analysis and
most of the individual questions were significantly, positively related,
meaning there was over a 95% probability that when the scores on one of
these categories or questions were low, the other categories and questions
tended to be low as well, and when scores were high, the others also tended
to be high. For example, you would expect the categories of "question
quality" and "survey credibility" to be positively related
- you seldom see one without the other also being present.
While the correlations were positive and significant, they were not identical,
and seldom exceeded .70. This indicates low to moderately strong correlations,
indicating that while the questions and categories are related, they are
still contributing different and unique information.
Some of the individual questions and categories are not significantly
related to each other, whatever their scores might be. This means that
a high score on one such question or category gives you no clue as to
what the score of the other question or category might be. As expected,
none of the index questions or categories were significantly negatively
or inversely related, meaning that when the scores of one are low, the
others tend to be high.
Index Construct
The PDi Survey Effectiveness Assessment Index © contains 35 questions
with an agreement rating scale in 10 different categories. The agreement
scale featured 5 rating points and a non-response option:
· 1: Strongly Disagree
· 2: Disagree
· 3: Neutral
· 4: Agree
· 5: Strongly Agree
· 6: Don't Know
The 10 categories were established
from the 9 factors which emerged from the exploratory factor analysis
described above. The satisfaction category questions loaded on the same
factor as the action planning category questions, and these categories
share the highest correlation. Descriptive statistics and a Pearson Correlation
matrix follow the category list.
1. Needs Analysis
2. Sampling Strategy
3. Survey Design & Administration: Question Quality
4. Survey Design & Administration: Survey Credibility
5. Survey Analysis & Feedback
6. Action Planning
7. Survey Synchronization with Customers
8. Survey Synchronization with Internal Partners
9. Overall Survey Satisfaction
10. Respondent Trust
Demographic
Comparisons
Four demographic categories were included to assess each respondent: gender,
ethnicity, managerial responsibility and functional specialization. These
demographics were measured by the following nominal questions:
Gender: What is your gender?
1. Male
2. Female
Ethnicity:
Which category best describes your ethnicity?
1. Hispanic
2. Black
3. Asian / Pacific Islander
4. White
5. Native American / Eskimo
6. Other
Managerial
Responsibility: Which level best matches your job?
1. No managerial or decision making duties
2. Supervisor - occasionally directs others during a project or task
3. Manager - consistently directs a team or group of people
4. Senior Manager - directs managers
5. Executive - in an executive team of a business unit or autonomous division
Functional
Specialization: Which of the following functional areas would you claim
as your own, and would claim you as a member?
1. Sales and Marketing
2. Customer Service
3. Product Management (including engineering and development)
4. Operations and Manufacturing
5. Support (MIS, Research, Art & Graphics, etc.)
6. Administration (HR, PR, Legal, Finance & Accounting, etc.)
Most of the demographic groups,
with the exception of gender, did not feature enough respondents for reliable
comparisons. As the database becomes populated, this problem of small
numbers will be resolved.
As for gender, women consistently gave higher ratings on survey effectiveness
than men, regardless of the category. These higher ratings were statistically
significant (over 99% confidence, under .01 significance) for the survey
credibility in the design and administration process, survey synchronization
with customers, needs analysis and respondent trust. These differences
can be illustrated:
Content
Analysis
The PDi Survey Effectiveness Index© pre-test also included three
open ended questions. A detailed Content Analysis of the responses to
each of these three open ended questions was conducted. The following
thematic patterns emerged:
1. What are the primary goals of the survey? Were these goals met?
| |
Primary
Goals |
Frequency |
| |
Improve
the Work Place (ID Strengths/Weaknesses)
Obtain Employee
Input
Measure Employee Satisfaction
Improve Customer Services
Evaluate Management
Miscellaneous
Total
Were these goals met?
Yes
No
Somewhat
Unsure |
14
8
8
6
3
7
46
12
6
3
1 |
2. What are the biggest barriers
and limitations to effectively using this kind of survey research in your
organization?
| |
Barrier |
Frequency |
| |
Poor
Survey Format (Design, Questions & Administration)
Employees Not Motivated to Participate
Confidentiality Concerns
No Follow Up/Action Planning
Poor Sampling of all Employees
Inadequate Resources (Time & Money)
No Barriers
Results Not Communicated
Miscellaneous
Total |
12
8
5
5
4
4
4
2
8
52 |
3. Does this organization support
survey research well? How could the organization do better?
| |
Theme |
Frequency |
| |
Improve
Follow Up/Action Planning
Improve Survey Design, Questions & Administration
Improve Response Rates
Improve Confidentiality
Improve Management Support
Miscellaneous
Total |
15
3
3
2
2
10
35 |
Content
Analysis Summary
The open ended two part question
on goals and goal attainment suggests that improving the work place, obtaining
employee input and measuring employee satisfaction are the top three goals
for conducting surveys in this sample. Given that most of the surveys
analyzed were already identified employee surveys, this data did not add
enough information to justify the inclusion of this qualitative question
in the final survey instrument. Goal realization was mixed, with survey
efforts failing to meet their goals 43 percent of the time. Given that
an agreement scale would yield more differentiation than the trichotomous
themes (yes, no, somewhat) in the written comments, this question was
rewritten for use with the 5 point agreement rating scale.
The responses to the open-ended question on barriers and limitations focused
on themes around poor survey format, non-participative / unmotivated respondents,
confidentiality concerns and lack of follow up/action planning. Similarly,
the responses to the open ended question on survey effort support focused
on themes around improving follow up/action planning, improving survey
questions and improving response rates. Respondents provided minimal data
on how their organizations support survey research.
Since respondents focused on the strengths and weaknesses of survey efforts,
the open-ended questions were reworded to directly assess these issues.
In brief, the initial three open ended questions provided us with the
input to better evaluate responses, thus we have reduced the initial three
open ended questions to two open ended question that asked about strengths
and weaknesses.
Outcome
Drivers
The outcome questions assessed overall satisfaction with the quality and
value of the survey effort, as well as whether participants would recommend
this survey to others. In factor analysis, one of the outcome questions,
dealing with participant confidence that guarantees of confidentiality
will be honored, loaded on its own factor. Consequently there are two
outcome categories: Overall satisfaction and respondent trust.
Survey Satisfaction. Overall satisfaction was positively and significantly
correlated to almost all of the other categories, except question quality.
This means that when the category ratings were high, ratings of satisfaction
also tended to be high. When any of the categories were not rated highly,
the ratings of satisfaction tended to suffer as well. This relationship
can be illustrated

Not all significant correlations have the same explanatory power, however.
The higher the correlation, the more ratings of that particular category
explain the variation seen in the ratings of a particular outcome question.
When a correlation exceeds .50, it is considered a strong correlation.
While correlations do not measure causality, change agents who want to
improve ratings of a particular outcome variable often target behaviors
which are strongly related to that outcome. Chances are that improvements
in those areas will improve the outcome of interest as well. Accordingly,
categories with the highest correlations with outcomes can be considered
drivers.
Of all the categories, action planning is the most highly correlated (.70),
to the extent that the action planning and satisfaction questions loaded
together on the same factor in factor analysis. This finding is consistent
with the high frequency of written comment themes emphasizing the importance
of improving follow through and action planning.
Survey analysis and feedback also featured a strong correlation (.52)
with satisfaction. Respondents clearly link satisfaction with survey efforts
that effectively identify, analyze, and communicate important information
and trends. Written comments supported these findings, with respondents
targeting poor survey design and reporting issues as a serious weakness.
Poor survey quality and credibility issues may explain the written comments
noting poor respondent motivation and participation in survey efforts.
Respondent Trust. This single item indicator assessed whether respondents
trusted guarantees of confidentiality. Written comments in response to
both the weakness and improvement questions also emphasized the importance
of this issue. Trust was positively and significantly correlated to almost
all of the other categories, except question sampling strategy and internal
synchronization of survey efforts. This means that when the category ratings
were high, ratings of respondent trust also tended to be high. When any
of the categories were not rated highly, the ratings of respondent trust
tended to suffer as well. This relationship can be illustrated:

Of these categories, the strongest
correlations with trust were survey credibility (.43) and question quality
(.36). This indicates that respondents are most likely to trust promises
and guarantees of confidentiality from management when they see quality
and integrity in the survey design and administration process.
Implications
Of all the categories, action planning was not only strongly correlated
with the satisfaction outcomes (.70), it consistently featured the highest
correlations with the other non-outcome categories as well (correlations
from .39 to .68). As one of the most frequently mentioned qualitative
themes, the importance of this issue in the mind of survey respondents
is indisputable. Consequently, in terms of where to prioritize investments
to improve the survey process, action planning is an excellent place to
start. These correlations between action planning and most of the other
categories are pictured below:
Contact
PDi
about these publications
(copyright 2002 by PDi. All
rights reserved. Please do not modify, copy or distribute this article
-- see website terms and conditions of use.)
|