You are on page 1of 12

BISE – RESEARCH PAPER

Comparing Business Intelligence and Big Data Skills


A Text Mining Study Using Job Advertisements
The required skill set for dealing with big data has not yet been studied empirically.
By analyzing and interpreting the statistical results of a text mining application on job
advertisements, we develop a competency taxonomy for big data and business intelligence.
Our findings can guide individual professionals, organizations, and academic institutions in
assessing and advancing their BD and BI competencies.

DOI 10.1007/s12599-014-0344-2

data sets that are too large and com- and a study by the McKinsey Global Insti-
The Authors plex to be processed using traditional tute states that “[t]he United States alone
storage (e.g., relational database man- faces a shortage of 140,000 to 190,000
Stefan Debortoli, M.Sc. () agement systems) and analysis technolo- people with deep analytical skills as well
Dr. Oliver Müller gies (e.g., packaged software for statistical as 1.5 million managers and analysts
Prof. Dr. Jan vom Brocke analysis). More specifically, researchers to analyze big data and make decisions
Institute of Information Systems and practitioners use the term “big data” based on their findings” (Manyika et al.
University of Liechtenstein to refer to the ongoing expansion of 2011, p. 3).
Fürst-Franz-Josef-Strasse data in terms of volume, variety, velocity Given these figures, we academics have
9490 Vaduz (Laney 2001), and veracity (IBM 2012). to ask ourselves to what degree current
Principality of Liechtenstein Given the current excitement around research agendas and curricula satisfy in-
stefan.debortoli@uni.li big data, critical voices question whether dustry’s growing demand for competence
oliver.mueller@uni.li big data is “really something new or [. . .] in the areas of big data and analytics.
jan.vom.brocke@uni.li just new wine in old bottles” (Buhl et al. Against this background, the objective of
url: http://www.uni.li/iwi 2013) or postulate that we should “for- this paper is to clarify the competency re-
get big data [because] small data is the quirements of the emerging field of big
Received: 2013-10-31 real revolution” (Polock 2013). Others, data (BD) and compare them to the re-
Accepted: 2014-05-07 such as Chen et al. (2012) and Golden quirements of the established field of BI.
Accepted after two revisions by the (2013), argue that big data is not a rev- In particularly, we seek to (1) identify
editors of the special focus. olution but an evolution of traditional and categorize competency requirements
business intelligence (BI). According to for BD professionals and BI profession-
this view, big data analytics widens the als from a practitioner’s point of view
This article is also available in Ger- scope of BI, which focuses on integrating and (2) highlight theses requirements’
man in print and via http://www. and reporting structured data residing similarities and differences.
wirtschaftsinformatik.de: S Debor- in company-internal databases, by seek- The current literature contains only a
toli, O Müller, J vom Brocke ing to extract value from semi-structured few contributions on the topic of BI and
(2014) Vergleich von Kompetenz- and unstructured data originating in data BD competencies, so we collected and an-
anforderungen an Business-Intel- sources like the web, mobile devices, and alyzed empirical data from the BI and
ligence- und Big-Data-Spezialisten. sensor networks that are external to the BD job market. Following the logic of
Eine Text-Mining-Studie auf Basis company. extant studies on information systems
von Stellenausschreibungen. WIRT-
Big data offers enormous opportunities competency requirements (e.g., Gallivan
SCHAFTSINFORMATIK. doi: 10.1007/
for businesses but also poses many chal- et al. 2004; Litecky and Aken 2010; Todd
s11576-014-0432-4.
lenges (Buhl 2013). A survey of nearly et al. 1995), we used online job adver-
3000 executives, managers, and analysts tisements as a data source and performed
© Springer Fachmedien Wiesbaden
from more than 30 industries and 100 a quantitative content analysis of 1357
2014
countries conducted by MIT Sloan Man- BI-related and 450 BD-related job adver-
agement Review and the IBM Insti- tisements using a text-mining technique
tute for Business Value finds that top- called latent semantic analysis (LSA).
performing organizations use analytics Our analysis revealed fifteen distinct
1 Introduction five times more often than lower per- areas of competency for BI profession-
formers do (LaValle et al. 2011), yet not als and fifteen distinct areas of compe-
Big data and big data analytics are among all corporate big data initiatives are suc- tency for BD professional. On the most
today’s most frequently discussed top- cessful. Research shows that “inadequate abstract level, these areas of competency
ics in research and practice (Buhl et al. staffing and skills are the leading barriers can be classified into business competen-
2013). In loose terms, big data refers to to Big Data Analytics” (Russom 2011), cies and IT competencies. The business

Business & Information Systems Engineering


BISE – RESEARCH PAPER

competencies can be further sub-divided structures, policies and rules, workplace and depth of the data processed, but
into management and domain compe- practices, culture). Section 2.1 elaborates also the types of questions they answer.
tencies, and the IT competencies can be on the technological IT resources asso- While BI traditionally focuses on using
further sub-divided into methodological, ciated with BI and BD, Sects. 2.2 and a consistent set of metrics to measure
conceptual, and product-specific compe- 2.3 discuss required human IT resources, past business performance (Davenport
tencies. Comparing and contrasting the and Sect. 2.4 addresses complementary 2006), big data applications emphasize
competency requirements for BI and BD organizational capital resources. exploration, discovery, and prediction.
professionals shows areas of overlap, es- As Dhar (2013) states, “Big data makes it
pecially regarding IT concepts and meth- 2.1 Business Intelligence and Big Data feasible for a machine to ask and validate
ods and the business domain, as well interesting questions humans might not
as clear differences when it comes to IT Howard Dresner of the Gartner Group consider.”
competencies. While BI requires skills in introduced the term “business intelli-
the area of commercial software plat- gence” in 1989, describing “a set of con-
2.2 Business Intelligence Competencies
forms, BD largely relies on software engi- cepts and methods to improve business
neering, statistics skills, and open-source decision making by using fact-based sup-
port systems” (Power 2007). The first As we found no literature that studies in-
products. dividual BI competencies, we gained an
Our empirically grounded frameworks productive BI systems were implemented
at large consumer goods manufacturers overview of individual BI competency re-
of BI and BD competencies contribute to quirements by consulting extant work on
the IS body of knowledge by (1) helping like Procter & Gamble and retailers like
Wal-Mart for the purpose of analyzing BI maturity/capability models, reviews of
professionals to assess and advance their the BI literature, and panel reports.
individual competencies, (2) guiding or- sales data (Power 2007). Although Dres-
ner’s original definition of BI, as well as Both research and practice have
ganizations in composing effective port-
more recent definitions from analysts like engaged in developing BI maturity/
folios of BI and BD professionals, and (3)
Gartner, Forrester, and TDWI, are broad capability models. (For an overview, see,
informing the development of academic
in scope, most practitioners associate e.g., Russell et al. 2010). The general pur-
and professional education programs.
with the term a narrow set of capabilities, pose of such models is to systematize
The remainder of this paper is struc-
such as extraction, transformation, and organizational capabilities and outline
tured as follows. The next section pro-
loading (ETL); data warehousing; on-line pathways for advancing them. Models
vides research background on the topic of
analytical processing (OLAP); and re- that originate from industry include the
BI and BD competencies. Then we intro-
porting (Davenport 2006). The focus of TDWI Business Intelligence Maturity
duce our methodology and explain our
these traditional BI solutions is on ana- Model (Eckerson 2004), Gartner’s Matu-
data-collection and analysis processes.
lyzing historical data in order to answer rity Model for Business Intelligence and
Next, we present our results and dis-
questions like “how much did we sell in Performance Management (Hostmann
cuss our findings against the background
a certain region?” and “how much profit and Hagerty 2010), Gartner’s Magic
of related work. We close by pointing
did we make last quarter?” Quadrant for Business Intelligence Plat-
out the limitations of our work and
At the end of the 1990s, the term forms (Schlegel et al. 2013), and Logica’s
implications for future research.
“big data” started to appear in the scien- Capability/Maturity Model (Van Roekel
tific literature, referring to data sets that et al. 2009). Lahrmann et al. (2011), Din-
were too large to fit into main memory ter (2012), and Cates et al. (2005) provide
2 Research Background or even local disks (Cox and Ellsworth examples of academic BI maturity mod-
1997; Forbes 2013). The first publica- els. Industry maturity models tend to
The resource-based view (RBV) of tions about big data originated from the
the firm, especially the framework by focus on technological capabilities that
field of scientific computing, but in 2001 BI platforms should provide (Russell
Melville et al. (2004), can be used to Doug Laney, an analyst with the Meta
evaluate BI/BD implementations’ gen- et al. 2010). For example, Gartner lists
Group, transferred the concept to the
eration of business value and to assess thirteen essential capabilities, includ-
business domain and coined the term
which resources and competencies are ing reporting, OLAP, and visualization
“the 3Vs” to stand for volume, veloc-
required and may lead to competitive (Schlegel et al. 2013). Such functional IT
ity, and variety, which quickly became
advantage. In the focal firm, IT business the constituting dimensions of big data capabilities provide some guidance for
value is generated by the deployment of (Laney 2001). After the mid-2000s, fu- assessing and developing individual-level
IT and complementary organizational eled by Davenport’s (2006) seminal ar- BI competencies but largely neglect the
resources (Melville et al. 2004). How- ticle “Competing on Analytics,” busi- business-related aspects of BI, such as
ever, IT affects organizational perfor- nesses became increasingly interested in project management and domain skills.
mance only via intermediate business big data, and the focus shifted from tech- By contrast, the academic models pro-
processes. Melville et al. (2004) opera- nical issues around the storage of big data vide a high-level view of strategic BI
tionalize IT based on Barney’s (1991) to its analysis. Internet-based businesses capabilities like architecture planning,
classification of firm resources into phys- like Google, Amazon, and Facebook were IT-business alignment, and generation
ical capital (technological IT resources among the first to exploit big data by ap- of business value. While these topics are
or TIR, i.e., infrastructure and business plying sophisticated data mining and ma- key to engaging effectively in BI on an
applications), human capital (human IT chine learning techniques. What differ- organizational level, we believe that they
resources or HIR, i.e., technical skills and entiates today’s big data analytics appli- are too abstract to be useful in assess-
managerial skills), and organizational cations from traditional business intelli- ing and developing individual-level BI
capital resources (e.g., organizational gence applications is not only the breadth competencies.

Business & Information Systems Engineering


BISE – RESEARCH PAPER

The purpose of literature reviews is of BD specialists or similar jobs, such as Therefore, we study the competencies re-
to analyze and synthesize the academic those of data scientists. quired of BI and BD professionals by per-
body of knowledge, so it is reasonable to In an influential Harvard Business Re- forming an automated content analysis
expect that reviews can provide insight view article, Davenport and Patil (2012) of job ads using a text mining technique
into competency requirements by, for ex- describe a data scientist as “a hybrid of called latent semantic analysis (LSA),
ample, outlining curricula. We identified data hacker, analyst, communicator, and a quantitative method for analyzing qual-
one review in the area of BI that explicitly trusted adviser” (p. 73) and call the job itative data. LSA extracts word usage pat-
comments on aspects of education. Based of the data scientist “the sexiest job of the terns and their meaning through sta-
on market research results from Gartner, 21st century” (p. 70). Likewise, Hammer- tistical computations (Landauer et al.
Chen et al. (2012) perform a bibliomet- bacher, who created the first data science 1998) based on the idea that the contexts
ric study of academic and industry pub- team at Facebook, portrays a data scien- (e.g., documents, paragraphs, sentences)
lications on business intelligence and an- tist as “a team member [who] could au- in which a word appears or does not ap-
alytics and structured the business intel- thor a multistage processing pipeline in pear largely determine the word’s mean-
ligence and analytics (BI&A) discipline Python, design a hypothesis test, perform ing. LSA is based on the classical vector
into three evolutionary waves – BI&A a regression analysis over data samples space model (Salton et al. 1975), in which
1.0 (database-based, structured content), with R, design and implement an algo- documents are represented as vectors of
BI&A 2.0 (web-based, unstructured con- rithm for some data-intensive product or terms, and a collection of documents is
tent), and BI&A 3.0 (mobile and sensor- service in Hadoop, or communicate the represented as a term-document matrix
based content) – and five emerging re- results of our analyses to other members that contains the number of times each
search areas – big data analytics, text of the organization” (as cited in Loukides term appears in each document (Man-
analytics, web analytics, network analyt- 2012). ning et al. 2008). In a fashion similar
ics, and mobile analytics. Chen et al. These characterizations seem to call to exploratory factor analysis, LSA per-
(2012) also outline and map the com- for a hybrid of a computer scientist forms a matrix operation called singular
petency requirements for each of these and statistician, yet many more business- value decomposition (SVD) on the term-
fields and advocate that higher educa- related authors state that, in the world of document matrix in order to reduce its
tion should consider these competencies big data, one cannot separate data pro- dimensionality. The latent semantic fac-
in their curricula. Examples of the com- cessing from analysis or from domain tors that are extracted during this pro-
petencies Chen et al. (2012) name include knowledge (e.g., Chen et al. 2012; Dav- cess can be interpreted as topics running
relational database management systems enport and Patil 2012; Loukides 2012; through the collection of documents an-
(RDBMS), data warehousing, ETL, data Provost and Fawcett 2013; Waller and alyzed. LSA has received growing atten-
mining, statistical analysis, web crawl- Fawcett 2013). Hence, BD specialists tion in the IS discipline for quantitative
ing, recommender systems, social net- must have substantial industry knowl- content analysis of academic papers (e.g.,
work theories, smartphone platforms, edge in order to make sense of statisti- Larsen et al. 2008; Sidorova et al. 2008),
machine learning, process mining, in- cal analyses and communicate effectively social media posts (e.g., Evangelopou-
memory DBMS, cloud computing, senti- with business colleagues. los and Visinescu 2012), sustainability re-
ment analysis, and web visualization. ports (e.g., Reuter et al. 2014), vendor
Wixom’s et al. (2011) panel report 2.4 Organizational Setup of Business case studies (e.g., Herbst et al. 2014), and
notes that industry trends raise concerns Intelligence and Big Data Teams customer feedback (e.g., Coussement and
that “academia may be behind the curve Poel 2008).
in delivering effective Business Intelli- The differences between BI and BD also A typical LSA is comprised of three
gence programs and course offerings to have consequences on how they are orga- phases. (For a more detailed introduc-
students.” Based on surveys conducted nized. Traditionally, BI teams are located tion and numerical examples, see Lan-
at BI practitioner events, Wixom et al. in internal consulting organizations, cen- dauer et al. 1998 and Evangelopoulos
(2011) formulate four academic BI best ters of excellence, or IT departments, et al. 2012). In the first phase, a collection
practices that would close the gap be- where they provide managers and exec- of documents is transformed into a term-
tween BI market needs and the content utives with reports for their well-defined document matrix. This step typically re-
of IS education programs: (1) provide and stable information needs (Burton quires pre-processing of documents (e.g.,
a broader range of BI skills, (2) take et al. 2006; Davenport et al. 2012; Varon removing irrelevant or duplicate docu-
an interdisciplinary approach to BI pro- 2012). However, since most BD initiatives ments) and terms (e.g., uni- and bi-gram
grams, (3) develop reusable teaching re- lack predefined questions and are much tokenization, filtering out uninformative
sources, and (4) align with practice. Be- more experimental in nature (Casey et al. terms, weighting terms according to their
sides arguing for the need for techni- 2013), BD specialists must be organized relative importance).
cal skills, Wixom et al. (2011) argue that so they are close to products and pro- In the second phase, the term-
a deep understanding of business sub- cesses in organizations, that is, co-located document matrix undergoes SVD to
jects (e.g., finance, marketing) and strong with business units (Davenport et al. reduce the dimensionality of the term-
communication skills are required. 2012). document matrix without losing essen-
tial information by identifying groups of
2.3 Big Data Competencies highly correlated terms (i.e., terms that
3 Methodology co-occur together in documents) and
No scientific literature on the topic of BD highly correlated documents (i.e., docu-
competences has yet been published, al- While the literature provides first insights ments that contain similar terms). The
though a number of articles and web re- into the topic of BI and BD competen- result of the SVD is a set of factors (top-
sources anecdotally describe the profile cies, it is not grounded in empirical data. ics) with associated high-loading terms

Business & Information Systems Engineering


BISE – RESEARCH PAPER

and documents. Together, they form pat-


terns of word use that represent topics in
the underlying collection of documents.
The extracted word-use patterns are in-
terpreted in the third phase, which usu-
ally involves additional statistical analyses
and, most importantly, expert judgment.

4 Data Collection and Analysis


4.1 Overview

The next sections illustrate how we ap-


plied LSA to analyze BI- and BD-related
job advertisements. We followed the pro-
cedure described in Sect. 3 and depicted
in Fig. 1. As Fig. 1 indicates, LSA of-
ten requires multiple iterations in which Fig. 1 Data collection and analysis process
experts review statistical results, and in-
puts (e.g., documents, terms) and param-
eters (e.g., term weights, number of fac- job ad. Then we weighted terms based interpretation of the factors. This proce-
tors to be extracted, loading thresholds) on their occurrence in and across doc- dure rotates the coordinates of the term
are fine-tuned in order to yield optimal uments, applying the commonly used loadings matrix in a way that maximizes
results. TF-IDF (Term Frequency-Inverse Doc- the variance of a factor’s squared load-
ument Frequency) weighting scheme, ings on all terms in the matrix. As a re-
4.2 Data Collection and Pre-processing which promotes the occurrence of rare sult, each factor tends to load either high
terms (e.g., “hadoop”) and discounts the or low on a particular term; in other
We performed multiple crawls of the occurrence of more common terms (e.g., words, a term is either descriptive (high-
global online recruitment website mon- “business,” “analysis”) (Manning et al. loading) or not descriptive (low-loading)
ster.com, downloading job advertise- 2008). The two weighted term-document for a particular factor. To maintain the
ments from the U.S., Canada, Australia, matrices built the foundation for the representation of the documents in the
and the U.K. that included either the term subsequent SVD. same factor space, we performed an iden-
“business intelligence” or the term “big tical rotation with the document loadings
data.” We downloaded the data as two 4.3 Singular Value Decomposition (SVD) matrix.
single-day snapshots in September 2013 Next, loading thresholds must be de-
and March 2014. After removing irrele- We performed the SVD using the statisti- fined in order to determine whether a
vant hits (e.g., spam, non-English ads), cal computing software R. The first step term or document is descriptive for a
we had an initial pool of 4246 BI-related of SVD is to define the number of fac- given factor. Again, no standard rules
job ads and 1411 BD-related job ads. tors (topics) to be extracted. Techniques for setting this thresholds have emerged
Following common text-mining proce- from exploratory factor analysis, such as (Evangelopoulos et al. 2012), so we
dures, we reduced the vocabulary in our scree plots and the Kaiser-Harris crite- adopted a heuristic that Sidorova et al.
document collection by removing stop rion, would lead to a high number of (2008) and Evangelopoulos et al. (2012)
words (e.g., “and,” “or,” “then”) and elim- factors, so these techniques are not rec- apply in their LSA-based literature anal-
inating terms that occurred in less than 1 ommended when the goal of LSA is to yses and set the threshold based on the
percent of the documents (Manning et al. identify topics in a collection of docu- probability distribution of term and doc-
2008). The remaining vocabulary con- ments. Since there is no standard proce- ument loadings. For a k-factors LSA, we
tained 6813 terms, which we then manu- dure for determining an optimal number retained the top-1/k high-loading docu-
ally reviewed to filter out other irrelevant of topics, we manually explored alterna- ments, so each term and each document
terms while keeping only those terms tive numbers of factors and qualitatively loads, on average, on one factor. How-
that describe competencies. In particular, assessed the results (Evangelopoulos et al. ever, terms and documents that load high
we removed standard human resources 2012). We tested several dimensionalities, on multiple factors or that load on no
terms like “salary,” “bonus,” and “apply.” including 2, 5, 10, 15, 20, 30, and 50 factor at all are to be expected.
After this manual data clean-up, the final factors. For each solution, we performed The final step consisted of the man-
dictionary that we used as a go-list for the a SVD to compute term and document ual sense-making and interpretation of
further analysis contained 1570 terms. loadings for each factor. the extracted factors and associated high-
Based on the controlled vocabulary loading terms and documents. Two re-
and the two document sets, we built 4.4 Analysis and Interpretation searchers independently interpreted and
two term-document-matrices, one for BI labeled each factor by examining the lists
jobs and one for BD jobs. These ma- Following Sidorova et al. (2008), we per- of extracted high-loading terms and doc-
trices contained the number of times a formed a varimax rotation on the ma- uments. In almost all cases, factor inter-
competency-related term appeared in a trices with the term loadings to simplify pretation was straightforward, and any

Business & Information Systems Engineering


BISE – RESEARCH PAPER

minor disagreements in labeling factors Table 1 Exploratory data analysis


were resolved during a final discussion.
Keywords Big Data (1411 ads) Business Intelligence (4246 ads)

Big Data 100.0 % (1411) 5.2 % (221)


5 Results Business Intelligence 15.7 % (221) 100.0 % (4246)

5.1 Exploratory Data Analysis


BI,” “Sales Executive BI,” and “New Busi- and “mining,” and important descrip-
After downloading the job advertise-
ness Sales Executive.” We had no diffi- tors for IBM BI Platform competencies
ments, we conducted an exploratory data
culty or disagreement in making sense (Factor BI15.12) are “etl,” “report,” and
analysis to get a first feeling for the data.
of and interpreting these results and la- “query.” The varying foci and strengths
We observed that there were about three
beled the two areas of competency “BI of each vendor explain these differences,
times more BI-related job advertisements
Architecture” and “Sales and Business as SAS is strong in data mining and IBM
than BD-related job ads on monster.com.
Development.” Cognos is a leader in data warehousing.
As a next step, we conducted a word fre-
The fifteen-factor solution revealed Our analysis also produced some
quency count, looking for overlaps be-
clearly distinguishable BI-related topics generic IT competencies, such as data-
tween job ads (cf. Table 1). The results
that were neither too broad nor too spe- base administration, software engineer-
showed that about 15 percent of the BD
cific. Table 2 provides an overview of the ing, and BI architecture. Database ad-
jobs also include the term “business intel-
results and shows the high-loading terms ministration requires SQL knowledge as
ligence,” while only 5 percent of the BI job
and job ad titles as well as the manu- well as knowledge in performance tun-
ads also included the term “big data,” per-
ally assigned labels for each of the ex- ing of applications. Typical job ads that
haps an indicator that BD requires some tracted factors. The terms and job ad ti- include these competencies are titled
basic BI-related skills, but BI does not tles are presented in order of descrip- with “DBA” and its variants, depending
necessarily require BD skills. tiveness, as expressed by the factor load- on the operating platform (e.g., Oracle
The word frequency count also showed ings calculated during SVD. (Uninfor- or MS SQL). Software engineering de-
that the frequency with which the terms mative terms and duplicate job titles scribes the competency of building cus-
“business intelligence” and “big data” ap- were removed.) We will refer to these tom pieces of software for data analy-
peared in the job ads was unbalanced, factors as competency requirements or sis. In particular, Java programming skills
as many ads contained the search terms competencies. and web front-end-development knowl-
only once. A manual inspection of a sam- Table 2 makes clear that industry de- edge are demanded. Last, the factor BI ar-
ple of these ads revealed that the search mands both business and IT compe- chitecture describes a demand for exper-
terms often occurred only in the com- tencies. The group of business-oriented tise along the whole BI stack, from ETL
pany descriptions (e.g., “our company competencies includes those related to to building reports.
specializes in big data solutions”) and specific domains (i.e., healthcare and dig- In addition to analyzing single areas of
that the companies were not looking for ital marketing) and those related to man- competency, we determined the current
any BD-/BI-related employees but for, agerial competencies (e.g., project man- demand for each competency by calculat-
for example, a team assistant. Therefore, agement). The IT competencies can be ing how many job ads loaded high on a
we filtered out job ads that included the divided into those related to vendor- factor. The relative numbers of jobs as-
keywords “big data” or “business intel- specific products (e.g., Microsoft, SAP) signed to a factor, displayed in Table 2,
ligence” only once, which narrowed our and those related to general concepts indicate that competencies in BI plat-
data set down to 450 BD-related ads and and methods (e.g., database administra- forms, healthcare, and sales and business
1357 BI-related ads. (The ratios displayed tion, BI architecture). Figure 2 aggre- development are among the competen-
in Table 1 were almost unchanged.) gates the fifteen areas of competency in cies with the highest demand on the BI
a taxonomy. job market. Table 2 also shows that the
5.2 Competency Requirements for A more detailed examination of the demand for business-related jobs and IT-
Business Intelligence Professionals descriptive terms and job ads associ- related jobs is almost evenly distributed.
ated with each factor gives insights into
On the most abstract level of the LSA, the corresponding competency require- 5.3 Competency Requirements for Big
the two-factor solution, jobs were as- ments. Among the vendor-specific com- Data Professionals
signed to only two topics. The first petencies are product and technology
factor was associated with high-loading names of specific vendors. For example, To report on the results for BD-related
descriptive terms like “developer,” “sql BI professionals working with SAP tech- jobs, we conducted the LSA on sev-
server,” “data warehouse,” “etl,” and “bi nologies (Factor BI15.04) need compe- eral levels of abstraction. On the most
developer.” Associated titles of job ads tencies in SAP BusinessObjects (“busi- abstract level, the two-factor solution,
included “BI Developer SQL Server,” ness objects”), SAP Business Warehouse we assigned jobs to two topics. The
“ETL Developer,” and “SQL Server DBA.” (“sap bw”), and/or the SAP High Per- five highest-loading terms for the first
Terms like “sales,” “business develop- formance Analytical Appliance (“hana”). topic were “java,” “developer,” “hadoop,”
ment,” “marketing,” “account,” and “new Vendors focus on varying aspects of BI, “web,” and “sql,” and exemplary titles of
business” described the second group of as competencies related to the SAS BI high-loading job ads were “Experienced
jobs, with such exemplary associated job Platform (Factor BI15.07) are described Java Developer,” “Java Hadoop Devel-
titles as “Business Development Manager using terms like “statistical,” “analytics,” oper,” and “Data Scientist Java Hadoop

Business & Information Systems Engineering


BISE – RESEARCH PAPER

Table 2 Competency requirements for business intelligence professionals

Factor ID Factor label High-loading descriptive terms (excerpt) Titles of high-loading job ads (excerpt)
(% of jobs)

BI15.01 Healthcare care, health, systems, reporting, Business Analyst Regulatory Healthcare, Report Writer
(9 %)a information, analysis Business Analyst, Manager Clinical Decision Support

BI15.02 Sales and Business sales, business development, executive, Legal Sales Executive, Business Development Manager,
(8 %) Development legal, sales team Sales Executive Business Intelligence, Sales Manager
Business Intelligence

BI15.03 BI Platforms sql server, ssis, ssrs, ssas, microsoft, BI Developer SSIS SSAS SSRS SQL, BI Data Warehouse
(16 %) (Microsoft) microsoft bi, reporting services, etl Developer SQL Server, SQL Server Developer, ETL
Developer Business Intelligence SSIS SQL SSRS

BI15.04 BI Platforms (SAP) sap, sap bi, hana, business objects, sap bw, SAP BI Principal Consultant, SAP BI Senior Technical
(5 %) erp, consultant, business analyst, crystal Consultant, SAP BI Report Analyst Developer, Senior
Business Objects Consultant

BI15.05 Digital Marketing marketing, digital, campaigns, product, Senior Marketing Executive Online Data Solutions Job,
(6 %) analytics, segmentation, customer Marketing Database Analyst, Email Marketing Manager,
Digital Relationship Marketing Manager

BI15.06 Database dba, database administrator, sql server, Oracle DBA SQL Server Database Administrator, Senior
(2 %) Administration oracle, sql, production, developer, tuning DBA SQL Server Database Administrator, SQL DBA with
BI Business Intelligence, MS-SQL Server DBA

BI15.07 BI Platforms (SAS) sas, studio, analytics, statistical, mining, SAS BI Analyst, Data Analytics Business Intelligence
(3 %) olap, data mining, data analytics Consultant, Senior SAS Developer, SAS Consultant

BI15.08 Software java, eclipse, apache, web, linux, engineer, Senior Java Consultant, Senior Java Technical Consultant,
(4 %) Engineering software, javascript, developer, big data Mobile Developer Java jQuery HTML5, Front End
Engineer, Senior Backend Engineer

BI15.09 BI Architecture bi developer, etl, developer, bi stack, Business Intelligence Developer, ETL Business Intelligence
(5 %) organization, report Developer, BI Developer Excel Microsoft BI SQL Server,
Senior BI Developer Architect

BI15.10 Project Management project manager, project, management, Senior Project Manager, Technical Project Manager, BI
(5 %) head, client, change, agile, planning Project Manager Data Warehouse Implementations, Sr
Project Manager Business Intelligence

BI15.11 Web Portals sharepoint,.net, server, microsoft, SharePoint Developer, SharePoint 2007–2010 Developer,
(3 %) (Microsoft) administrator, software, web, application SharePoint Administrator SharePoint 2010 Server,
SharePoint Consultant, SharePoint Architect

BI15.12 BI Platforms (IBM) cognos, studio, manager, report, Cognos BI Developer, Cognos BI Manager, Cognos
(5 %) framework, developer, ibm, query, Designer, MIS Manager with Cognos BI experience,
analyst, etl Cognos 10 Consultant Developer

BI15.13 BI Platforms qlikview, microstrategy, oracle, obiee, MicroStrategy Business Intelligence Analyst,
(15 %) (QlikView, warehouse, etl, architect, consultant MicroStrategy Developer, Senior QlikView Developer, BI
Microstrategy, Visualization Consultant, ETL Specialist
OBIEE)

BI15.14 Business Analysis business analyst, data analyst, reporting, Business Analyst, Business Analyst SAP APO Excel Expert,
(6 %) excel, organization, specialist, pivot Data Analyst, Reporting Data Analyst, BI Report Analyst,
Technical Business Analyst

BI15.15 Business consultancy, business development, sales, Business Development Manager & Market Intelligence
(7 %) Development development manager, account, market Consultancy, Sales Business Development Manager, Sales
(Consultancy) Account Manager Research Consultancy

a Even though we retained the top-1/k term and document loadings and set the computed threshold value accordingly, we followed Evangelopoulos

et al. (2012) in double-checking and manually selecting a threshold for each factor separately based on domain knowledge. As a result, we had to
reduce the number of jobs that loaded on the first factor (BI15.01).

Business & Information Systems Engineering


BISE – RESEARCH PAPER

Fig. 2 Business intelligence competency taxonomy

NoSQL.” In contrast, the top five descrip- In contrast to the BI competencies, we 5.4 Comparison
tive terms for the second topic were “digi- find no factors related to the technolo-
tal,” “sales,” “manager,” “advertising,” and gies of commercial vendors, yet many We identified a number of similarities be-
“marketing,” and frequent job titles in- conceptual and methodological compe- tween the fields of BI and BD. Especially
cluded “Digital Sales Executive,” “Sales tencies, as well as programming skills when it comes to generic IT concepts and
Manager Big Data,” and “Digital Rela- in various languages are required. In methods and business skills, we observed
tionship Marketing Manager.” The exam- the factor representing competency in a considerable overlap between BI and
ination of the highest-loading terms and NoSQL (BD15.01), not a single prod- BDA (cf. Fig. 4). For example, working in
job titles for both factors suggests that uct or technology name of one of the either field requires a certain amount of
the first factor describes jobs related to big commercial database vendors ap- software engineering and database com-
the development of BD solutions (big pears. Instead, terms referring to open- petency. Sales and business development
data developers), while the second factor source technologies from the Apache skills for managing BI and BD solutions
refers to the use of BD in marketing and also overlap. Finally, domain knowledge
Foundation are dominating the descrip-
sales (big data users). overlaps in healthcare/life sciences and
tions (e.g., “hadoop,” “hive,” “pig,” “cas-
Table 3 provides an overview of the digital marketing, domains known to be
sandra”). Furthermore, conceptual and
results of the fifteen-factor solution and especially data-driven. The absence of
methodological IT skills like quantita-
shows exemplary high-loading terms and other domain skills is a result of the level
tive analysis (BD15.03), machine learn- of analysis we chose; a more granular LSA
job titles, as well as the manually assigned
ing (BD15.05), database administration on BI and BD job ads (e.g., 50 instead
labels for each of the extracted factors.
(BD15.10), and software engineering and of 15 factors) would reveal the additional
The inspection of the identified areas of
testing (BD15.13, BD15.14) are in high domains of banking, finance, insurance,
competency shows that, just as for BI
demand. These findings suggest that the and supply chain management.
jobs, competencies can be clustered into
business competencies and IT competen- field of BD is not (yet) dominated by The major differences between BI and
cies. The IT competency area can be fur- big vendors’ standard software but (still) BD competencies are discussed in the
ther broken down into generic concepts relies largely on open-source technolo- next section.
and methods like quantitative analysis, gies and custom-made software solu-
machine learning, and database adminis- tions.
tration, and products for developing big Comparing the relative demand be- 6 Discussion
data solutions (i.e., a variety of program- tween business and IT competencies re-
ming languages and NoSQL databases). veals that almost 70 percent of the Our research revealed highly demanded
The group of business-oriented compe- posted BD-related job ads seek techni- BI and BD skills in at least two ar-
tencies is made up of domain competen- cal skills. Knowledge in NoSQL databases eas, business and IT. This first finding
cies in the areas of life sciences and digital and software engineering and program- empirically grounds the ongoing discus-
marketing, as well as managerial compe- ming are the most highly demanded areas sion about business knowledge’s being as
tencies in sales and business development of technical competency. Digital market- important as technical skills for work-
and working in start-up companies. Fig- ing, business development, and sales con- ing successfully on BI and BD initia-
ure 3 summarizes these findings in a big stitute highly demanded business compe- tives (e.g., Chen et al. 2012; de Lange
data competency taxonomy. tencies. 2013; Waller and Fawcett 2013; Wixom

Business & Information Systems Engineering


BISE – RESEARCH PAPER

Table 3 Competency requirements for big data professionals

Factor ID Factor label High-loading descriptive terms (excerpt) Titles of high-loading job ads (excerpt)
(% of jobs)

BD15.01 NoSQL Databases hadoop, nosql, java, hive, scripting, Java Hadoop Developer, Big Data Solutions Architect, Big Data
(17 %) distributed, database, apache, mapreduce, Consultant, Database Architect, Big Data Scientist, Chief
hbase, pig, cassandra Architect Big Data Guru

BD15.02 Sales digital, sales, advertising, manager, media, Junior Digital Sales Manager, Digital Agency Sales Manager,
(6 %) forecasting, presentation, platforms New Business Sales Manager

BD15.03 Quantitative quantitative, risk, analyst, models, Senior Quantitative Analyst, Quantitative Analyst Financial
(2 %) Analysis modeling, matlab, java, algorithms, Risk Management, Big Data Business Systems Analyst, Sr Data
physics, phd, financial, mathematics, data Analyst
analyst

BD15.04 Programming java developer, junit, tdd, hadoop, maven, Experienced Java Developer Java Multi Thread JUnit TDD,
(8 %) (Java) git, nosql, hibernate, eclipse, agile, hive, Java Developer Big Data, Java Hadoop Developer, Senior Java
mongodb, apache, pig Architects Developers Core Java Programming, Senior Java
Consultant Java Spring Hibernate Maven

BD15.05 Machine Learning data scientist, machine learning, Data Scientist Machine Learning C Java Python, Software
(7 %) visualization, statistical, algorithms, Engineer Data Scientist Machine Learning, Security Cleared
mining, predictive, analysis, science, Big Data Scientist, Big Data Architect Hadoop R Machine
mathematics Learning

BD15.06 Startup startup, sales, analytics platforms, market, Sales Big Data Software, Front End Developer for Big Data
(2 %) data analytics, applications, solutions, Startup, Big Data Analytics Sales Consultant, Junior Python
information, enterprise, consultant Developer Big Data Tech Startup, Java Software Engineers
High Profit Big Data Startup

BD15.07 Programming net, sql server, microsoft, visual, High Paid Junior C# ASP.NET Developer, Developer.NET
(2 %) (.NET) developer, warehouse, api, front end, MVC API, C#.NET Developer SQL Server Senior Software
scrum, mvc, agile, project, online Engineer Big Data, Junior C# ASP.NET Developer

BD15.08 Life Science sciences, life, medical, visualization, Strategic Account Manager Big Data Life Sciences, Big Data
(2 %) revenue, health, care, project Engineer Exciting Start Up, Revenue Analyst, Project Manager
management, industries, consulting Big Data, Solutions Consultant Big Data Analytics
Visualisation

BD15.09 Programming developer, php, web, javascript, front end, Lead PHP Ninja, Front End Developer Start Up, Senior PHP
(8 %) (PHP/JavaScript) user, css, agile, html, jquery, web services, Developer, UI Developer Big Data JavaScript, Front End
api, mysql, open source, mongodb Developer HTML5 CSS3 JavaScript, PHP Web Developer OOP
LAMP

BD15.10 Database dba, mysql, oracle, high availability, sql MySQL DBA Big Data High Availability Replication, DBA
(4 %) Administration server, linux, database, senior, Data Modeler, Junior Database Administrator, DBA Systems
consultancy Engineer MySQL NoSQL Big Data Unix Linux
BD15.11 Digital Marketing marketing, digital, analytics, media, Associate Director Digital Media Analytics, Marketing
(10 %) insights, information, social, research, Director, Senior Analyst Big Data Digital Media, Digital
strategy Relationship Marketing Manager

BD15.12 Business sales, customer, revenue, account, Business Development Manager Big Data Technology, Business
(11 %) Development management, executive, business Development Manager Cloud Computing
development, marketing, relationships

BD15.13 Software software engineer, linux, data engineer, Senior Engineer Big Data, Principle Software Engineer Head of
(9 %) Engineering online, professional, open, product, Software Development, Senior Big Data Engineer, Senior
natural language, systems, agile, Software Engineer
distributed

BD15.14 Software Testing testing, software, engineer, product, Python Test Engineer, Software Test Engineer, Java Tester,
(9 %) machine learning, automated, Software Design Engineer in Test, Test Lead
development, open source, agile, building
BD15.15 Data Warehousing etl, data warehouse, business intelligence, Data Warehouse Product Owner, Director of Data
(3 %) manager, project, technical, modeling, Engineering, Snr Business Intelligence Developer, ETL
agile Engineer, DWH Delivery Manager

Business & Information Systems Engineering


BISE – RESEARCH PAPER

Fig. 3 Big data competency taxonomy

et al. 2011). For example, de Lange (2013)


sees programming and statistical exper-
tise as the foundation for data scientists
but also states that “a strong background
in business and strategy can help jetti-
son a younger scientist’s career to the next
level.” Chen et al. (2012) argue that BI
and analytics professionals “must be ca-
pable of understanding the business is-
sues” and at the same time capable of
“framing the appropriate analytical solu-
tions” to provide useful decision-making
support. Wixom et al. (2011) analyze ex-
isting BI-related university programs and
courses and conclude that the BI pro-
gram of the future should include both
business and technical courses, including
at least an understanding of data man-
agement, functional business knowledge,
statistics and quantitative analysis, and
communication and visualization skills,
in order to address the widest scale of
industry needs. The empirical evidence
we provide with this study underscores Fig. 4 Similarities and differences in BI and BD areas of competency
these arguments and should encourage
IS scholars to develop inter-disciplinary software development skills and statisti- consistently emphasizes its variety, sug-
programs and courses to prepare “the cal knowledge, whereas BI jobs required gesting that big data does not refer to re-
next generation of analytical thinkers” much less “programming” and statisti- lational data managed in enterprise sys-
(Chen et al. 2012). cal knowledge. While BD jobs demand tems or data warehouses but to streams of
We also showed that there are con- quantitative analysis and machine learn- data in various formats and from various
siderable differences between BI skills ing skills, there is no explicit mentioning sources (Davenport et al. 2012), mostly
and BD skills. The extracted BI compe- of such terms in BI jobs. the Internet. Because of this variety of
tency requirements feature skills related Why did we find such differences, al- data, big data analytics solutions rely
to commercial products of large software though both BI and BD focus on sup- less on standard software products than
vendors, whereas no BD skills descrip- porting decision-making through quan- they do on custom-made solutions. Sec-
tors refer to one of the large BI ven- titative analysis of data? There are two ond, current big data projects seek an-
dors. In addition, almost 70 % of the possible explanations for this finding: swers to highly specialized questions and
BD jobs we analyzed asked for strong First, the emerging literature on big data are often more comparable to research

Business & Information Systems Engineering


BISE – RESEARCH PAPER

projects than to traditional IT projects


(Marchand and Peppard 2013). Because
of this variety of questions, big data an-
alytics solutions require more tailored
software tools and better methodologi-
cal skills than traditional BI does. What-
ever the explanation, our observation is
in line with Golden’s (2013) argument
that big data investments will be open-
source. Even though large vendors like
SAP are developing analytical solutions
like SAP HANA (vom Brocke et al. 2014)
that will offer “predictive analytics, text
and big data in a single package” (SAP Fig. 5 Google search volume for the search queries “business intelligence” and “big
2014), our analysis shows that the BD job data” (Source: Google Trends)
market does not yet ask for experts in the
use of these tools.
We also found that the demand for practical skills concerning major BI plat- Our research contributes to the scien-
BI competencies is still far bigger than forms. Supporting decision-making by tific body of knowledge on BI and BD
that for BD competencies, as we found extracting knowledge from web-based and has several implications for practi-
three times more job ads containing the and unstructured content (i.e., second- tioners. By uncovering highly demanded
term “business intelligence” than we did wave BI&A) still seems to be in its in- skill sets for BI and BD experts, we
job ads containing the term “big data.” fancy, as we found no factor related to complement existing scientific work on
This finding might be surprising given text mining, web mining, or social net- BI/BD maturity models. The empirically
the current media excitement around big work analytics, although some of these grounded taxonomies we developed can
data (cf. Fig. 5), but our empirical re- terms were scattered among the high- be used as a foundation for future em-
sults suggest that most companies are still loading terms that described BD factors. pirical studies on BI and BD, such as
working on advancing the maturity of These results are surprising, as many ex- efforts to develop measurement instru-
their internal BI and are not yet seeking perts point out that these techniques are ments for studying BI and BD profession-
to exploit big data. at the core of big data analytics. Finally, als or teams. Our findings also inform the
Highlighting our results against the our analysis did not produce evidence assessment and development of BI and
background of the resource-based view that industry currently demands third- BD curricula. As numerous practitioners
of the firm (i.e., the TIR and HIR men- wave BI&A competencies (i.e., mobile and researchers have pointed out, under-
tioned in Sect. 2), we argue that BI and sensor data), a finding that disagrees graduate and graduate programs should
implementations rely heavily on well- with Chen et al. (2012). be created or modified in order to sat-
established TIR, as BI platform ven- isfy industry’s high demand for analytical
dors already provide them. Significant skills, especially in the areas of software
amounts of knowledge have already been 7 Conclusion engineering, statistics, and business skills.
built into the technology itself, and it is Practice may benefit from this study in
at a mature state and easily deployed in This paper set out to shed light on the two ways. At an individual level, our re-
a company. HIR are only required for topic of individual-level BI and BD com- sults provide guidance for individuals’
customizing and adapting the technology petencies. Given the lack of empirical re- professional development by, for exam-
to the organizational context. However, search in this area, we conducted an LSA ple, outlining pathways for career choices
BD still relies on basic TIR, such as pro- of 1357 BI-related and 450 BD-related and decisions about continuing educa-
gramming languages and plain database job ads harvested from the online em- tion. At an organizational level, the iden-
technologies, which require extensive ployment platform monster.com. By an- tified competencies can be used to inform
HIR in order to build the sophisticated, alyzing and interpreting the statistical re- strategic HR management (e.g., estab-
company-specific big data solutions that sults of the LSA, we developed BI and lishment of a BI/BD Center of Excellence)
may lead to temporary competitive ad- BD competency taxonomies. Our ma- and staffing decisions (e.g., for BI/BD
vantage. Therefore, we can conclude that jor findings are that (1) business knowl- projects). In particular, we advise orga-
BD initiatives are currently much more edge is as important as technical skills for nizations that want to engage in big data
human-capital-intensive than BI projects working successfully on BI and BD initia- analytics either to invest in building in-
are, so we call for further action in tives; (2) BI competency is characterized house software engineering and statistical
educating current and future employees. by skills related to commercial products skills or to collaborate with third parties
Contrasting our findings against the of large software vendors, whereas BD (e.g., universities) in order to obtain the
three evolutionary BI&A waves Chen jobs ask for strong software development required competencies.
et al. (2012) identify, we observe that and statistical skills; (3) the demand for As in all research, this study is not with-
the skills that are related to the first BI competencies is still far bigger than the out several limitations. First, our findings
wave (structured content residing in demand for BD competencies; and (4) are based on snapshots of the BI and BD
databases) are still the most frequently BD initiatives are currently much more job market taken in September 2013 and
demanded. Examples include conceptual human-capital-intensive than BI projects March 2014. To gain a more reliable pic-
knowledge about data warehousing and are. ture of knowledge and skill requirements

Business & Information Systems Engineering


BISE – RESEARCH PAPER

and track their development over time Chen H, Chiang R, Storey V (2012) Business in-
we plan to repeat the study presented telligence and analytics: from big data to Abstract
big impact. MIS Q 36:1165–1188
here regularly in the future. Second, our Coussement K, Poel den Van D (2008) Im- Stefan Debortoli, Oliver Müller,
data analysis used job advertisements to proving customer complaint management Jan vom Brocke
elaborate on the differences between BI by automatic email classification using lin-
guistic style features as predictors. Decis
and BD competencies, as it is reasonable Support Syst 44:870–882 Comparing Business
to assume that job advertisements act as Cox M, Ellsworth D (1997) Application-
proxies for a demand for human capi- controlled demand paging for out-of-
Intelligence and Big Data Skills
tal in industry and that they can provide core visualization. In: Proc 8th conf vis, A Text Mining Study Using Job
pp 235–244
insights into competency requirements. Davenport TH (2006) Competing on analytics. Advertisements
However, one must be aware that job ads Harv Bus Rev 84:98–107
do not always reflect an employer’s true Davenport TH, Patil D (2012) Data scientist. While many studies on big data an-
Harv Bus Rev 90:70–76
requirements, as the employer may ask Davenport TH, Barth P, Bean R (2012) How
alytics describe the data deluge and
for more competencies than can be rea- “big data” is different. MIT Sloan Manag Rev potential applications for such ana-
sonably expected from an applicant, or 54:22–24 lytics, the required skill set for deal-
they may use a specific vocabulary to pol- de Lange C (2013) So you want to be a ing with big data has not yet been
data scientist? Nat Jobs Blog http://blogs.
ish job ads so they are appealing to a cer- nature.com/naturejobs/2013/03/18/so- studied empirically. The difference be-
tain group of candidates. Such may be you-want-to-be-a-data-scientist. Accessed, tween big data (BD) and traditional
the case especially in the area of BI and 2013–05–2013-03 business intelligence (BI) is also heav-
Dhar V (2013) Data science and prediction.
BD, which lacks clear-cut definitions and Commun ACM 56:64–73 ily discussed among practitioners and
is full of industry jargon. While we ac- Dinter B (2012) The maturing of a business in- scholars. We conduct a latent seman-
knowledge that such biases may exist in telligence maturity model. In: Proc Am conf tic analysis (LSA) on job advertisements
our data, we believe that the number of inf syst, Seattle, pp 1–10
Eckerson W (2004) Gauge your data ware- harvested from the online employment
job ads that we examined should be suf- house maturity. http://www.information- platform monster.com to extract infor-
ficient to minimize the effect of biases management.com/issues/20041101/ mation about the knowledge and skill
in a few ads. The processing of such a 1012391-1.html. Accessed 2013-04-29
Evangelopoulos N, Visinescu L (2012) Text- requirements for BD and BI profession-
broad data source as that used in this mining the voice of the people. Commun als. By analyzing and interpreting the
research gives a particular advantage to ACM 55:62–69 statistical results of the LSA, we de-
the approach we used over other research Evangelopoulos N, Zhang X, Prybutok VR velop a competency taxonomy for big
(2012) Latent semantic analysis: five
methods, such as interviews, because it methodological recommendations. Eur J data and business intelligence. Our ma-
diminishes the risk of biases caused by Inf Syst 21:70–86 jor findings are that (1) business knowl-
specific contextual backgrounds. Third, Forbes (2013) A very short history of big data. edge is as important as technical skills
our findings are limited to job markets http://www.forbes.com/sites/gilpress/
2013/05/09/a-very-short-history-of-big- for working successfully on BI and BD
in English-speaking countries because of data/. Accessed 2014-03-23 initiatives; (2) BI competency is char-
the nature of the text mining technique Gallivan MJ, Truex DP, Kvasny L (2004) Chang- acterized by skills related to commer-
we applied, which cannot process multi- ing patterns in IT skill sets: a content anal-
ysis of classified advertising. Data Base Adv cial products of large software vendors,
lingual texts. Future studies may look at Inf Syst 35:64–87 whereas BD jobs ask for strong soft-
job markets in other major language re- Golden B (2013) Does big data spell the end ware development and statistical skills;
gions (e.g., Spanish, French, Portuguese, of business intelligence as we know it?
In: CIO. Accessed 2013-10-04. http://www. (3) the demand for BI competencies is
German, Russian, Hindustani, Mandarin cio.com/article/print/730774 still far bigger than the demand for BD
Chinese). Finally, our study is induc- Herbst A, Simons A, vom Brocke J et al (2014) competencies; and (4) BD initiatives are
tive and exploratory in nature, so future Identifying and characterizing topics in en-
currently much more human-capital-
confirmatory research (e.g., surveys) is terprise content management: a latent se-
mantic analysis of vendor case studies. In: intensive than BI projects are. Our find-
needed in order to test and refine our Proc 22nd Eur conf inf syst ings can guide individual professionals,
results. Hostmann B, Hagerty J (2010) IT-score organizations, and academic institu-
overview for business intelligence and
performance management. http://www. tions in assessing and advancing their
gartner.com/id=1433813. Accessed 2013- BD and BI competencies.
References 04-29
IBM (2012) Analytics: the real-world use of big Keywords: Big data, Business intelli-
Barney J (1991) Firm resources and sustained data. http://www-935.ibm.com/services/ gence, Competencies, Latent semantic
competitive advantage. J Manage 17:99– us/gbs/thoughtleadership/ibv-big-data-at- analysis, Text mining
120 work.html. Accessed 2013-10-04
Buhl HU (2013) Interview with Martin Petry on Lahrmann G, Marx F, Winter R, Wortmann F
”Big data“. Bus Inf Syst Eng 5:101–102 (2011) Business intelligence maturity: de-
Buhl HU, Röglinger M, Moser F, Heidemann J velopment and evaluation of a theoreti-
(2013) Big data. Bus Inf Syst Eng 5:65–69 cal model. In: Proc Hawaii int conf syst sci,
Burton B, Geishecker L, Hostmann B et al Koloa, pp 1–10
(2006) Organizational structure: business Landauer TK, Foltz PW, Laham D (1998) An
intelligence and information management introduction to latent semantic analysis.
pp 1–11 Discourse Process 25:259–284
Casey T, Krishnamurthy K, Abezgauz B Laney D (2001) 3D data management: con-
(2013) Who should own big data? trolling data volume, velocity, and variety.
strategy+business In: META Gr. http://blogs.gartner.com/
Cates J, Gill S, Zeituny N (2005) The ladder doug-laney/files/2012/01/ad949-3D-Data-
of business intelligence (LOBI): a framework Management-Controlling-Data-Volume-
for enterprise IT planning and architecture. Velocity-and-Variety.pdf. Accessed 2013-10
Int J Bus Inf Syst 1:220–238 -04

Business & Information Systems Engineering


BISE – RESEARCH PAPER

Larsen KR, Monarchi DE, Hovorka DS, Bai- forget-big-data-small-data-is-the-real- Schlegel K, Sallam RS, Yuen D, Tapadinhas Y
ley CN (2008) Analyzing unstructured text revolution/. Accessed 2013-10-04 (2013) Magic quadrant for business intelli-
data: using latent categorization to iden- Power D (2007) A brief history of decision gence platforms. http://www.gartner.com/
tify intellectual communities in informa- support systems technology/reprints.do?id=1-1DZLPEP&ct
tion systems. Decis Support Syst 45:884– Provost F, Fawcett T (2013) Data science and =130207&st=sb. Accessed 2013-04-29
896 its relationship to big data and data-driven Sidorova A, Evangelopoulos N, Valacich JS,
LaValle S, Lesser E, Shockley R et al (2011) Big decision making. Big Data 1:51–59 Ramakrishnan T (2008) Uncovering the in-
data, analytics and the path from insights Reuter N, Vakulenko S, vom Brocke J et al tellectual core of the information systems
to value big data, analytics and the path (2014) Identifying the role of information discipline. MIS Q 32:467–482
from insights to value. MIT Sloan Manag systems in achieving energy-related envi-
ronmental sustainability using text mining. Todd PA, McKeen JD, Gallupe RB (1995) The
Rev 52:21–31 evolution of IS job skills: a content analysis
Litecky C, Aken A (2010) Mining for comput- In: Proc 22nd Eur conf inf syst
Van Roekel H, Linders J, Raja K et al (2009) The of IS job advertisements from 1970 to 1990.
ing jobs. IEEE Softw 27:78–85 MIS Q 19:1–27
BI framework: how to turn information into
Loukides M (2012) What is data science? a competitive asset. http://www.tdwi.eu/ Varon E (2012) Rethink your org chart for big
O’Reilly Media, Sebastopol wissen/whitepaper/?no_cache=1&tx_ data analytics teams. http://data-informed.
Manning CD, Raghavan P, Schutze H (2008) mwknowledgebase_pi1[showUid]=45. com/rethink-your-org-chart-for-big-data-
Introduction to information retrieval. Cam- Accessed 2013-04-29 analytics-teams/. Accessed 2014-03-13
bridge University Press, New York Russell S, Haddad M, Bruni M, Granger M vom Brocke J, Debortoli S, Müller O, Reuter
Manyika J, Chui M, Brown B et al (2011) (2010) Organic evolution and the capability N (2014) How in-memory technology can
Big data: the next frontier for innovation, maturity of business intelligence. In: Proc create business value: insights from the
competition, and productivity, pp 1–156 Am conf inf syst Hilti case. Commun Assoc Inf Syst 34:151–
Marchand D, Peppard J (2013) Why IT fumbles Russom P (2011) Big data analytics. TDWI Res 168
analytics. Harv Bus Rev 91:104–112 Salton G, Wong A, Yang CS (1975) A vec- Waller Ma, Fawcett SE (2013) Data science,
Melville N, Kraemer KL, Gurbaxani V (2004) tor space model for automatic indexing. predictive analytics, and big data: a revolu-
Review: information technology and or- Commun ACM 18:613–620
ganizational performance: an integrative SAP (2014) SAP HANA integrates predictive tion that will transform supply chain design
model of IT business value. MIS Q 28:283– analytics, text and big data in a single and management. J Bus Logist 34:77–84
322 package. http://www.sap.com/pc/tech/in- Wixom B, Ariyachandra T, Goul M et al (2011)
Polock R (2013) Forget big data, small data memory-computing-hana/software/ The current state of business intelligence in
is the real revolution. Open Knowl Found analytics/big-data.html. Accessed 2014-03- academia. Commun Assoc Inf Syst 29:299–
Blog http://blog.okfn.org/2013/04/22/ 23 312

Business & Information Systems Engineering

You might also like