Professional Documents
Culture Documents
2. Context stall programs (if they have not learned this by them-
selves before); and they learn to create and query
2.1. Secondary Students and CS databases using SQL. If the school is large, and there
The last 4-5 years have seen a steady decrease in the are enough interested students, and a sufficiently pas-
quantity of applications to university-level CS studies sionate teacher is around, even a further module may
in the most of Europe and North America. Partly be offered with advanced material: students get to
(but only partly) because of this, the average quality write Java or C programs with a GUI, to install and
of applicants seems to be lowering too: good students manage a whole Linux, and to create databases that
that would have chosen CS a few years ago seem to be can be queried via web using PHP.
choosing other studies now. If this trend persists, it This effort, commendable as it is, transmits the stu-
poses a very serious threat to the growth of IT industry dent the idea that CS equals programming, and that
in Europe and, therefore, a problem for society as a the challenges in CS are writing larger and larger pro-
whole. grams and managing bigger and bigger systems. This
There are many factors explaining this drop, and a de- does not necessarily make the discipline attractive, es-
tailed analysis or diagnostic is out of the scope of this pecially if the potential applications of such larger pro-
note (and of my ability). It is safe to say, however, that grams and systems are not explained at the same time.
important factors are a series of damaging and com- Also, in most cases the teachers for these courses have
mon social (mis)conceptions about CS and CS-related had no formal training on CS themselves. In this re-
professional careers. Some examples are: Computing spect, initiatives that aim at familiarizing high-school
professionals have less social “prestige” than other pro- teachers with Computer Science university-level stud-
fessionals (doctors, architects, other types of engineers, ies are extremely important. For example, I have re-
etc.); CS professionals work long hours in stressing but cently learned about the successful CS4HS Workshops
boring projects; after the dot-com bubble, the career organized by Carnegie-Mellon University (Carnegie-
market is not expanding anymore, and jobs are unsta- Mellon University, 2006 2008).
ble and underpaid, etc.
Actually, these lines were partly inspired by a similar
Without discussing truth or falsity of these beliefs, I (though at a smaller scale) initiative at my school, the
want to focus on one in particular which, in my opin- Barcelona School of Informatics (www.fib.upc.edu).
ion, is driving away many good students with a sci- The school organized a meeting open to all secondary-
entifically oriented mind: the conception that CS as education teachers, to exchange views and explore pos-
a subject deals mostly with routine business applica- sible cooperations. As one of the points in the agenda,
tions, or, at best, with compulsory, hacker-type, pro- the school wanted to ask the teachers what could uni-
gramming. There is no real challenge remotely com- versities do to help them encouraging students to pur-
parable to those found in mathematics, medicine, bi- sue computing studies. Can we provide teacher train-
ology, or physics. Indeed, how can CS provide excite- ing? Can we offer disk space for file and webhosting?
ment like that of understanding how the cell works, Should we visit high schools more often and give flashy
creating the cure for cancer, or finding the traces of talks? Send monthly bulletins via email?
the Big Bang in deep space?
The answer from the teachers was clear. Plainly ex-
2.2. CS in High School pressed: “Well, all of this would be great. But if you
really, really want to help us, please provide us with
Nowadays, most primary and high-school students in interesting materials, lesson plans, and activities. Ide-
the EU area do get (or should get in the next decade) ally, it’s stuff we can download, unpack, and bring to
the basic computing skills that all citizens will be the classroom essentially as-it-is. We spend a lot of
pretty soon expected to have (writing documents, us- our time preparing material for our regular courses.
ing spreadsheets or similar tools, organizing informa- We have neither the time nor the knowledge to pre-
tion and searching it in the web, etc). These skills are pare materials and activities to introduce our students
acquired either in specific IT courses or instrumentally to Computer Science”. I have heard similar comments
within other courses. several times. It is a still a small sample, and restricted
Many schools offer interested students further optional to the Spanish context, but I have no reason to believe
modules introducing “real computing”, namely. . . pro- that the answer would be substantially different else-
gramming!! Students learn the syntax of Java or C and where.
use them to write a few programs (even surprisingly
complex ones); they learn how to download and in-
Machine Learning in Secondary Education?
3. Why Machine Learning is so stood at the high-school level, and whose “outcome”
Appropriate? students certainly find exciting. However, it is not
easy to do much in Robotics and Computer Graphics
In my opinion, the Machine Learning community can without programming. In contrast, I believe that it is
help (therefore: should help) with the student enrol- possible to design a good number of sessions and ac-
ment crisis more than most other fields of Computer tivities using pre-programmed Machine Learning tools
Science. that do not require the students to write any programs
I believe it is possible to explain to high-school stu- at all. Machine Learning therefore helps conveying the
dents that CS in general, and Machine Learning in idea that Computer Science is not exclusively about
particular, is a challenging, exciting field. In particu- writing programs.
lar, that progress on the “big problems” mentioned
above (understanding the cell, curing cancer, find- 4. What Machine Learning can be
ing the black matter among galaxies) strictly depends Taught in High School?
today on cutting-edge CS and Machine Learning re-
search. And that as time goes, the number of challeng- We arrive now to the complicated details: what exactly
ing application fields keeps growing. Certainly, not all could be taught in such module or modules? Admit-
computing related jobs are equally rewarding intellec- tedly, these are only very preliminary thoughts – but
tually — but neither do all jobs that engineers, medical I am very much willing to discuss any ideas.
doctors, architects, or biologists get. The basic techniques that could be explained and il-
More in particular, not many subfields of CS have the lustrated include at least:
following two features, which I think are true of Ma-
chine Learning: 1) It is possible to explain some of • k-nearest neighbors classifiers
the foundations of the field with the mathematical
knowledge that (at least in theory) high-school stu-
• Basic linear classifiers (say, the perceptron and
dents have. And 2) there are applications, from sim-
possibly winnow)
ple ones to extremely challenging ones, that can be
explained in rigorous but understandable ways.
• Naive Bayes
Concerning 1), “the basic math behind ML can be ex-
plained”, note that basic concepts of linear algebra, • Simple clustering algorithms (k-means and ag-
vectors and vector operations, probability, and statis- glomerative algorithms)
tics suffice to explain how simple predictive models
and simple unsupervised learning methods work, as • Frequent set and association rule mining
well as the notions of classification error, testing, cross-
validation, etc. Teachers may in fact welcome this as
Note again that only basic vector terminology, proba-
a way of reinforcing the math learned elsewhere and
bility (up to Bayes’ Rule) and the notion of implication
showing that it is not completely useless.
are strictly necessary to understand how the methods
Concerning 2), “the challenges can be explained”, we work. Ideally, the principle behind each method is
could all produce examples where Machine Learning explained to the students, who can then experiment
helps genetics, drug design, cancer research, medical with it using a given implementation. Later, more ad-
diagnosis, remote homecare for aged people, domotics, vanced methods can be provided as black boxes, with
astronomy, not to mention economics and business. less or no detail about their inner workings, to deal
with larger or more complex problems.
For comparison, consider how hard it would be to
explain to high-school students the methods, goals, A critical issue is to come up with the right examples
and possible benefits of, say, theory of computation; (scenarios, challenges, datasets). The difficulty is that
programming language design and semantics; super- the problem should be realistic enough to interest the
computing; VLSI circuit design; software engineering; students and the same time be “easy” enough that
operating system internals; networking protocols and they can obtain meaningful results in reasonable time,
standards; database management system implementa- before they find the task boring or frustrating. Iden-
tion, and so on. tifiying, preparing, and testing these examples would
be one of the longest and most delicate stages of a
Robotics and Computer Graphics are two other sub-
possible “Machine Learning for Secondary Education”
disciplines of CS whose very basic math can be under-
project.
Machine Learning in Secondary Education?
5. Thoughts for a Project the design, dissemination, and validation of the teach-
ing modules. Eventually, it is them who have to say
In this section I collect a few (incomplete, disparate, whether the product is interesting, adequate for real-
randomly ordered) thoughts about a possible project istic high-school education, technologically and peda-
aimed at developing teaching materials introducing as- gogically manageable, etc.
pects of Machine Learning in secondary education.
It would of course be advisable to look for cooperation
I specifically think of a project created as an initia- or co-coordination with the local educational author-
tive within the PASCAL2 NOE, probably within its ities at the most active sites. On the one hand, their
Curriculum Development programme, with coopera- endorsement would probably encourage more schools
tion from other agents. PASCAL2 is probably the to join in the experience (although, as I have expressed
largest organized group of Machine Learning experts before, I am optimistic that the initiative would be
in Europe, so it is a natural source of scientific ex- welcome by many teachers anyway). On the other
pertise for such a project. It has the critical mass hand, they could probably facilitate the contact with
and the sufficient breadth to help deploy and test the the most appropriate schools, contacts that PASCAL2
project in a significant number of contexts within Eu- researchers will not have in general. Finally, educa-
rope – even though probably a very limited number tional agencies may be aware of other initiatives aimed
of PASCAL2 sites would be involved in the actual de- at promoting technical studies among high-school stu-
velopment phases. Experiences such as the Videolec- dents — for example, one such initiative is being set
tures website in its predecessor PASCAL network have up right now by the regional government of Catalonia,
shown the tremendous potential of cooperation within where the crisis seems to be particularly acute.
the network.
Some of the obvious technical requisites of the mate- References
rials produced would be:
Carnegie-Mellon University (2006-2008). CS4HS: Ex-
plorations in computer science for high school teach-
• They certainly should be free (as in “free lunch”).
ers. http://www.cs.cmu.edu/cs4hs/.
Although, in principle, materials should be us-
able without students or teachers having to do
any programming, it is just a matter of time un-
til some who know how to program feel like ex-
tending or modifying the software, and therefore
being open-source is essentially a must. A num-
ber of open-source machine learning packages ex-
ist which could, or not, be used as a basis.
• The software should be adaptable to a huge va-
riety of contexts: hardware, operating systems,
content / courseware management systems, e-
learning platforms, etc.
• In particular, if the project is indeed intending to
be tested in several countries, localization would
be a major issue. Careful design should ensure
that porting to new languages is easy. Local-
ization to different educational systems (with dif-
ferent curricula, different teaching cultures, etc.)
would be an important challenge.