È Tutto Un Magna Magna

Saturday 6 June 2015

Gianluca is back! (or: PhD course on Micro-task crowdsourcing)

It is with much pleasure that I welcome Gianluca Demartini back at Udine University, Department of Mathematics and Computer Science, for a PhD course on Micro-task crowdsourcing. Gianluca got his Master's degree under my supervision some years ago (don't ask!) and since then he has obtained several research positions abroad. He is now Lecturer at the University of Sheffield.

The lecture timetable is below, together with a tentative program; everyone is welcome!

Timetable (all lectures are in the "Aula Multimediale / DIMI"):

1. Monday 15/6 10:00 - 12:00
2. Monday 15/6 15:00 - 17:00
3. Tuesday 16/6 10:00 - 12:00
4. Tuesday 16/6 15:00 - 17:00
5. Wednesday 17/6 10:00 - 12:00
6. Wednesday 17/6 15:00 - 17:00
7. Thursday 18/6 10:00 - 12:00

Preliminary/tentative program:

Lecture 1 - Introduction to Crowdsourcing
We will start with an overview of the entire module highlighting its aims and objectives. Then, we will look at fundamental definitions and different types of crowdsourcing incentives. Finally, we will present early examples of crowdsourcing such as reCAPTCHA and the ESP game.

Lecture 2 - Introduction to Micro-task Crowdsourcing Platforms
After defining the key terminology of micro-task crowdsourcing, we will introduce popular crowdsourcing platforms such as Amazon MTurk and CrowdFlower including a demonstration on how to use such systems both as a crowd worker as well as a requester.

Lecture 3 - How to Setup a Crowdsourcing Task
In this lecture we will discuss all the dimensions involved in crowdsourcing task design such as pricing, question design, and quality assurance mechanisms (e.g., honeypots). We will also design and deploy a task during the lecture and see how to collect results back from the crowdsourcing platform.

Lecture 4 - Crowdsourcing Patterns
In this lecture we will define the concept of crowdsourcing pattern (i.e., the combination of multiple crowdsourcing tasks) and present popular example patterns. We will also discuss the concept of crowdsourcing workflows where multiple tasks as well as machine processing steps are combined together.

Lecture 5 - Hybrid Human-machine Systems
In this lecture we will see some advanced example uses of crowdsourcing applied to the database, web, and biomedical domains. We will see how systems that combine both the scalability of machines over large amounts of data as well as the quality of human intelligence can be used to improve the effectiveness of Big Data systems.

Lecture 6 - Crowdsourcing Scalability
In hybrid human-machine systems the latency bottleneck lays on the side of the crowd as human intelligence is naturally slower than machine-based computation. In this lecture we will see recent research results that proposed techniques to improve the latency of crowdsourcing platforms.

Lecture 7 - Open Research Directions in Crowdsourcing
In this lecture we will give an overview on which micro-task crowdsourcing research questions different Computer Science areas focus on including database, information retrieval, semantic web, human-computer interaction, multimedia, and bio-informatics.

Prerequisites:
No prior knowledge is needed. Having at least basic programming skills is a plus.

S.

Friday 29 March 2013

Postdoc position

In the context of a Google Faculty Research Award (http://research.google.com/university/relations/research_awards.html), we are seeking a post doctoral researcher at Udine University (Udine, Italy). The title of the project is:

Axiometrics: Foundations of Evaluation Metrics in IR.

Axiometrics is one of the most important research directions proposed during the SWIRL 2012 meeting (http://www.cs.rmit.edu.au/swirl12/). A slightly more detailed description is at the end of this announcement. This is a joint project, involving:

Stefano Mizzaro (Principal Investigator), Dept. of Maths and Computer Science, University of Udine, Italy, mizzaro_foo@uniud.it.

Julio Gonzalo (co-Principal Investigator) ed Enrique Amigó, E.T.S.I. Informática de la UNED, Madrid, Spain, julio_foo@lsi.uned.es, enrique_foo@lsi.uned.es.

Evangelos Kanoulas (Google sponsor), Google Zurigo, ekanoulas_foo@gmail.com.

[you'll need to remove some characters from the email address -- unless you're a spammer :)] Feel free to contact me for further information, informal enquires, etc. The position is for one year (renewable if further funds are available). Salary is around 1900 euros/month. The activity will start in May/June 2013. The formal announcement in Italian is at http://www.uniud.it/ateneo/normativa/albo_ufficiale/183-2013 (# 20).

*** Application deadline: 16 April 2013. ***

Short project description

Effectiveness evaluation is of paramount importance in the field of Information Retrieval (IR). IR is probably the most evaluation-oriented field in computer science, as witnessed by an evaluation methodology developed in the 60s during the Cranfield experiments and by several evaluation initiatives running today (TREC, CLEF, NTCIR, INEX, FIRE). One crucial aspect of evaluation are evaluation metrics. About 100 IR effectiveness metrics exist, and counting. This project aims at understanding the relationships among them, in terms of both axiomatic properties and statistical relations, for both metric science (understanding of metrics) and engineering (their development). More in detail, we aim at proposing:

Axioms: rules that any metric must satisfy. For example, when swapping a relevant document and a non-relevant one in the ranking, by decreasing the rank of the relevant one and increasing the rank of the non relevant one, the metric value should decrease. Axioms might be verified and compared with user intuition using crowdsourcing.

Desiderata: desirable properties that a metric should have, based on common sense. For example, an effectiveness value according to one metric should be affected more by a swap in earlier rank positions than a swap in later ranks.

Empirical properties: those that emerge from data, i.e., from actual test collections and system comparisons. These include robustness, statistical correlation between metrics, etc.

Stefano Mizzaro
www.dimi.uniud.it/mizzaro

Sunday 17 March 2013

Computer science education

Un post interessante...

S.

Wednesday 27 February 2013

Dottorato in informatica: in memoriam?

Fra i 21 dottorati di ricerca dell'Universita' di Udine c'e' un dottorato in Informatica. A causa di un recente decreto, i 21 dottorati attuali verranno ridotti a circa 10. Semplificando un po': ogni dottorato deve avere 6 borse, e a UD sembra che ci siano circa 60 borse in totale; ci devono essere 16 membri del collegio docenti "sopra le mediane" (...), di cui al massimo 4 ricercatori; e altro.

Le soluzioni che oggi ci sono state proposte dal coordinatore del dottorato di informatica sono:

1) dottorato in Informatica e Matematica (con i nomi in qualche ordine);

2) dottorato in Informatica, Matematica e Fisica (con i nomi in qualche ordine).

A me sembra una follia l'eliminare, o anche solo lo sminuire, il dottorato di Informatica. Lo e', secondo me, sia da un punto di vista storico (Udine e' stata una delle prime sedi della laurea in informatica e il dottorato di Informatica esiste fin dal primo ciclo, anche se nei primi anni e' stato in consorzio), sia geografico (non ci sono altri dottorati in regione, per i piu' vicini bisogna andare a VE, PD, VR), sia culturale (importanza dell'informatica oggi ecc. ecc.).

E' sostenibile un dottorato di Informatica? Secondo me si' (sarebbe da fare conti precisi ma non ne ho ne' il tempo, ne' la voglia; ne' e' compito mio...). Ci sono 4 dottorati da "cannibalizzare": Informatica, Matematica e fisica, Ingegneria dell'informazione, Comunicazione multimediale, sia in termini di borse sia come docenti. Le borse "portate in dote" dai 4 dottorati potrebbero essere 3/4+1+1+1 = 6/7. A Udine siamo 40 docenti di informatica (INF/01 e ING-INF/05), piu' altri di settori affini (matematici applicati, ingegneria dell'informazione, ...). E fra 40 non riusciamo a trovarne 16 sopra le mediane??

Io farei una "battaglia" con gli organi di governo del tipo: noi proponiamo che ci sia un dottorato di informatica fatto cosi'; supportateci (non tanto economicamente, ma politicamente: e' una scelta strategica che va fatta a livello di governance). Se invece non lo volete, ditecelo chiaramente e assumetevene le responsabilita', e allora si vedra' che fare. Ma che risulti chiaro di chi e' la responsabilita' di "ammazzare" il dottorato in Informatica.

Mi lascia quantomeno perplesso che questa battaglia non si voglia fare...

S.

P.S. E questa sara' probabilmente un'altra occasione persa, dopo non aver fatto la Facolta' di Informatica e il Dipartimento di Informatica...

Sunday 13 January 2013

Cerchi lavoro?

A breve bandirò un assegno di ricerca annuale per lavorare al progetto Axiometrics: Foundations of Evaluation Metrics in IR. Il progetto è finanziato da un Google Faculty Research Award (http://research.google.com/university/relations/research_awards.html).

Abstract del progetto Axiometrics
Effectiveness evaluation is of paramount importance in the field of Information Retrieval (IR). IR is probably the most evaluation-oriented field in computer science, as witnessed by an evaluation methodology developed in the 60s during the Cranfield experiments and by several evaluation initiatives running today (TREC, CLEF, NTCIR, INEX, FIRE). One crucial aspect of evaluation are evaluation metrics. About 100 IR effectiveness metrics exist, and counting. This project aims at understanding the relationships among them, in terms of both axiomatic properties and statistical relations, for both metric science (understanding of metrics) and engineering (their development). More in detail, we aim at proposing:

Axioms: rules that any metric must satisfy. For example, when swapping a relevant document and a non-relevant one in the ranking, by decreasing the rank of the relevant one and increasing the rank of the non relevant one, the metric value should decrease. Axioms might be verified and compared with user intuition using crowdsourcing.
Desiderata: desirable properties that a metric should have, based on common sense. For example, an effectiveness value according to one metric should be affected more by a swap in earlier rank positions than a swap in later ranks.
Empirical properties: those that emerge from data, i.e., from actual test collections and system comparisons. These include robustness, statistical correlation between metrics, etc.

Axiometrics è una delle linee di ricerca più importanti per l’IR proposte durante il meeting SWIRL 2012 (http://www.cs.rmit.edu.au/swirl12/). L’attività è svolta in collaborazione con:

Stefano Mizzaro (Principal Investigator), Dept. of Maths and Computer Science, University of Udine, Italy, mizzaro_pippo@uniud.it.
Julio Gonzalo (co-Principal Investigator) ed Enrique Amigó, E.T.S.I. Informática de la UNED, Madrid, Spain, julio_pippo@lsi.uned.es, enrique_pippo@lsi.uned.es.
Evangelos Kanoulas (Google sponsor), Google Zurigo, ekanoulas_pippo@gmail.com.

[ovviamente gli indirizzi email contengono alcuni caratteri in più da rimuovere, a meno che tu non sia uno spammer :)]
Se pensi che ti possa interessare, contattami.

Stefano Mizzaro
www.dimi.uniud.it/~mizzaro

Friday 7 December 2012

Tempi moderni

Da questo mese il mio ateneo ha organizzato due iniziative, potenzialmente sensate:

1. Una comunicazione settimanale con le iniziative di ateneo della settimana seguente;

2. Una "Newsletter" mensile con notizie di interesse della vita di ateneo.

"Ah, hanno fatto un blog", direte voi... Eh no:

1. La comunicazione settimanale arriva sotto forma di un link, spedito via email il venerdì, da cui scaricare un file pdf tutto bello formattato con le varie iniziative della settimana seguente.

2. (non ho ancora ricevuto la Newsletter ma mi aspetto che) la Newsletter venga spedita con modalità analoghe. In più, se si vuole pubblicare qualcosa nella Newsletter, bisogna spedire un messaggio di posta elettronica alla propria segretaria di dipartimento, che poi lo inoltra a qualcuno che, di nuovo, lo formatterà, ecc. ecc.

Probabilmente non gli è venuto in mente ma con ciclostile, telegramma, fotocopiatrice e fax si poteva fare anche qualcosa di meglio... Apprezzo la volontà di trasparenza (era ora... ma questo è un discorso lungo...), ma non bastava fare un blog con accesso riservato ai docenti dell'università? (e magari a qualcuno che raccogliesse comunicazioni da studenti e/o esterni). Con i feed le comunicazioni arrivano ben prima, si risparmiava lavoro manuale inutile, e introdurre scadenze inutili per far aspettare le notizie un mese o una settimana non ha nessun senso, mi pare. Bah. Viva il Web.

S.

P.S. E io continuo a pensarla così.

Wednesday 5 December 2012

Sempre riguardo al vulcano...

... di cui scrivevo tempo fa, qui ci sono un po' di considerazioni. E di precedenti...

S.

Monday 3 December 2012

Rassegna stampa

Sono finito su "la Repubblica" grazie alla mia pagina "Come-non-si-scrive-la-tesi", ma...

Bah. Meglio google
S.

Wednesday 12 September 2012

Alcune bufale sull'università italiana

Alberto Baccini su ROARS elenca alcune affermazioni sull'università italiana, affermazioni che si sentono spesso e che sono, come dice appunto Baccini, false e/o distorte -- insomma, "bufale":

Il contesto. Ecco ciò-che-tutti-sanno-dell’università-italiana (poco importa che alcuni punti dello scenario siano sostanzialmente falsi e altri ingigantiti e distorti):
1. “La ricerca scientifica attraversa un periodo di stasi”, perché l’università produce poca ricerca [si legga qui e qui], ed è avviata al declino;
2. “La ricerca scientifica deve servire alla scienza e alle esigenze nazionali. Non deve servire a creare nuove cattedre e nuovi insegnamenti.” Il declino della ricerca italiana è causato dall’autoreferenzialità dei baroni.
3. Il sistema di reclutamento è distorto e corrotto da nepotismo e clientele. Il merito è mortificato.
4. All’università italiana e alla ricerca non mancano le risorse. La ricerca condotta dai baroni è spesso inutile per la società ed autoreferenziale.
5. I baroni hanno stipendi tra i più alti al mondo.
Da questo segue che l’università italiana è irriformabile con gli strumenti legislativi usuali; c’è bisogno di una rivoluzione (lo sostiene per esempio Andrea Ichino). La politica ha il compito di individuare una élite accademica illuminata e d’avanguardia che possa modificare dall’alto il funzionamento della università e della ricerca italiana. I due snodi fondamentali sono finanziamento e reclutamento. Lo strumento istituzionale è l’ANVUR: un organismo tecnico di nomina ministeriale, lasciato incompiuto dal governo di centro-sinistra, cui vengono attribuiti poteri (oltre a molti altri) su valutazione e criteri per il reclutamento.

Wednesday 18 July 2012

SA, CdA e Informatica

Mah. Leggo i nomi dei componenti del nuovo SA e del nuovo CdA e resto un po' perplesso. Capisco che in precedenza abbiamo avuto un rettore informatico e che un po' di alternanza non guasta, ma secondo me, in questo periodo in cui l'informatica sta avendo un'esplosione a livello planetario, non aver nessun informatico applicativo né nel SA, né nel CdA, né nella "squadra del rettore", fra i delegati, mi pare una grossa stupidaggine. Rischiamo di perdere un sacco di occasioni. Sperin ben...

S.

Readersourcing - some stuff

Just to collect some thoughts:

The official version of my "Readersourcing - A Manifesto" paper is now available on publisher's website.
There seems to be an increasing interest on issues related to peer review and scholarly publishing:

M. Tamer Özsu writes on the ACM SIGMOD blog;
Moshe Y. Vardi continues to write editorials on CACM: 1, 2, 3.

Elsevier asks to the crowd for good ideas.
I came across to this other website.

I admit that I still have to read everything but I have the feeling that:

most people aim at a system evolution; Readersourcing is a revolution-oriente approach. Not sure which is wiser...
Bits and pieces of the whole Readersourcing idea can be found here and there (e.g., in several of Elsevier's challenge ideas).

Future will tell...

Wednesday 4 July 2012

"No flashy. This is Science." [censored] -- Or: How two jetlagged IR researchers met and had an idea

[Prologue: the beginning of a beautiful friendship]

J. Hi. Can I sit here?
S. Sure. Hi. I don't think we met before. Nice to meet you, I'm S.
J. I'm J.
S. Nice place here, isn't it?
J. Yep! Beach this afternoon?
S. Sure!
J. Ok, let's pretend we do some work first. So, what are you working on?
S. Oh, several things, blablabla [snip] And you?
J. Well, I did blablabla... and blabla... and I also published some papers on formal analysis of clustering metrics.
S. Interesting. I also started something similar for IR metrics years ago, but I never managed to publish it.
J. Really?
S. Yep. I'll show you [opening his laptop]. See, I defined a framework based on measurement theory, then I defined some axioms...
J. That's crazy! I did exactly the same!
S. ... then some desiderata...
J. Exactly the same!
S. ... Yes but you published, I didn't...
J. ... because you tried to do everything, look here, that's crazy!
S. Yes, I'm a bit, sure... and then blablabla...
J. blablabla [very technical details here - ok, ok, I admit I do not remember that!]
S. blablabla [very technical details here - ok, ok, I admit I do not remember that!]
J. Well, why not proposing this as an idea for SWIRL research directions?
S. We might indeed!
J. Ok, beach time now.
S. Beach!

[Chapter 1: split groups doing Science]

M. So, let's have a round of the table and everyone presents his own idea.
[various good ideas...everyone presents just one, accordingly to the rules. But you know that Italians and Rules do not fit well in the same sentence, so...]
S. I do have two ideas. The first is not very exciting, but it's Science. The second is more exciting...
Others. Well, tell us both. [meaning: the usual Italian breaking rules...]
S. The first is about IR effectiveness metrics. We have around 100 metrics, counting the system-oriented ones only. The research project would be to find formal properties that each metric should satisfy.
Others. So do you mean...?
S. To define axioms that metrics should satisfy. For instance, first retrieved documents should weight more (or not less) than following docs, etc.
Others. Hm. What's that for?
S. Oh well, for instance we could understand why nDCG discount function is defined in that way...
Others. Cool. We need a name...
A. Axiometrics!
Everyone. That's great!!

[Chapter 2: Axiometrics]

S. Hi J.. Did you mention the idea about metrics in your group?
J. No, I didn't, and you?
S. Yes, I did, and guess it, it was selected!
J. Really?? Great!!
S. And A. invented a cool name: Axiometrics
J. That's wonderful!
S. Let's have some coffe, I'm still jetlagged.
J. Yes. And beach later. And beers.

[Chapter 3: Coffee and beers]

[after some hours, and coffee. And beach. And beers.]
S., J., A. M. N. Ok, let's write this short report about Axiometrics in SWIRL...
... well, let's have a short and last beach session first!

[Chapter 4: Planes, emails, and deadlines]

[After some long flights back home, tons of unanswered emails and student requests, expenses claim forms, etc. etc., and just a few spare days before the deadline...]
S. J., do you think it is reasonable to submit a Google research grant proposal on Axiometrics? Or is it a stupid idea?
J. That's a great idea!
S. Let's involve E. as well, he is interested.
[some hard science follows: bibliography is polished, CVs are created, margins and fonts are modified, PDFs generated...]

[Chapter 5: 4th of July]
Guys, we won!!

P.S. Thanks to Julio, Evangelos, Arjen, Marteen, Nicola. And to SWIRL and Mark!

S.

Wednesday 9 May 2012

CCL.zip.deluso.ancora / 2

E comunque, promemoria sui problemi di alcuni corsi a TWM:

Algoritmi e strutture dati. Non ha senso senza laboratorio.
Sistemi operativi. Anche qui sarebbe meglio il laboratorio, in modo che vedano il C, ma forse anche no.
Programmazione OO. Anche qui serve un laboratorio. Gli studenti non imparano a programmare OO. Non che sia facile, ma senza un lab. in cui mettere in pratica lo è ancora di meno...
Basi di dati. Evitare questioni teoriche sull'algebra relazionale. Invece fare DB XML, NoSQL, MapReduce, Hadoop, ...
Ingegneria del software. Probabilmente anche qui si può cambiare qualcosa. Che "software" scrive che si laurea in TWM? Applicazioncine Web, con uso di qualche framework stile MVC (Struts, Zend, ecc.), e/o qualche framework Javascript/AJaX (jQuery, ...), e/o un CMS. Che legami con la catena di corsi TW1-2-3? Con PSW (che secondo me è un po' l'"Ingegneria del software per TWM")? Con PAOO?

Se il prossimo anno con la primavera, oltre a pollini e rondini, arrivano anche le solite discussioni sulla Questione Programmazione, devo ricordarmi di sollevare anche queste. Anzi magari lo faccio comunque...

Friday 4 May 2012

We are sitting on a volcano...

... that is waking up...

S.

Tuesday 24 April 2012

CCL.zip.deluso.ancora

Con l'arrivo della primavera, come ogni anno (cfr. i post con etichetta CCL.zip degli ultimi 4 anni...) si è discusso per l'ennesima volta della "Questione Programmazione". Come al solito la discussione è stata spiacevole e come al solito alla fine abbiamo "vinto" noi, anche se è stata una vittoria risicata (per un voto) e con l'amaro in bocca perché "su queste cose non si deve arrivare a votare". E perché sono proprio stufo di questa storia.

Spero che da ora in poi si discuta su un altro piano, sui contenuti e sulle metodologie (qui un altro spunto interessante), e si smetta di fare una guerra fra poveri che toglie energie ed entusiasmo a tutti quanti. Ma sono 4 anni che lo spero e quindi non sono ottimista...

S.

Bibliostremista

Fatti:

l'università di Udine assegna ogni anno un budget di 2.200.000 € per la biblioteca;
se ho capito bene la maggior parte (la totalità?) di questo budget è speso per abbonamento a periodici;
all'università di Udine ci sono 708 fra professori e ricercatori;
2.200.000€/708 = 3.000+€ a testa, in media;
negli ultimi anni ho ricevuto 0 € (zero), o al massimo poche centinaia di euro (<500€) come fondi di ricerca per acquisto attrezzature, viaggi, partecipazioni a convegni, ecc.

Tremila. Zero. All'ultimo consiglio di dipartimento ho fatto notare la cosa e ho dichiarato che a mio parere questa è una situazione insostenibile, anche e soprattutto tenendo conto della diffusione di ebook, open access, accesso elettronico in generale, ecc. ecc. e delle numerose azioni internazionali di protesta e boicottaggio nei confronti degli editori. Qualcuno ha detto amichevolmente che sono un estremista :)

Adesso sembra che anche ad Harvard facciano gli estremisti...

S.

Wednesday 11 April 2012

È tutto un magna magna

Citation cartel.

S.

Tuesday 28 February 2012

SWIRL 2012 - 2

I forgot two things:

1) the tagcloud of the nominated papers. I simply cut&pasted the text at http://www.cs.rmit.edu.au/swirl12/discussion.php into Worlde and I got:

(I think it is clear that "evaluation" is a hot issue, once removed obvious terms like "paper", "information", "retrieval", etc., but some might disagree)

2) I believe there was an elephant in the room: Crowdsourcing.

S.

Friday 24 February 2012

SWIRL 2012

Last week I attended SWIRL 2012: these are my highlights.

Facts.

SWIRL 2012 has been a workshop where around 50 jetlagged top IR researchers gathered together essentially to experience summer during winter and incidentally to discuss the future of IR research. I am not sure that I count among the top IR researchers. Actually, probably the selection mechanism was more oriented towards the most crazy IR researchers since some really top were missing and some very crazy were definitely there. So I am not sure that I count among the top IR researchers, we can discuss if I count among the top crazy ones, but I'm sure I count among the top jetlagged ones. Luckily enough, all the others were jetlagged as well, and looked like Australians in Europe, so I managed to camouflage myself somehow. Also, and surprisingly, being very jetlagged even helped, since we had long discussions during the nights, and we managed to use the nights to read our emails :) and have the days free for working.

The workshop started with the usual nice dinner in a nice hotel. Well, actually we had a prologue, with some talks at RMIT from some IR researchers. They were, guess, extremely jetlagged, so probably not all the talks were so good, and yes I was one of the speakers. You see what I mean? Anyway, I presented Readersourcing, and I learned two things:

Now I know who was one of the referees of the famous Go*gle/PageRank paper rejected from SIGIR; and that that submitted version wasn't that good... Anyway...
When presenting Readersourcing in these years, I managed to discuss... ehm let's say to argue with S, K, and B. I did not record the first "discussion", but I did for the second one here. Now, the point is: shouldn't I seriously consider to quit this line of research?!?

Anyway, after that, the workshop started with the usual nice dinner in a nice hotel. During it I learned three things:

Evaluation? We haven't had enough.
How to wreck a nice beach (don't ask me to explain this, ask David).
You can live your life as a party (ask Leif)

Then the crowd of 50-top-and-seriously-jetlagged-IR-researchers was packed into a bus and sent to Lorne to experience real summer. Indeed, the first thing we did in Lorne was to go to the nice beach (...). But after that we started to do some serious work. I don't recall all the details, probably because I was considerably jetlagged, but I do remember the main things. We were divided into 6 random groups that had to produce some ideas about the future of IR, i.e., research topics and directions that we felt important for the field. After that, we pooled all the ideas together (you know, we like pooling; actually, it is surprising to me that we didn't manage to stick into this process a logarithm, or a significance test, or the definition of a novel evaluation metric) and we voted the 6 top ideas. Indeed we didn't like the voting result too much so we slightly modified it (I'd say "the Italian way", but strangely there were no Italians driving this process) by merging, splitting, adding, removing, changing, deleting, twinkling, etc., and we obtained The-list-of-the-6-top-ideas-that-will-change-the-IR-world-in-the-future:

Mobile IR
Structured and unstructured information
~~users~~ People
SmartIR (aka information literacy)
"(less than) zero query" [sic]
and, of course, Argy-Bargy

The other ideas were not killed, and will go back into play later on. We were then asked to form six groups, each one aiming at discussing one of the six ideas. Looking at the list it was very difficult for me to choose which group to join. Obviously, any schoolboy will understand the foundational and metaphysics nature of Argy-Bargy. Anyone will agree that we shall have no other information besides structured and unstructured. We can't kill ~~users~~ People, of course (fair enough, we didn't write "students"). SmartIR is the smarter term in the list. I've been working for the last 10 years, and published most of my last papers, on "(less than) zero query" [sic], although with different, and equally ~~stupid~~ strange labels like "query-free IR", "zero-interaction interactive IR", "browsing the virtual space by walking in the physical space", etc.

But at the end I joined the Mobile IR group. And, guess it, it was not about Mobile IR. It was about Understanding what Mobile IR is. Funny: a group that did not understand what it was about but aimed at understanding what its (mistaken) topic was about. How couldn't we succeed? Plus, when people chose their own group, strange things happened. Some people were not able to find their group; perhaps it was on the nice beach? Some people (those above plus others) joined another "wrong" group, and some of them, just to avoid being idle, decided to act as trolls. We had two trolls in the Mobile IR group, and you will understand how that made the discussion far more interesting, lively, polite, and constructive than you could ever imagine. Luckily, the night came, and after a good sleep (or perhaps some good chats, beers, barbecues, etc., since we were seriously jetlagged and can't manage to have that much sleep anyway), the morning after, Jamie and Vanessa had a clear vision of what the group was about and started drafting the report (each group was meant to write a report). Not having trolls around early morning probably helped. Did trolls suffer from jetlag? Did trolls take surfer lessons? We will never know.

As mentioned above, the other not-top-6-best-great-ideas were not killed, and some still-quite-heavily-jetlagged-IR-researchers volunteered to write a short report on those. Now, I have to mention the best name of the workshop: Axiometrics (credits to Arjen), a research line aimed at defining axioms in order to understand the about 100 IR effectiveness metrics. BTW, we might think of Anatometrics as well.

Comments?

So the workshop was great, the organizers incredibly managed to obtain something out of about-50-~~top~~-crazy-IR-researchers-that-as-you-know-were-very-jetlagged, the discussions were interesting, I managed to have some good ideas and contacts for future paper writing, and perhaps some contact for my sabbatical as well (I'm looking for places were to stay during my sabbatical; please let me know if you're interested. I promise that I won't be so jetlagged for the whole sabbatical duration.) Plus, I've been out of business recently, for several reasons including lack of funds, A.'s birth, etc., and it was really nice to meet some old good friends and some new ones.

Criticisms? I always have. We could have made use of some "Social Web/Web2.0" tools, like Gdocs, Facebook, Twitter, to have a virtual discussion as well and, for example, to vote the ideas (did I hint that the voting mechanism was a bit... "italianized"? Now, we all know about Arrow's theorem, but that was far beyond that). Someone said that s/he had the impression that we were simply drafting a report to make easier for US and Australian researchers to get funds. Someone had the impression that the meeting was a bit too "old fashioned". But as I wrote, the organizers did manage to get something out of about-50-~~top~~-IR-researchers-that-as-you-know-were-very-jetlagged, and this is an enormous success.

Quotations!

Some interesting sentences were uttered during those days, and shall never be forgotten:

Cloud computing is not transparent.
You can't leave indoor outdoor, but you can't leave outdoor outdoor either.
How to wreck a nice beach.
Life as a party.
(Julio please help with the other one)
(anybody welcome to add)

Post workshop!!

Once back in Melbourne, a (randomly) selected subgroup of all those (somehow) 50 selected IR researcher had an interesting post-workshop, post-dinner, during-beer mini-workshop on p*orn and user models. I took some pictures of the participants:

As you can see, we range from someone (pretending to be?) not interested, or perhaps simply jetlagged, to someone really having fun, to a shy guy who doesn't want to be recognized, to someone counting beers (not an easy task!), to someone hiding in the shade. I learned some interesting statistics about user features. Anyway, guys, it was fun. Thanks!

S.

Wednesday 8 February 2012

Ma dico io...

Qualche giorno fa ho pagato online un bollettino postale su poste.it. Alla fine del pagamento, mi è apparsa una pagina in cui c'era scritto che mi sarebbe arrivata nella "Nuova bacheca" personale una ricevuta del bollettino pagato (ovviamente la soluzione giusta sarebbe di visualizzare la ricevuta nella pagina web *e* di spedirmi un messaggio email, e questa seconda cosa veniva fatta prima che venisse introdotta 'sta "Nuova bacheca").

Vabbè. Comunque guardo e nella bacheca non c'è nulla. Allora scrivo non mi ricordo più neanche come, dove e cosa, una richiesta per spiegare il problema. Ieri (dopo una settimana!!) mi arriva questo email:

Gentile cliente,

in riferimento alla sua richiesta relativa all'uso dei servizi on-line del sito www.poste.it. del giorno 01/02/2012", le indichiamo alcuni controlli che può eseguire sul suo computer:
1. verificare che il browser consenta connessioni con protocollo SSL e accetti i cookie della sessione
2. eseguire periodicamente la pulizia dei file temporanei e dei cookie
3. verificare le proprietà data/ora e fuso orario del computer
4. verificare il corretto inserimento del nome utente e della password. Il nome utente va inserito nel formato nome.cognome più l'eventuale estensione (ad esempio mario.rossi-1234) richiesta al momento della registrazione. La password va digitata rispettando la sequenza dei caratteri maiuscolo o minuscolo cos come inseriti in fase di registrazione o in occasione dell'ultimo cambio.
Qualora avesse difficoltà nell'eseguire tali controlli o comunque risultasse ancora necessaria la nostra assistenza può ricontattarci telefonando al numero 803.160, gratuito da telefono fisso, (dal lunedì al sabato, con orario 8:00 20:00) effettuando la scelta 3 per i Servizi Internet.

Distinti saluti
Poste Italiane

Ma dico io... sono messaggi da mandare? Dopo una settimana?! Non riportano la mia richiesta, il messaggio è chiaramente preconfezionato e one-size-fits-all, e mi chiedono pure di telefonare? Mah. Per scrupolo controllo 'sta bacheca e ci sono ben 2 copie della ricevuta...E comunque una copia della ricevuta c'era anche in non so più quale meandro del sito web (l'avevo trovata quasi per caso visto che non rispondevano alla mia richiesta).

Bah. Che servizio.
S.

Wednesday 1 February 2012

Bah

Qualche Consiglio di Facoltà fa si è discusso dei laboratori di facoltà. Qualcuno (me compreso) sosteneva che forse prima di spendere migliaia di euro in postazioni di lavoro poteva aver senso pensare di "sfruttare" i portatili degli studenti e investire i miseri fondi in altro (software, server, infrastrutture, ecc.).

Più o meno contestualmente, qualche Consiglio di Facoltà fa (qualche mese fa!) ho chiesto che venisse creata una mailing list di facoltà. Ancora niente... (è un'operazione di pochi minuti). Volevo usare la mailing list anche per divulgare i risultati di un questionario; siccome la mailing list non c'è, divulgo qui.

Il questionario riguardava proprio la situazione laboratori di facoltà e conteneva varie domande, alcune con risposte aperte in testo libero. Hanno risposto in 406 (su circa un migliaio di studenti della facoltà). Fra i risultati, solo il 6% non ha a disposizione un suo portatile; solo il 17% preferisce avere a disposizione una postazione fissa in laboratorio. Fra i commenti in testo libero, ci sono anche un po' di richieste "interessanti" e ricorrenti, tipo avere stampanti migliori, avere accesso a internet e alla rete Wi-Fi, ecc.

Insomma, in una facoltà con corsi di laurea vari in Informatica, IT, ICT, la situazione è:

non c'è una mailing list dei membri della facoltà;
dai laboratori non c'è l'accesso alla rete WiFi;
si fa fatica a stampare.

Ecco: bah.

P.S. Riporto qui anche la mail che avevo fatto inviare il 5 ottobre 2011 a tutti i membri del CdF, così, per completezza (tanto, in barba alla netichetta, è già arrivata ad altre persone oltre i destinatari originali...):

Oggetto: Sui laboratori di facolta'

Care/i tutte/i,

durante l'ultimo CdF, [...] ha sollecitato una riflessione ponderata sul modo migliore per investire il contributo per i laboratori di facolta' che l'Ateneo (forse) ci assegnera'. Vorrei portare alcuni contributi piu' meditati di quanto ho gia' detto in Facolta', e soprattutto alcuni dati. Cerco di essere sintetico.

Innanzi tutto, osservo che questo non e' un problema solo nostro, e che se ne parla ormai da anni, come una veloce ricerca su google consente di appurare:

- http://www.google.com/search?q=what+computer+lab+should+university+have+now+that+students+have+laptops%3F

Poi, ecco alcuni dati:

1) Ho svolto un veloce sondaggio al corso di Programmazione e laboratorio per la laurea in TWM. Erano presenti circa 70 studenti, la maggior parte matricole piu' alcuni studenti degli anni successivi. Risultato: il 100% (*tutti*) avra' un portatile entro 6 mesi.

2) Ho svolto un sondaggio piu' sistematico usando Google Form. Il sondaggio dovrebbe essere stato inviato agli indirizzi mail SPES di tutti gli studenti di un corso di laurea della nostra facolta' (incluso l'interfacolta' CMTI). Risultati:

- hanno finora risposto 304 studenti (su un totale di circa 1000, immagino), in due giorni e mezzo;
- il 95% ha un proprio portatile o ce l'avra' entro 6 mesi;
- il 50% dichiara di preferire di "Poter usare il tuo portatile"; il 25% si dichiara "Indifferente"; il 16% di preferire di "Avere a disposizione un calcolatore fisso da usare, fornito dalla Facoltà".

Il sondaggio e' ancora in compilazione, immagino soprattutto da parte di studenti del primo anno che forse non hanno ancora un indirizzo email su SPES. Fra qualche giorno chiudero' il sondaggio e rendero' disponibili tutti i dati; comunque il quadro mi pare gia' chiaro.

Sulla base di questi dati, mi pare che non si possa che ribadire la necessita' di ridiscutere le modalita' di allocazione degli eventuali fondi per il laboratorio. Mi sembra che sia da considerare seriamente l'opzione di *non* investire in 50 postazioni di lavoro ma in infrastrutture di rete, prese elettriche, stampanti, software, licenze, servizi Web, ecc., come appunto suggerito durante il CdF. Molti commenti degli studenti (raccolti con altre domande specifiche del questionario) vanno in questo senso.

Secondo me sarebbe anche una buona mossa investire in un certo numero di laptop (5? 10?) da fornire in comodato d'uso ai migliori studenti di ogni corso di laurea, magari in convenzione con qualche venditore per avere prezzi scontati e una buona pubblicita'.

Gia' che ci sono, aggiungo in chiusura anche due brevi riflessioni sul sistema informativo di ateneo/facolta'.

A livello di ateneo, non c'e' una mailing list degli studenti di un certo anno/corso/facolta'. Per inviare il questionario a tutti gli studenti ho dovuto interagire con l'URP che, "a mano" (non so i dettagli...) ha dovuto estrarre da un database i vari sottoinsiemi di studenti e procedere a piu' invii separati (per inciso, questo e' il motivo per cui ho usato poc'anzi condizionale sull'invio a tutti gli studenti della facolta'). Ho interagito per un'ora con Google Forms per preparare il sondaggio, e poi ho interagito per un giorno con l'URP per riuscire a inviarlo. Scusate se sono brutale, ma mi sembra una cosa ridicola.

Ma ancora piu' ridicolo mi sembra il fatto che non c'e' una mailing list dei docenti del nostro CdF (di Scienze!), ne' del CCL (in discipline *informatiche*!). Per il CCL c'e' un elenco di docenti "in chiaro", e per il CdF neanche quello. Le liste CdF/CCL consentirebbero di far circolare certi dati per email (tipo questo messaggio, ma anche, ad es., i dati sulle immatricolazioni/iscrizioni), con conseguente risparmio di tempo e maggiore trasparenza. Usando gli strumenti di Google si fa tutto in un'ora; e c'e' comunque anche un servizio di ateneo.

Quindi concludo chiedendo che vengano create al piu' presto le mailing list di CdF e CCL, con gestione in carico alla segreteria di facolta'. Se questo non verra' fatto entro il prossimo consiglio, e' mia intenzione chiedere una votazione durante le Varie ed eventuali per deliberare sulla creazione immediata delle liste.

Saluti,
S.

Bah.
S.

Thursday 15 December 2011

Dati

In qualche ultimo consiglio si insiste che alle specialistiche diamo voti troppo alti, sia negli esami sia come voto di laurea. È una sensazione che ho avuto anche io, ma siccome è meglio avere dati, ho provato a confrontare su Almalaurea i voti di Udine e della media italiana degli anni 2006-2010, e il confronto è interessante: diamo voti più alti negli esami (circa +1% in media sui cinque anni) e più bassi alla laurea (circa -3% in media sui cinque anni). Ecco due grafici:

Come dico sempre, bisogna partire dai dati e *poi* formarsi le opinioni...

S.

Thursday 6 October 2011

Today

Stay Hungry. Stay Foolish.
Today. But always remember that fools rush in where Angels fear to tread.
S.

Thursday 29 September 2011

Tunnel

Ricapitoliamo:

Comunicato 1
risate
Comunicato 2
sempre risate
Articolo:
La fine?

Conclusione: