File:I-could-have-written-that these-are-the-words mb 300dpi.png File:Text-mining-is +1.00 certain.png File:I-could-have-written-that categories-modality.gif

i-could-have-written-that is a two hour workshop, in which we will write between the lines, raw data will be cooked, and personal oracles will be trained. The workshop contains three parts. We start with a writing exercise in which we challenge the strictness of numbers with words. Then we will train our personal oracles by following the 1989 model Knowledge Discovery in Databases, and do manual pattern recognition with notecards and pens. After this hard work, we will ask three algorithms to do a bit of work as well. They will built your personal oracle, which you can consult at any time in the future!

Many text mining services offer pre-trained classifiers through API's without specifying how the system is created. They promise to read meaning out of a text correctly. This unconditional trust is the basis of automated essay scoring systems (AES), psychology studies, job vacancy platforms and terrorism prevention. Automated systems are trained to detect sentiment, education level, mental illness, or suspicious behaviour. The techniques are used online in i.e. social media platforms, in the academic field and by the government. However, framing text mining techniques as readerly systems makes them immune for questions and discussion. When text mining results are considered as a product of writing, the presence of an author is emphasized. This makes it possible to discuss text mining results in relation to predefined goals and breaks the idea that text mining results represent an absolute truth.