Conceptualizing Creativity: From Distributional Semantics to Conceptual Spaces Kat Agres, Stephen McGregor, Matthew Purver, and Geraint Wiggins School of Electronic Engineering and Computer Science Queen Mary University of London London E1 4NS UK kathleen.agres, s.e.mcgregor, m.purver, geraint.wiggins (@qmul.ac.uk) Abstract This paper puts forth a method for discovering computationally-derived conceptual spaces that reflect human conceptualization of musical and poetic creativity. We describe a lexical space that is defined through co-occurrence statistics, and compare the dimensions of this space with human responses on a word association task. Participants’ responses serve as external validation of our computational findings, and frequent terms are also used as input dimensions for creating mappings from the linguistic to the conceptual domain. This novel method finds low-dimensional subspaces that represent particular conceptual regions within a vector space model of distributional semantics. Word-vectors from these discovered conceptual spaces are considered, and argued to be useful for the evaluation of creativity and creative artifacts within computational creativity. Introduction This paper presents a computational-linguistic model for mapping lexical spaces populated by statistical representations of words to conceptual spaces defined in terms of feature dimensions of conceptual representations. This research has three main goals. The first is to compare the features of a distributional semantic vector space with the results of an empirical word-association task completed by human subjects. This empirical corroboration serves to demonstrate that the model can capture meaningful aspects of human conceptualizations of queried topics, which are “musical creativity” and “poetic creativity” in the present study. The second goal is to use novel methods inspired by computational linguistics to map terms from the linguistic domain to representations in the conceptual domain. To this end, the terms generated by participants are used as input parameters for our computational model that uses co-occurrence statistics and linear algebraic metrics to quantify conceptual proximity. The third motivation of this work concerns the evaluation of creativity. In the field of computational creativity (CC), the evaluation of creative output is often either subjective on the part of the developer/researcher or unsystematic. We offer our own fundamentally computational approach as a means of identifying facets of the investigated concept or domain. Put another way, our model can generate terms within a conceptual space that may be used to query different aspects of creative output or creative behavior. Vector space models of distributional semantics are currently a popular approach for quantifying linguistic similarity, but many contemporary studies need grounding and external validation. Much of the work in this area compares model performance to semantic databases, but does not directly relate results to the cognitive performance of humans, or uses very restricted tasks, such as similarity judgments, rather than imploring subjects to elaborate on concepts. Because our aim is to elucidate how humans conceptualize creativity, sampling from people’s own formulation of conceptual spaces is essential. Therefore, in the present work, our ground truth is derived from human responses stemming from direct queries about creative concepts. Because human response data is a limited and expensive resource, we hope that our comparison to human data will inform how conceptual spaces may be discovered as autonomously as possible in the future (that is, without the requirement of subjective user-input or parameter-tweaking). We also believe that this multidisciplinary and externally validated approach produces a more robust system. In order to pinpoint the relationship between the output of our computational model and the results of our empirical study, we take the human-generated terms and investigate their situation within the multidimensional space of our distributional semantics model. We then determine the characteristic co-occurrence dimensions of sets of words associated with concepts, and apply appropriate methods to reduce the dimensionality of the space in order to map broader clusters of linguistic terms to conceptual regions. We argue that the online generation of a reduced lexical space corresponds to the contextualization inherent in the momentary way in which concepts are necessarily formed in response to situations in a cognitive environment. We expect that this methodology will be a useful applied approach to formalizing the geometrical representation of conceptual spaces. Our research explores two related concepts: musical creativity and poetic creativity. There are several reasons for this choice. First, we are interested in computational creativity, and in particular in the evaluation of creative systems and their output. In order to evaluate creativity, it is necessary to characterize features of this concept using the expressive affordances of language. Our computational methods seek to capture these features of the conceptual space. Our model may also be used to discover conceptually-related terms that a human might not necessarily immediately conProceedings of the Sixth International Conference on Computational Creativity June 2015 118 sider. We hope this approach may be used to elaborate abstract concepts by elucidating an extensive set of terms that correspond to the queried conceptual spaces. We therefore offer this methodology as a novel approach for exploring and elaborating concepts, both for the evaluation of creative systems and for potentially contributing to creative pursuits themselves (such as poetry generation). Furthermore, we apply our method to a more concrete domain, extending a small subset of terms relating to the concept WILD ANIMALS in order to indicate the anticipated generality of our model. The organization of the paper is as follows: first we offer a summary of computational approaches to conceptual creativity, situating our research within the field. This overview leads into a discussion of computational approaches to the topics of conceptual spaces and geometric representations of concepts. An explanation of our computational model is then provided, including a description of how we have modeled a lexical space populated by word-vectors. This is followed by a description of our empirical study with human participants, and findings from this questionnaire-based study are reported. Given this context, we then discuss two ways in which the participants’ responses contribute to our computational approach. The first is a comparison of computationally-derived terms with human-generated terms. The second contribution will be to treat the salient features of the word-vectors corresponding to the most frequently reported human terms as an indication of the dimensions of a vastly reduced subspace of our distributional semantic model. We then discuss how terms that fall near the centroid of the positively valued surface of the discovered lexical spaces may be used for the evaluation of creativity. CC and Concept Discovery Computational approaches to creative conceptualization have provided a target that is both elusive and essential to the identity of a field that incorporates a particularly diverse range of topics. Creativity itself has been interpreted by Koestler (1964) as a kind of meshing of disparate conceptual schemes, by which expectations are violated in favor of interesting new combinations of frames of reference. Presciently, Koestler has couched his model of creativity in terms of vector spaces and transformations, an idea which is broadly shared by the model presented in the present paper. In the same spirit of conceptual exploration, Hesse (1963) argued that the formation of creative analogies is the essence of scientific discovery, an idea demonstrated by the primacy of analogical modeling in fields such as physics, where there is no realistic way to literally conceive of phenomena that occur on obscurely minuscule or vast scales. In the specifically computational domain, Veale (2006) has proposed a system for the dynamic generation of new, non-literal conceptual categories based on a computational analysis of a taxonomical database such as WordNet. Likewise, other researchers are developing formal models of conceptual blending (Fauconnier and Turner, 2008) that seek to discover novel combinations of familiar ideas, targeting domains such as mathematical reasoning and story generation (Ontanon, Zhu, and Plaza, 2012). These ap- ´ proaches make clever and effective use of heuristics to pick out interesting new conceptual representations based on preconceived patterns identified by programmers. As such, the output of these methods is compelling and valid, but the conceptualization itself is arguably handed to the system in the prepackaged form of externally grounded symbols. Elsewhere, Heath et al. (2013) have taken a more connectionist approach to conceptual creativity, combining human based word associations with statistical models of distributional semantics to design a system that infers conceptual categories from lists of terms, and likewise generates lists of terms from linguistic input that is interpreted conceptually. In a similar vein, Jager (2009) has performed a statistical ¨ analysis on a set of human reported color terms and used this analysis to generate a geometric representation of certain consistencies in the ways that color is conceptualized across cultural linguistic boundaries. In their commitment to building models based on low level, non-symbolic observations about the world, these statistical approaches to creative conceptualization are in the same spirit as the work presented in the present paper. The model described here has been designed to engage with the field of computational creativity on two different planes. Principally, our method seeks to implement a low level approach to the delineation of conceptual regions based on the geometry of a distributed semantic space. By viewing concepts as momentary and pragmatic phenomena, we are able to use ad hoc reductions of a high dimensional lexical space to map concepts creatively based on situational contexts which do not have to be preformulated in the design of the model. Furthermore, our target domains of musical and poetic creativity play nicely into a salient issue in the field of computational creativity: the analysis of creativity itself, a difficult procedure that necessarily involves some degree of conceptualization about creativity. This secondary aspect of the work, the potential for meta-analysis inherent in the question of whether our model’s output will be useful for guiding an evaluative discussion of creative work elsewhere, is intended to give the work its own pragmatic grounding, in that this suggests a practical application for the creative output described in the following pages. Spaces of Meanings This project uses computational methods as a platform for exploring the relationships between words and concepts within the context of a cognitive system. In the pragmatic spirit of Wittgenstein (1953) and Grice (1969), language is presented as a system defined by its own functionality, with meaning emerging from the use of words in the course of accomplishing communicative goals. To the extent that language is used to communicate ideas, statements are formed contextually, with reference to expectations about how relationships between words will suggest hierarchies of categorization relative to a particular situation. Barsalou (1993) characterizes the relationship between words and concepts in terms of the linguistic vagary inherent in the application of names to ideas: words represent concepts in a way that is fleeting and mutable. Fundamentally, words stand as indices to concepts, and the relationship between language and ideas is best understood as a mapping between two separate domains. The project presented in this paper is therefore motivated by a desire to model the relationship between two Proceedings of the Sixth International Conference on Computational Creativity June 2015 119 different spaces, one of words and one of concepts, and to explore the ways in which these spaces might be aligned in terms of the computationally tractable elements of their geometries. Gardenfors (2000) has presented a spatial theory of con- ¨ cepts, by which the dimensions that determine the geometric situation of a conceptual region within a space of concepts correspond to the attributes which characterize that particular region. So, for instance, the concept RIPE BANANAS would occupy a region towards the higher end of the dimensions of curviness, yellowness, and sweetness within a conceptual space. This literal and factual quality of dimensions grounds conceptual spaces in low level observations about the world, giving regions within the space a geometric dynamism that lends itself to doing higher level work with the entities that emerge from the space as symbolic representations. In particular, well defined conceptual regions are characterized by convexity, a property that ensures that any intermediate point between two outlying extensions of a region will likewise belong to that domain. Vector space models of distributional semantics, on the other hand, offer an approach to language modeling involving a distinctly unstructured computational analysis of linguistic data. In the tradition of Harris (1957), the distributional hypothesis holds that there is semantic information inherent in the statistical comportment of language: linguistic meaning can be found in the quantifiable contextual relationships between words. This insight has motivated a productive field of research, with computational analyses of large scale corpora yielding distributional semantic models in which the meanings of words, sentences, and documents are rendered in terms of mathematically tractable representations (Schutze, 1992; Landauer et al., 1997). Distributional ¨ semantic models treat words as vectors, with the dimensions of these vectors representing, either directly or abstractly, the likelihood of a word occurring in a given context. The closeness of vectors in a lexical space, which reflects the tendency of the proximal vectors to occur in similar contexts, has been shown as an indication of lexical similarity between the words tied to the vectors. In their most straightforward implementation, distributed semantic spaces are constructed by counting the frequency with which each word in the model co-occurs with all other terms in a base corpus (see Turney and Patel, 2010; Clark, 2015, for an overview). There is an important difference between lexical spaces and conceptual spaces: the dimensionally regimented quality of coherent domains within a conceptual space is not reflected in the distribution of vectors in a lexical space, where the dimensions of word-vectors correspond simply to the context in which those words are likely to occur, and therefore capture all the flexibility and uncertainty of language in use—the linguistic vagary of Barsalou’s system of conceptual symbols. So, for instance, the other vectors in the proximity of the word-vector !!pet in a distributed semantic model cannot be expected to contain only terms corresponding to domesticated animals, not least because the word “pet” itself has other uses. In this sense, where conceptual spaces are marked by a tidy taxonomy facilitated by the clarity of a region’s dimensional substrate, distributed semantic spaces embody the pragmatic messiness of language as it is encountered in its natural, operational environment. Therefore, while lexical spaces and conceptual spaces both utilize geometry as a vehicle for semantics, the arrangement of a lexical space is in an essential way less ordered. An example of the difficulty of delineating conceptual regions within a lexical space is illustrated in Figure 1a. In the rudimentary distribution of words presented here, concepts are required to stretch and overlap in order to maintain their lexical constituencies. This simplified depiction of the potential uncertainty of conceptual membership doesn’t demonstrate the even more fundamental problem of picking out salient words in regions that are littered with noise: in practice, in the densely and unevenly populated territory mapped out by a vector space model, many unwanted terms will be discovered in the region generally between two other terms. For instance, in the unrefined version of our model, there are 14 terms essentially between the word-vectors !!cat and !!dog, including such unlikely candidates as “during”, “eventually”, and “featuring”. There is thus an inherent patchiness to the mapping of concepts that might be read in an unrefined vector space model of distributional semantics. Here we propose a system for mapping lexical spaces to conceptual spaces by considering a conceptualization as a particular and temporary perspective on a space of distributed semantics. The idea behind this system is that, for any desired clustering of words corresponding to a particular conceptualization, there is some subset of a distributional space’s dimensions that will render a subspace in which that clustering is realized. This intuition is illustrated in Figure 1b, where the conceptually entangled space of Figure 1a collapses into a particular conceptual regime depending on the axis along which the space is projected, which is to say, the perspective from which the space is considered. The task of our system is therefore to determine the dimensions which should be picked out of a higher order vector space model in order to realize a grouping of terms that is conceptually homogeneous by virtue of the contextualization imposed by a particular perspective on the space. It is precisely the massive dimensionality of the space which facilitates the method’s ability to pick out various successful conceptual perspectives on the space in a momentary and continuous way. With each additional contextual dimension introduced to a vector space, there is an exponential increase in the lower-dimensional combinations available to map corresponding spatial relationships of words to conceptual subspaces. Moving from the linguistic realm native to vector spaces back to the cognitive domain targeted by Gardenfors, these dimensional perspectives might be con- ¨ strued as corresponding to a contextualized perception of a situation. In this respect, our system models the haphazard quality of conceptualization described by Barsalou, as well the ad hoc nature of concept formation discussed more recently by Allott and Textor (2012), who suggest that meaning is appropriated in situ to endow statements with contextually relevant implicature. This phenomenon of conceptualization arising from pragmatic communicative affordances is what our method seeks to computationally model. Proceedings of the Sixth International Conference on Computational Creativity June 2015 120 Figure 1: Conceptual Perspectives of Vector Spaces wolf dog cat lion canine pet feline predator (a) In this simplified and unrefined distributional semantic space, the conceptual regions suggested by the spatial arrangement of terms are indeterminate. Each word is roughly equidistant from two other terms, either of which could be linked in a distinct linguistic depiction of a concept. The conceptual domains which are delineated by this arrangement of words are awkwardly elongated. wolf x x dog x cat x x x lion x x predator o predator pet o canine o feline o species niche (b) If the same space illustrated above is considered from two different perspectives, the indeterminate arrangements of words collapse into lower dimensional spaces (one dimensional, in this simple example) in which the clustering of terms suggests straightforward conceptual domains. These perspectives effectively contextualize the meanings inherent in the distributional characteristics of the language model, and this context facilitates the mapping of the linguistic space to sets of conceptual regions. A Literal Lexical Space Our lexical model has been constructed based on the distribution of words found in the textual component of articles on the English language Wikipedia website.1 The xml code of the site was downloaded and then parsed into a text-only for- 1 The December 8, 2014 dump, downloaded from http://meta.wikimedia.org/wiki/Data dump torrents on January 23, 2015, parsed into plain text using the “Wikipedia mat, eliminating images, tables, lists, captions, and section titles, leaving only the well formed sentences composing the content of the site’s articles. Sentences were separated by identifying terminal punctuation followed by whitespace, then punctuation was removed and all characters were converted to lower case. Articles (“a”, “an”, and “the”) were stripped from the text. Sentences containing less than five words were discarded. The resulting corpus consists of almost 60 million sentences, containing about 1.1 billion word tokens (individual words) corresponding to about 7.4 million word types (classes of words). From this base corpus, we took the 200,000 most frequent word types to form our system’s vocabulary. Our full lexical space is represented as a matrix Mw,c, where rows correspond to vectors representing words, and columns correspond to co-occurrence terms. The cell for word w and cooccurrence term c contains the mutual information MIw,c as described in Equation 1. Here nw,c is the frequency with which a term c is observed to co-occur within a context window of two words on either side of a vocabulary word w; nw represents the total count of w in the corpus; nc is the total count of c; and a is a smoothing constant. MIw,c = log2 ✓ nw,c ⇥ N nw ⇥ (nc + a) + 1◆ (1) The constant a reduces the undesirable effect of contextual words that occur very rarely throughout the corpus, but with a high frequency in the context of certain target words—we found 10,000 to be a good value for a based on trial and error. The value 1 is added to the probabilistic ratio in order to render all dimensions within the space positive: this means that the logged MI value of target words and context words that never occur together will be 0, and the value for terms that co-occur less frequently than would be expected in a random distribution will be between 0 and 1. Each word vector consists of a set of dimensions derived through this calculation, and each of these vectors is normalised to the scale of a unit vector. The result of this process is a distributional semantic space in which each of the 200,000 vocabulary terms sits in the positive region of the high-dimensional surface of a hypersphere. One notable feature of our vector space is the literal correspondence of its dimensions to co-occurrence terms. In general, state-of-the-art systems apply some form of dimensional reduction to the overall space, either using linear algebraic transformations to perform a principal component analysis (Pennington, Socher, and Manning, 2014), or by weighted networks to train abstract lower-dimensional word representations that predict the context in which that word is encountered in the course of training (Mikolov et al., 2013). While these techniques certainly make the space less expensive to compute, and arguably improve results for a variety of semantic tasks, our system is specifically geared towards the identification of salient, literal co-occurrence dimensions, and as such the space is, for the purposes of our initial analysis, maintained in its raw high-dimensional form. The dimensionality of our space is therefore on the Extractor” software, downloaded on February 13, 2015 from http://medialab.di.unipi.it/wiki/Wikipedia Extractor. Proceedings of the Sixth International Conference on Computational Creativity June 2015 121 order of 7.4 million, as every word token in the corpus is a potential context for the 200,000 words in our vocabulary. Empirical Validation In order to provide grounding and validation for our computational model, human participants were asked to generate terms during a word association task, and to reflect upon how they would evaluate creativity in the musical and poetic domains. Provided terms were analyzed for comparison with the vector space model. Method Participants Twenty participants (avg age = 30 yrs, stdev = 5.2 yrs) volunteered to take part in the study, of which 11 were female. Sixteen individuals indicated that their career is either inherently creative or that they apply creativity to improve their job performance, and all but two of the participants engage in creative pursuits outside of work. Seven currently practice or perform music, and five individuals currently engage in creative writing. Procedure After reading an information form and providing informed consent, participants were given a brief questionnaire to complete. Two of the questions consisted of a word association task in which participants were asked to list three terms they associate with “musical creativity” or “poetic creativity”. The order in which musical or poetic terms were prompted was counterbalanced across subjects. The other two questions requested participants to write one sentence describing how they would evaluate whether a new piece of music or poetry sounds creative (the order of these questions were similarly counterbalanced across subjects). These results will only briefly be touched upon in the current paper; although certainly of interest, due to space constraints, an in-depth analysis of the evaluation sentences must be saved for an expanded version of this work. After providing their responses, participants were given questionnaires requesting general demographic information (age, ethnicity, etc), and information about their past and current involvement in creative pursuits, e.g., “Do you currently play music or engage in creative writing?” Upon completing these, participants were debriefed as to the goals of the experiment and paid £2 each for their participation. Results For both musical and poetic creativity, participants’ terms were placed into two lists: An exhaustive list of all terms provided for the concept, and a short list for terms cited by more than two participants (per concept). In the case of musical creativity, this yielded an exhaustive list of 52 distinct terms, and for poetic creativity, a set of 42 terms. The short list of musical terms included the following six terms: innovation, sound, instruments, novelty, emotion, and expression. The short list of poetic terms included these six terms: emotion, rhythm, expression, structure, flow, and words. We interpreted these concise lists of most frequent words to re- flect dimensions of the concept that are more central to the conceptual space they populate. This resulted in discarding more peripheral terms such as ”sensitive” that are undoubtedly related to creativity, but not cited frequently as an associated concept. Plurals and conjugations were considered to be the same category of term, e.g., ”emotions”, ”emotional”, and ”emotion” were tallied together as ”emotion”. We continue the discussion of empirical findings in the next section, as we compare the model’s performance with human results. Mapping Words to Concepts We began the exploration of mappings from our space of distributed semantics to a conceptual space with a top down approach, investigating the way our system reacted to the same kind of input that we presented to our human subjects. Along these lines, we examined the vectors for the word pairing “musical” and “creativity”, and likewise for the pairing “poetical” and “creativity”. In each instance, we calculated the mean value for each dimension that had a nonzero value for both words – that is, for each dimension corresponding to a term that co-occurred with both words at least once in the corpus – and returned a ranked list of average scores, running from high to low. Out of the 7.5 million co-occurrence features across the entire model, 4,772 were non-zero for both “musical” and “creativity”, and 2,673 for both “poetic” and “creativity”, statistics which highlight the sparsity of the base space. Our objective was to examine the nature of the terms that tended to come up in the context of our query as phrased for our human subjects. Results are listed in Table 1. The first thing to note about these results is that they are, in a qualitative sense, coherent descriptions of properties typically associated with the two concepts being explored. To frame this more empirically, these results can be extended in order to discover how far down the list of top mean dimensions the terms reported by humans lie. Of the exhaustive list of terms reported by human subjects in response to the “musical creativity” query, 4 fall within the top 15 results generated by our model; likewise, 4 human responses fall within the top 15 mean dimensions for “poetic creativity” (these terms are italicised in Table 1). Considering that 200,000 words were used as the vocabulary of the model, yielding 4 of the top 15 dimensions in common with humans’ responses for both concepts is quite compelling. This outcome may be interpreted as indicating that there is a high degree of mutual information between the query words and terms that humans would consider as conceptually descriptive of those queries. In other words, there is a high likelihood of conceptually relevant co-occurrence within the context of terms that summarize these creative conceptual domains. These positive results do not hold up, however, for more concrete queries. For instance, when the mean dimensions for the query pair “wild” and “animal” are explored, top ranking results include some conceptually appropriate terms such as “boars”, “deer”, and “feral”, but less directly relevant words like “skins” and “vegetable”, and even antonymic terms like “domesticated” are also returned. It would seem that, in the case of words indexing more concrete concepts, the likelihood of co-occurrence in the conceptual context moves away from terms that generically describe components of the concept in question. This distinction is corroborated by Hill, Korhonen, and Bentz (2014), Proceedings of the Sixth International Conference on Computational Creativity June 2015 122 “musical” & “creativity” “poetic” & “creativity” innovation genius imagination imaginative inventiveness metaphors improvisation originality talent prose talents creativity experimentation artistry versatility craftsmanship artistic intuition creativity imagery ingenuity inspiration aesthetics talents spontaneity lyrical individuality talent artistry self-expression Table 1: The top 15 dimensions with the highest mean scores between the word-vectors for each of our queries as given to human subjects. Terms in italics denote dimensions that were also cited by humans. who have used computational analyses of both corpora and semantic graphs to illustrate a distinction between the way that abstract and concrete concepts are arranged in a cognitive linguistic system. It is hardly surprising, given the inherent ambiguity of language use – replete, as it is, with metaphor and implication – that simple co-occurrence probability statistics do not generally map neatly on to well de- fined conceptual spaces. Projecting Words to Conceptual Subspaces Motivated by this predictable shortcoming of a simple dimensional analysis, we developed a more sophisticated approach for delineating conceptual regions within dimensionally reduced subspaces of our language model. Our technique involves first hand-picking a small set of terms that might be considered as paradigmatic descriptions of components of a conceptual domain (in the present example, WILD ANIMALS). We perform an analysis similar to the one described above on these conceptual component terms, selecting the word-vector for each term and then extracting those features with non-zero values for all input terms. Once again, we compute a ranked list of these mean feature values and choose the co-occurrence dimensions which scored highest on average. These salient dimensions for the small set of words analyzed are again somewhat scattered: some of the highest mean dimensions correspond to relevant animal names, but the results also stray into the more conceptually ambiguous territory signified by words like “sightings”, “chases”, and “fat”. There are 827 universally non-zero dimensions found between the word-vectors of the six input terms describing exemplars of WILD ANIMALS listed in the first column of Table 2. We use these salient dimensions to define a drastically simplified subspace of our lexical model. Specifically, we reduce the model to the top 30 dimensions associated with the set of sample words (we arrived at the number 30 through trial and error; lower values tended to invite some unusual vectors into the crucial region of the subspace). After normalizing the new subspace, we then identify the central point on the surface of the positively valued quadrant of the reduced hypersphere—effectively the vector defined by 30 dimensions each with the the value 1/ p30. This positive centroid is then taken as the epicenter of a linguistic mapping of a new conceptual region, and we expand the region outward concentrically from this point, returning an ordered list of the points closest to the center of the positive surface of our space’s low dimensional projection. Euclidean proximity is calculated by computing the square root of the sum of the squared feature-wise differences between the unit centroid and each of the 200,000 vocabulary words projected into the subspace. The top fifteen terms encountered using this method are reported in Table 2. Please note that the input terms for WILD ANIMALS are used as a preliminary test of the model’s performance for concrete concepts; the input was hand-selected by the investigators, while the collection of terms relating to concrete concepts is the subject of ongoing emprical study. This same technique for expanding conceptual regions through a dimensional reduction of a distributional language model is applied to our target domains of musical and poetic creativity, again with compelling outcomes. In this case we were able to make use of our results from our survey: for each of our two target domains, we choose all the terms that were reported by three or more human subjects and analyze these for their most salient dimensions of co-occurrence. Again, the system’s output for these terms is not entirely unexpected, but also not conceptually completely cohesive. In the case of the highest mean dimensions for the human reported constituents of MUSICAL CREATIVITY, a number of predictable terms are returned, but somewhat less obvious dimensions such as “lab”, “mere”, and “shapes” also rank towards the top of the list. Despite the conceptual uncertainty in the dimensional analysis, when a new subspace is constructed based on these dimensions, the central region of this space is replete with terminology appropriate to the example words at the base of the process. Interestingly, the original input words are only partially rediscovered in this new space, at least within the set of vectors most central to the positive surface of the new subspace. This indicates that some of the input word-vectors (used to select dimensions for creating the new subspace) are, in terms of the probability of regular co-occurrence with all the dimensions that underwrite the subspace, relative outliers which nonetheless make an essential contribution to the delineation of this linguistic representation of a conceptual region. It is also notable, and perhaps even remarkable, that in the case of the mapping of the conceptual region of poetic creativity, quintessential new terms such as ”phrasing” and ”inflection”, arguably more intricately associated with the prosodic nature of the target domain than the original human generated terms, arise independently. When examining how well the model captures relevant terms to a conceptual query, it is informative to cluster human responses into semantic categories (such as emotion and structural elements), and compare these results to apparent categories of output vectors. For example, considering the sentences that the 20 participants wrote about how Proceedings of the Sixth International Conference on Computational Creativity June 2015 123 WILD ANIMALS MUSICAL CREATIVITY POETIC CREATIVITY human input model output human input model output human input model output lion bobcat innovation novelty emotion phrasing wolf alligator sound liveliness rhythm intonation coyote raccoon instruments spontaneity expression musicality alligator opossum novelty innovation structure nuances bear armadillo emotion expressiveness flow timbre snake white-tailed expression refinement words sprightly anteater nuance rhythmical ocelot ingenuity nuance peccary believability expressiveness pronghorn newness rubato cougar sophistication instinctive cottontail dynamism bluesy rattlesnake subtlety directness skunk vibrancy modal boar elusiveness inflections Table 2: The output vectors most central to the positive regions of the subspaces reduced in terms of the salient dimensions of a small set of conceptually exemplary input terms. they would evaluate the creativity of a new song, many individuals referred to the notion of novelty in musical creativity, but used various terms to do so. In addition to explicitly using the term “novelty”, participants made reference to “unexpected”, “new”, and “surprising elements” that were “like nothing else I’d heard before”, as well as “melodic originality” and cases in which “known musical concepts or styles [are] combined in a novel/innovative way.” Similarly, when considering the model’s conceptual space of musical creativity, the words “novelty”, “innovation”, “ingenuity”, “newness”, “inventiveness”, “distinctiveness”, and “uniqueness” are found within the top 30 model output vectors. Although this is a qualitative assessment of the results, it does seem clear that the precise terms from humans and the model might not be exactly the same, but there is significant categorical or conceptual overlap between the two. One may also note that the output vectors for poetic creativity appear to be rather “musical”. This may reflect the fact that the input dimensions were provided by people who, overall, have significant musical experience - all but three of the participants have had musical training or have played music informally, whereas only four of the participants have experience with creative writing. People’s experience with music might frame the way they think about other creative domains, or at the very least influencing the terms used to describe poetic creativity; consequently, this has led to a subspace that highlights the musical nature of this sample. In light of our model’s ability to find conceptually proximal terms, we propose that this method has the potential to be practically applied to the discovery of unexpected and valuable terms for the evaluation of creative output. Importantly, this approach may be applied to different corpora; for example, Wikipedia pages in different languages may be explored to address the difficult issue of identifying conceptually similar spaces across languages. The model’s conceptual spaces concretely delineate evaluative terms that one person alone may not consider. For example, the terms “distinctiveness”, “finesse”, “artistry”, and “stylization” were not cited by humans, but were within the top 30 output vectors discovered by the model. Future work may build on these findings, by using the model’s discovered terms as criteria for subjective evaluation of creative output. In addition, discovering the geometry, flexibility, and contextual speci- ficity of conceptual spaces may be very useful for assessing products or systems based on specific underlying concepts (or developed to address particular conceptual issues). More generally, our method is presented as an implementation of the mapping of words to concepts: this approach charts a passage from a statistically tractable lexical space to the abstract but natively cognitive domain of ideas. The temporary and contextual aspect of this mapping is essential to its success: it is the flexibility of the model that allows for the bespoke generation of subspaces, just as it is the pragmatic frangibility of language that permits the ready-tohand adaptation of meaning for unfolding expressive purposes. As can be seen in our results, the same terms arise in different constellations of meaning depending on the contextual perspective taken on the space. It is the strength of our language model that it can be adapted in this way, with the high dimensional arrangement of words allowing for their projection as multitudinous conceptual representations. Conclusion We investigated the terms and concepts that individuals most strongly associate with creativity in the musical and poetic domains, and described a computational methodology for modeling these conceptual relationships. Our multidisciplinary approach employs methods inspired by computational linguistics, as well as methods from empirical psychology. There were several outcomes of this work: the output from our distributional semantics vector space model was compared with human responses on a word association task. Human-generated terms were found within the top 15 dimensions of our model’s lexical space, despite the model’s very large vocabulary. This served as validation that the Proceedings of the Sixth International Conference on Computational Creativity June 2015 124 model discovers a lexical space that encapsulates the kind of terms humans use to describe these concepts. Subsequently, the most frequently reported human terms were used as model input parameters for discovering conceptual spaces of lower dimensionality. Our model was able to find vastly reduced subspaces corresponding to MUSICAL CREATIVITY and POETIC CREATIVITY, which again captured semantically relevant terms, many corresponding directly to participants’ terms, and others extending the list of terms to insightful new dimensions. In addition, by sampling word-vectors that fall near the centroid of the discovered conceptual mappings, we aimed to find potentially useful terms for the evaluation of creativity. Although computational and AI methods have generated many systems which aim to display creative behavior or produce creative artefacts, the evaluation of computational creativity remains distinctly problematic. Therefore, we offer our method and results as a formal approach to delineating conceptuallyrelevant criteria on which to base the evaluation of creativity and creative artefacts in future studies. We saw, in terms of the most common dimensions in lexical space and the highest-mean word vectors in conceptual space, that the model is able to discover semantic categories and indices of concepts that are alligned to human conceptualizations. This said, the model did not capture all of the semantic categories cited by humans. The most noteworthy omission is in regards to emotion, as terms relating to affect and evoked emotional response were some of the most frequently cited terms for both musical and poetic creativity. Accordingly, future work will investigate why the model does not capture this cluster of emotion-related terms. Further directions for the future include the application of this computational approach to other domains, such as “culinary creativity”, both for the ontologically useful task of elaborating concepts themselves, and to create well-tailored terminology for the assessment of creative output from the corresponding domains. This methodology may also be used to approach the task of conceptual blending: rather than specifying input vectors that belong to only one concept, one may supply input dimensions from several. This could result in output terms discovered at the intersection of the lexical regions specified by the vectors’ different input dimensions. Acknowledgments The projects Lrn2Cre8 and ConCreTe acknowledge the financial support of the Future and Emerging Technologies (FET) programme within the Seventh Framework Programme for Research of the European Commission, under FET grants number 610859 and 611733, respectively. This research is also supported by EPSRC grant EP/L50483X/1. References Allott, N., and Textor, M. 2012. Lexical pragmatic adjustment and the nature of ad hoc concepts. International Review of Pragmatics 4(2). Barsalou, L. W. 1993. Flexibility, structure, and linguistic vagary in concepts: Manifestations of compositional system of perceptual symbols. In Collins, A. F.; Gathercole, S. E.; Conway, M. A.; and Morris, P. E., eds., Theories of Memory. Hove: Lawrence Erlbaum Associates. 29–101. Clark, S. 2015. Vector space models of lexical meaning. In Lappin, S., and Fox, C., eds., The Handbook of Contemporary Semantic Theory. Wiley-Blackwell. Fauconnier, G., and Turner, M. 2008. The way we think: Conceptual blending and the mind’s hidden complexities. Basic Books. Gardenfors, P. 2000. ¨ Conceptual Space: The Geometry of Thought. Cambridge, MA: The MIT Press. Grice, H. P. 1969. Utterer’s meaning and intention. The Philosophical Review 78(2):147–177. Harris, Z. 1957. Co-occurrence and transformation in linguistic structure. Language 33(3):283–340. Heath, D.; Norton, D.; Ringger, E.; and Ventura, D. 2013. Semantic models as a combination of free association norms and corpus-based approaches. In IEEE International Conference on Semantic Computing, 48–55. Hesse, M. B. 1963. Models and Analogies in Science. New York: Sheed and Ward. Hill, F.; Korhonen, A.; and Bentz, C. 2014. A quantitative empirical analysis of the abstract/concrete distinction. Cognitive Science 38:162–177. Jager, G. 2009. Natural color categories are convex sets. In ¨ Logic, Language and Meaning - 17th Amsterdam Colloquium. Koestler, A. 1964. The Act of Creation. : Hutchinson. Landauer, T.; Laham, D.; Rehder, B.; and Schreiner, M. E. 1997. How well can passage meaning be derived without using word order? a comparison of latent semantic analysis and humans. In Proceedings of the 19th Annual Conference of the Cognitive Science Society, 412–417. Mikolov, T.; Chen, K.; Corrado, G.; and Dean, J. 2013. Ef- ficient estimation of word representations in vector space. In Proceedings of ICLR Workshop. Ontanon, S.; Zhu, J.; and Plaza, E. 2012. Case-based story ´ generation through story amalgamation. In Proceedings of the ICCBR 2012 Workshops, 223–232. Citeseer. Pennington, J.; Socher, R.; and Manning, C. D. 2014. Glove: Global vectors for word representation. In Conference on Empirical Methods in Natural Language Processing. Schutze, H. 1992. Dimensions of meaning. In ¨ Proc. ACM/IEEE Conference, 787–796. Turney, P. D., and Patel, P. 2010. From frequency to meaning: Vector space models of semantics. Journal of Artifi- cial Intelligence Research (37):141–188. Veale, T. 2006. An analogy-oriented type hierarchy for linguistic creativity. Knowledge-Based Systems. Wittgenstein, L. 1953. Philosophical Investigations. Oxford: Basil Blackwell. 3rd ed. Trans. G. E. M. Anscombe. Proceedings of the Sixth International Conference on Computational Creativity June 2015 125