A Creative Analogy Machine: Results and Challenges Diarmuid P. O’Donoghue Department of Computer Science, National University of Ireland Maynooth, Co. Kildare, Ireland. diarmuid.odonoghue@nuim.ie Mark T. Keane Department of Computer Science and Informatics, University College Dublin, Ireland. mark.keane@ucd.ie Abstract Are we any closer to creating an autonomous model of analogical reasoning that can generate new and creative analogical comparisons? A three-phase model of analogical reasoning is presented that encompasses the phases of retrieval, mapping and inference validation. The model of the retrieval phase maximizes its creativity by focusing on domain topology, combating the semantic locality suffered by other models. The mapping model builds on a standard model of the mapping phase, again making use of domain topology. A novel validation model helps ensure the quality of the inferences that are accepted by the model. We evaluated the ability of our tri-phase model to re-discover several hcreative analogies (Boden, 1992) from a background memory containing many potential source domains. The model successfully re-discovered all creative comparisons, even when given problem descriptions that more accurately reflect the original problem – rather than the standard (post hoc) representation of the analogy. Finally, some remaining challenges for a truly autonomous creative analogy machine are assessed. Introduction Analogy has a long and illustrious history within creativity, particularly within scientific and intellectual contexts (Brown, 2003). Many episodes of scientific creativity are driven by analogical comparisons (Dunbar and Blanchette, 2001), often involving image related analogies (Clement, 2008). Much progress has been made in cognitive science on modeling this analogical reasoning process (see below), prompting the following questions. Are we any closer to creating an autonomous model of the analogical reasoning that can generate new creative analogies? What progress has been made towards such a creative analogy model? What are the main challenges that lie ahead? In this paper we envisage a creative process that can take any given target description and using a pre-stored collection of domain descriptions, identify potentially creative source domains with which to re-interpret the given problem. This paper explores and evaluates the potential for a model of analogy to act as a creativity engine. While Boden (1992) argues that analogy is effectively the lowest form of creativity (improbable), we argue that analogical creativity should be seen a part of a cohesive human reasoning system. If the inferences mandated by an analogy contradicts a fundamental belief, especially one that has accrued many consequent implications, then resolving this contradiction might well involve the “shock and amazement” of transformational creativity. As such, it appears that analogies may drive creativity at any of Boden’s levels of creativity. Our creativity model is domain independent and does not include a pragmatic component or domain context. So, as our model does not use domain-specific knowledge, arguably it cannot be easily cast as improbable, exploratory or transformational creativity (Boden, 1992). The current work was driven by three main aims. Firstly, we wished to assess the creative potential of a three-phase model of analogy. Secondly, we wished to assess the impact of using differing knowledge bases upon the creative potential of our analogy model. Finally, we wished to assess the wider implications of analogical models for computational creativity. Is a three-phase model either necessary or sufficient to function as an engine of creativity? Can such a model re-discover analogies considered to be creative by people? Since people often overlook analogies (Gick and Holyoak, 1980) even when they are present, will such a model uncover many creative analogies or are creative analogies, in some way, different and rare? We see the current model as being potentially useful in three distinct ways, but for now we do not commit to using it in one particular manner. Firstly, it could be used as a simple model of creativity, yielding creative interpretations for a presented problem. Secondly, it could be used as a tool to assist human creativity; suggesting source domains to people, to enable them to re-interpret a given target problem. Finally, it could be used as one possible model of how people analogize in a creative means. The paper is structured as follows: first we describe the Kilaza1 model for generating creative analogies, briefly illustrating its operation on the famous atom:solar-system 1 Kilaza is not an acronym. International Conference on Computational Creativity 2012 17 analogy. Then we present results that reflect the model’s ability to re-generate some well-known h-creative analogies (Boden, 1992). Finally, the implications of these results are assessed and some remaining challenges are discussed. Analogy as an Engine of Creativity An analogy is a conceptual comparison between two collections of concepts, a source and target (Gentner, 1983), such that the source highlights particular aspects of the target, possibly suggesting some new inferences about it. In creative analogies, an productive source domain conjures up a new and revolutionary interpretations of the target domain, triggering novel inferences that help explain some previously incongruous phenomena or that help integrate some seemingly unrelated phenomena (Boden, 1992; Eysenck and Keane, 1995). Creative analogies differ from “ordinary” analogies primarily in the conceptual “distance” between the source and target domains (i.e., these two domains may never have been linked before) and the usefulness of the resulting comparison. Both creative and mundane analogies appear to use the same analogical reasoning process, as described in the following section, but different in their inputs and outputs. Kekulé’s is famous for his analogy between the carbonchain and a snake biting its own tail. But this analogy could have been triggered by many alternative and more mundane source domains – from tying his own shoe-lace to buckling his belt. While many source domains could have generated the creative carbon-ring structure, Gick and Holyoak (1980) have shown most people (including Kekulé) frequently fail to notice many potential analogies. This highlights one potential advantage of a computational model, in that a model can tirelessly explore all potential analogies, returning only the most promising comparisons to a user for more detailed consideration. Thus, computational models could potentially act a tools helping people overcome one barrier; namely, their failure to perceive analogies when they are present. Kilaza Analogical Creativity Engine Keane (1994) presented a five-phase model of the analogical reasoning process, which recognises the distinct phases of representation, retrieval, mapping, validation and induction. While other authors describe slightly different subdivisions of this process, there is broad agreement on these phases. Our computational model encompasses the three central phases of analogy (see Figure 1). We highlight that Walls & Hadamard subdivide creativity into the phases of preparation, incubation, illumination and verification (Boden, 1992), which is reminiscent of several multi-phase models of analogy. The heart of our creativity model is the central mapping phase and this borrows heavily from Keane and Brayshaw’s (1988) IAM model (see also Keane, Ledgeway & Duff, 1994). Our model of the retrieval phase attempts to overcome the semantic bias suffered by many previous models, improving the diversity of the source domains that are returned. It was intended that this diversity might address the quality of novelty (Ritchie, 2001) associated with creativity, retrieving more “unexpected” and potentially creative sources. Finally, our model of the validation phase attempts to filter out invalid inferences, addressing the quality (Ritchie, 2001) factor associated with computational creativity. Ritchie (2001) identifies the essential properties of creativity as being directed, novel and useful. We argue that our model is directed in that it focuses on re-interpreting some given target domain. Our model addresses the novelty property by its ability to retrieve potentially useful but semantically distant, even disconnected, source domains. Finally, the useful property is addressed through a validation process that imposes a quality measure on the inferences that are accepted by the model. Figure 1: Kilaza is a three-phase model of Analogy Analogical Retrieval Phase-Model Existing models for analogical retrieval suffer from the limitations in the range of possible retrievals because their they either (i) focus exclusively on domain semantics (like MAC/FAC; Forbus, Gentner and Law, 1995) or (ii) focus primarily on domain semantics (like HRR; Plate, 1998). Other models -- such as ARCS (Thagard et al, 1990) and Rebuilder (Gomes et al, 2006) - supplement domain representations by elaboration from external sources (like WordNet) to widen the net to include more semantically non-identical sources. However, all of these approaches arguably over-constraint retrieval for the the proposes of creativity. We argue that a creative retrieval process must allow semantically distant and even semantically disconnected sources to be retrieved, ideally without overwhelming the subsequent phase-models with irrelevant domains. Figure 2: Topology is a key characteristic in retrieving creative source domains Gentner (1983) mentions two specific qualities are required of analogical comparisons: semantic similarity and structural similarity. The model presented in this paper performs retrieval based exclusively on structural similarity, performing retrieval based exclusively on the graph structure (or topology) of each domain description. This design decision was taken to overcome the semantic narrowness that constrains existing models, with the hope that retrieval mapping validation loves loves tom jo liz loves (b) Unrequited Love loves loves loves tom jo liz (a) Love Triangle International Conference on Computational Creativity 2012 18 this would increase the possibility of retrieving surprising and creative source domains. As the example in Figure 2 illustrates, semantics and domain topology are often intertwined. Each domain description is mapped onto a location in an n-dimensional structure space (Figure 3), where each dimension represents a particular topological quality of that domain. Structure space is somewhat akin to feature vectors (Yanner and Goel, 2006; Davies, Goel and Yanner, 2008). Image related analogies are often involved in creative comparisons (Clement, 2008) and a variety of imagebased analogy models has been developed, focusing on specific topics such as; geometric proportional (IQ type) analogies (Evans, 1967; Bohan and O’Donoghue, 2000), geo-spatial comparisons (O’Donoghue et al, 2006), spatial representations of conceptual analogies (Davies et al, 2008; Yanner et al, 2008) and reasoning about sketch diagrams (Forbus et al, 2011). Our model performs a single retrieval process for each presented target, in contrast to the iterative retrieval and spreading activation phases employed by KDSA to retrieve semantically distant sources (Wolverton and Hayes-Roth, 1994). Specific topological features used by our retrieval model include quantifying the number of objects and predicates (first order and higher order) and number of root predicates etc. Thus, the representation in Figure 4 might be mapped onto the location (4 0 2 2 0 0 1) in structure space – 4 object references, 0 high-order predicates, 2 unique firstorder relations, 2 first-order relations and 2 root predicates etc. The distinction between unique and non-unique relations, for example, distinguishes between domains repeatedly using a small number of relations and domains that typically have one instance of each relation in its description. One advantage of this scheme is that the distance between domains is not impacted by the number of domains contained in memory so the retrieval system should scale reasonably well. For the retrieval results presented later in this paper a maximum retrieval distance of 10 is imposed – and only candidate source inside this threshold are considered. Figure 3: Displacing the Locus of Retrieval within a 3D representation of n-dimensional Structure Space. Only source domains within the displaced boundary are retrieved and passed to the remaining phases of analogy. Topologically similar (i.e., homomorphic as well as isomorphic) domains are mapped onto similar locations within this topology-based structure space (O’Donoghue and Crean, 2002). To account for the inferences that were sought from any inspiring source domain, the locus of retrieval was slightly offset to account for this additional source domain material. Included in this offset is the desire for sources containing additional first-order relations and high-order relations. However, this offset has relatively little impact on the final results. (heavier nucleus electron) (attracts nucleus electron) Figure 4: Simplified Model of Rutherford’s Problem Analogical Mapping Phase-Model The model for the mapping phase is based on the Incremental Analogy Machine (IAM) model (Keane & Bradshaw, 1988; Keane et al, 1994). It consists of the three subprocesses of root-selection, root-elaboration and inference generation. Mapping proceeds as a sequence of rootselection and root-elaboration activities, gradually building up a single inter-domain mapping. Typically a domain description will consist of a small number of root predicates, each controlling a large number of (partly overlapping) lower-order predicates. Root selection Root selection identifies “root predicates” within a representation, which are typically the controlling causal relations in that domain. Each root predicate lies at the root of a tree of predicates and each root is be seen as “controlling” the relations lower down the tree. In our implementation of IAM, the root-selection process examines the “order” of each predicate. Objects are defined as order zero and first-order relations that connect two objects are defined as order one. The order of a causal relation is defined as one plus the maximum order of its arguments. Mapping begins with the highest order relations and maps any unmapped low-order root-predicates last. Root elaboration Root elaboration extends each rootmapping, placing the corresponding arguments of these relations in alignment. If these arguments are themselves relations, then their arguments are mapped in turn and so on until object arguments are mapped. Items are only added to the inter-domain mapping when they conform to the 1-to-1 mapping constraint (Gentner, 1983). Inference Generation Each analogical comparison is passed to the inference generation sub-process. Analogical inferences are generated using the standard algorithm for pattern completion CWSG – Copy With Substitution and # root predicates # object references nucleus electron heavier attracts potential source domains International Conference on Computational Creativity 2012 19 Generation (Holyoak et al, 1994). In effect, additional information contained in the source domain is carried over to the target, creating a more cohesive understanding of that target problem. Analogical Validation Phase-Model The third part of our tri-phase model is focused on analogical validation. Validation attempts to ensure that the analogical inferences that are produced are correct and useful. O’Donoghue (2007) discusses the accuracy of this validation process, using human raters to assess the goodness of inferences that were rated as either valid or invalid. However, this paper did not assess the models ability to discover creative analogies. Phineas (Falkenhainer, 1990) is a multi-phase model of analogy that incorporates a post-mapping verification process. To achieve this Phineas incorporates a model of the target domain – qualitative physics simulation - illustrating the power of embedding an analogy model within a specific problem domain. However, this qualitativesimulation process effectively limits Phineas to reasoning only about physical and physics-related analogies. The validation model presented in this paper is relatively simple, aimed at rejecting those predicates that are deemed invalid – rather than guaranteeing the validity of those inferences that are accepted. This approach helped maximise the creative potential of this model, by resisting the rejection of potentially plausible inferences. Of course, a more complex validation process could make use of problemspecific domain knowledge (where available). In the absence of such domain-specific knowledge verification and validation of the analogy could be carried using user feedback, employing Kilaza in a tool-like way. The validation phase-model is composed to two main parts. The first performs validation by comparing the newly generated inference to predicates already stored somewhere in memory. The second mode of validation is more general and driven in part by the functionally relevant attributes that play a role in analogical inference (Keane, 1985). Validation by Predicate Comparison The validation process compares newly inferred predicates (produced by CWSG) to the previous contents of memory. Inferences are firstly compared to predicates in memory, with both the agent and patient roles potentially being validated independently. This validation mechanism thus has access to the entire contents of memory, accessing predicates from any of the domains stored in that memory. This model of validation captures the advantages of simplicity and generality, but it does of course mean that dependencies between arguments are not captured. This limitation was deemed acceptable within the context of our desire for a creativity engine. While many simple inferences were validated by this mechanism, many creative inferences were not. This may be partly attributed to the relatively small number of predicates contained in memory and to the novelty associated with creative inferences. To address this shortcoming validation using functional attributes was introduced. Validation with Functional Attributes Functional attributes specify necessary attribute requirements for each role of a predicate – being inspired by the functionally relevant attributes of Keane (1985). Functional attributes are intrapredicate constraints that ensure each predicate appears to be a plausible combination of a relation coupled with each of its arguments. It should be pointed out that functional attributes have only been used with first-order predicates – those whose arguments are objects. Although validating higher-order (causal) relations might make use of the spatio-temporal contiguity associated with causality, but this cannot be relied upon (Pazzani, 1991) and is not enforced by our model. Thus our model treats all causal inferences as implicitly valid. Functional attribute definitions connect each role of a predicate directly into an attribute hierarchy, whereby arguments filling those roles must conform to these attribute constraints. Kilaza stores functional attributes for both the agent and patient arguments of each relation independently. More general relations (part-of, next-to) typically have few functional attributes, whereas more specific relations (hit, eat) possess a greater number of attribute restrictions. For example the agent role of hit might require the hitter to be a physical object, whereas the agent of an eat relation might have to be a living organism or an animal. Relations that are more specific are seen to be more amenable to the validation process, while their more general counterparts are more difficult to validate accurately. In addition, functional attributes have also been used to support a form of inference adaptation. This allows an inferred relation to be adapted to a semantically similar relation that better suits the arguments that pre-existed within in the target domain. Adaptation uses the functional attributes to conduct a local search of the taxonomy, to identify a more semantically suitable relation that better fits the given arguments. Data Sets Three datasets were used to conduct experiments using the described model. These are referred to as the Professions dataset, the Assorted dataset and an Alphanumeric dataset. The dataset contained a total of 158 domains and our creativity engine attempted to find creative source analogues for a given number of target problems. It was hoped that the differing natures of these collections would provide a reasonable grounds on which to evaluate the computational model – and to assess its potential to act as a creativity engine. Professions Dataset consists of descriptions of fourteen professions, including accountant, butcher, priest and scientist. These are rather large domain descriptions created by Veale (1995) and range in size from 10 to 105 predicates (M=55.4, SD=29.3). One important feature of the Professions dataset is its reliance on many different instances of a small number of relational predicates, including control, affect, depend, and part. The domains range from using just 6 distinct relational predicates (ignoring International Conference on Computational Creativity 2012 20 duplicates) to the most diverse domain that uses 15 (M=8.9, SD=2.2). Another important feature is that this dataset does not appear to use a set of clearly identifiable high-order relations (such as a cause, result-in or inhibit) between first-order predicates. Assorted Dataset consists of a large number of smaller and more varied domain descriptions, including many of the frequently referenced domains in the analogy literature; such as the solar-system, atom, heat-flow and water-flow domains. It also includes an assortment of other domains describing golf, soccer and story-telling. The 81 domains of the Assorted dataset use 108 distinct (ie non-repeated) relations. Each of these domains contains between 1 and 15 predicates (M=4.16, SD = 2.9). The average number of distinct relational predicates in each domain is M=3.48, indicating that most relational predicates are used just once in each of the Assorted domains. Alphanumeric Dataset One final dataset contained 62 semantically constrained domains. However, these domains contained a great deal of topological diversity. It was hoped that this mixture of topologies might support some novel comparisons and inferences and provided a counterpoint to the semantic richness of the other domains. Example: p-Creative Re-Discovery of Rutherford’s Analogy Before presenting detailed results, we will first see how Kilaza can re-discover Rutherford’s famous solarsystem:atom analogy. We highlight that this is a test for the p-creativity (Boden 1992) of our model – though not necessarily a model how Ernest Rutherford actually conducted his own reasoning. The traditional representation of this analogy (Figure 5) is heavily based on a post hoc description of the domains involved. These descriptions and are heavily influenced by the analogy itself. We shall first look at the traditional representation of this domain, before examining how our model can also deal with more realistic version of how Rutherford might have thought of the target problem before arriving at his famous comparison. Figure 5: Traditional representation of Rutherford’s Solution First, the semantically impoverished target problem (Figure 4) is mapped onto its location in structure space. We highlight that the “locus of retrieval” is slightly displaced from the targets original location to account for the additional information that one expects to be found in a useful source domain. In this instance the desired source was retrieved at a distance of just over 6 “units” in structure space. The desired source (the solar-system domain) and all other candidate sources near the locus of retrieval were passed in turn to the mapping and validation phases of the model. In total 10 other candidate source domains that were retrieved also generated inferences, most yielding only one inference each. Three domains generated more than one candidate inference – but all three were different versions of the solar-system domain. We point out that our semantic “free” retrieval process can also trigger identification of the same source, even if it was represented in a number of alternate ways (O’Donoghue, 2007). Our mapping model successfully generated the correct inter-domain mapping and CWSG generated the desired inferences without adaptation. Representation Issues in de novo Discovery of p-creative analogies We argue that the traditional presentation of Rutherford’s analogy is a simplified pedagogical device (Figure 5). This description of the target problem effectively removes much of the complexity of the real discovery task as encountered by Rutherford. The description of the target problem uses terminology specifically designed to accentuate the semantic (and structural) similarity that is the result of Rutherford’s comparison – and should not be treated as an input when re-creating this creative episode. This distinction between the problem domain as it would have existed before the creative analogy and its subsequent representation after discovering that analogy is a serious problem - one that is easily overlooked. Any model that attempts to re-discover known creative analogies must address the original problem, not just the representation that accentuates the desired similarity. Differences in domain terminology and topology are central to the distinction between elaborating a given analogy, and the much more difficult task of generating a novel h-creative (or pcreative) analogy (Boden, 1992). We argue that generating Rutherford’s analogy using the representation in Figure 6 is a far better test of a models creative ability, than the normal post hoc representation in Figure 5. Terminological differences are particularly prevalent in distant between-domains analogies as the firstorder relationships describing the problem domains originate in different disciplines. When modeling analogical creativity, we must expect to encounter these differences in terminology, and our models of retrieval, mapping and validation must be able to overcome these problems. Ernest Rutherford would most likely have thought of the target relation between the nucleus and electron as electromagnetic-attraction, and not the more generic attracts relation. The corresponding relationship between source’s sun and planet is gravitation. It is only after he found the analogy (which involved mapping revolves nucleus electron attracts heavier and cause International Conference on Computational Creativity 2012 21 electromagnetic-attraction with gravitation) that these relationships can be generalized to a common super-class like attracts (Gentner, 1983). We point out that our model can operate successfully on either the simplified or more realistic domain descriptions. This is primarily the result of our retrieval and mapping models using domain topology, rather than using identicality (or similarity) between the predicates in both domains. Figure 6: More realistic representation of Rutherford’s Analogy Results of Individual Phase Models We shall first briefly examine the performance of the retrieval and validation models in isolation, before looking at their combined performance in the next section. We shall briefly examine the results of the mapping model, but our focus will remain on the inferences that it produced. Results were produced from a memory containing the three previously described datasets. Retrieval Results Retrieval was performed in structure space. The distance between domains in structure space varied from 2.645 to 230 (M= 80, SD=57.3), with a large number of domains being given a unique structural index in this space. A small number of locations contained multiple domains – these mostly involved small domains of just a few predicates from the Assorted dataset. Retrieval and Mapping A broad tendency was identified between structure-based retrieval and the size of the resulting inter-domain mapping, although the correlation was low. A range effect was identified between structure space and the size of the resulting mapping, indicating that larger distances between domains in structure space tend to produce smaller inter-domain mappings. This indicates a weak connection between structure-based retrieval and the size of any resulting mappings. Validation Results Although the validation model was very simplistic, it proved surprisingly effective. For example with the inferences generated on the Professions dataset, the average (human) rating awarded to predicates that Kilaza categorized as valid was M=2.62 (SD=2.09), while the average rating awarded to the invalid predicates was M=1.57 (SD=1.23). As ratings were given between 1 and 7 with 7 representing clearly valid inferences, this indicates that many of the generated inferences were of rather poor quality. Adaptation Results In addition, 24 inferences were passed to the adaptation process and 20 of these were adapted. While we cannot realistically assess if these adapted inferences matched what was “intended” by our analogy model, we did assess the validity of these inferences using two human raters. When we look at human ratings for the 20 adapted predicates before and after adaptation, we see that the average ratings were increased by the adaptation process - from 1.57 (SD=1.23) to 2.57 (SD=1.70). The average ratings of the adapted predicates was broadly in line with the predicates from Kilaza’s valid category above (M=2.62, SD=2.09). Before adaptation, 18 of the 20 (90%) predicates were given rated as invalid and after adaptation just 12 (60%) were rated as invalid. Thus, adaptation has a distinct influence on improving the ratings of the rejected inferences. It may well be argued that this adaptation process is itself somewhat creative – identifying new relations that better fit the available target arguments. In contrast to the top-down nature of the creative analogy approach, predicate adaption is a very much a bottom-up process that is motivated by the detection of a potential analogical comparison. Creativity Test Results To assess the creative potential of our model, we assess its performance at the p-creative task of re-creating some well-known h-creative analogies (Boden, 1992). These include some of the famous examples of creative analogical comparisons including the Rutherford’s solarsystem:atom analogy, the heat-flow:water-flow and the tumour:fortress analogies. Our descriptions are based on the standard representation of these domains as found in the analogy literature. Creative Retrieval We now examine the performance of our model on the creative retrieval task. We presented our model with the target domain of each of 10 creative analogies, together with a memory of 158 source domains. From this memory of 158 potential sources, the retrieval model selected a number of these domains as candidate sources. Only the selected candidate sources were passed to the mapping and validation phase-models. Evaluating only the selected source domains was necessary in order to avoid an exhaustive search through all possible analogical comparisons. While computationally feasibly in this instance, an exhaustive search would be impractical on a larger collection of domains. Before looking at the results, we point out that many comparisons did not generate a viable inter-domain mapping. Furthermore, most analogies did not generate any valid inferences. The following results ignore these unproductive comparisons and we focus only on the productive analogies. All of the desired creative sources were among the candidate sources that were retrieved by the model. This gives nucleus electron electromagnetic attraction heavierthan sun planets gravitation more-massive than International Conference on Computational Creativity 2012 22 our retrieval model a recall value of 100% on this creative retrieval task. While a large number of other candidate sources were also retrieved, this was still a pleasantly surprising result. The distance within structure space between the target and the creative sources ranged from 3.1 to 7.9, suggesting that structure based retrieval was reasonably accurate in locating candidate sources. The precision of the retrieval processing is summarised in Figure 7. As can be seen, precision was above 0.2 for two problems showing that few other sources were located near the structural index of those targets. However, precision was much lower for most problems, indicating that the desired source was merely one of a larger number of candidate sources that had to be explored. Figure 7 –Precision of retrieval for 10 Creative Analogies Creative Inferences Next we summarise the inferences that were generated by each of these comparisons (Table 1). These results implicitly encompass a productive interdomain mapping between the target and each candidate source in turn. Kilaza generated and validated the correct inferences for 9 (70%) of the creative analogies. The cycling:driving analogy correctly generated no inferences. Target Correct Inferences Validated Inferences Atom: Solar-System y 4 Atom-Falkenhainer: Solar-SystemFalkenhainer y 3 General: Surgeon y 4 Heat-flow: Water-Flow y 4 Leadbelly : Caravaggio y 4 Love-triangle: Triangle-Directed y 0 Requited-love: Love Triangle y 3 Fish : Bird y 4 Vampire : Banker y 3 Cycling: Driving n 0 Table 1 – Number of Inferences generated by different analogies One of these analogies also required one inference to be adapted. The bird:fish analogy generated the inference (flies-through fish water), which was correctly adapted to (swim fish water). Conclusion We presented a three-phase model of analogy, adapting it to function as a tool for discovering creative analogies. This model encompasses the three central phases of analogy, namely retrieval, mapping and validation. We argue that a model encompassing these three core phases of analogy is the minimum required to be considered a model of analogical creativity. Our retrieval model overcomes the semantic bias of previous retrieval models, helping retrieve new and surprising source domains. This helps to improve the novelty of the source domains identified by our creativity engine. Our model of the post-mapping validation phase attempts to filter out any clearly invalid inferences, thereby improving the quality of the analogies identified as being creative. We note that novelty and quality are two attributes strongly associated with creativity (Ritchie, 2001). Our three-phase model of analogy successfully rediscovered 10 examples of creative analogies, including the heat-flow:water-flow and solar-system-atom analogies. In doing so, the model retrieved the correct source from a large memory of potential sources. It then developed the correct mapping and successfully validated (and adapted) the resulting inferences. We point out that these analogical comparisons, if produced by a human analogizer, would be considered creative. Our focus on creative analogies rather that the more normal (or pedagogical) analogies had a far-reaching impact on the model. Terminological differences are particularly prevalent in creative between-domains analogies, as the first-order relations describing each domain originate in different disciplines. When modeling analogical creativity, we must expect to encounter these differences and cannot rely heavily on the presence of identical relations. Our model successfully created Rutherford’s famous solarsystem:atom analogy, even when the target was represented in a more realistic and challenging from. Our model shows that very significant progress has been made towards an autonomous creativity machine, re-discovering many creative analogies. We briefly outline three remaining challenges to analogical creativity, beginning with the issue of knowledge representation. Our results illustrate a trade-off between the specifity and the generality of domain descriptions. Overly specific representations make comparisons more difficult to discover, but overly general representations appear too profligate and can overwhelm the validation (and subsequent) processes. Perhaps multiple representations of each domain might offer a useful avenue for progress. Multiple representations might also help explain why exerts are more fluent in their use of analogy within their own domains (Dunbar and Blanchette, 2001). Our model does not currently include an explicit re-representation process International Conference on Computational Creativity 2012 23 highlighting “tiered identicality” (Gentner and Kurtz, 2006). It seems that the greatest challenge to computational analogizing might lie with the post-mapping phases. Challenges include assessing analogical inferences for validity, evaluating the significance of an analogy and considering the implications of creative comparisons. Surprisingly little attention has been given to this phase – partly because of its ultimate dependency on the target problem domain. Phineas (Falkenhainer, 1990) and also Rebuilder (Gomes et al, 2006) showed that integration of the analogy and case-based reasoning within the target domain can have very positive effects. While tight integration of all target domains into an analogy model seems most unlikely, Kilaza has show that a generic validation model can play a part improving the quality of the inferences that are accepted. Overall, the results presented in this paper highlight that a three-phase model of analogical reasoning can operate successfully as a model of analogical creativity. Our results highlight the improbability of finding a suitable source domain to re-interpret a given target in a creative manner. Extending this model will necessitate a tighter integration of the analogy process with other facets of intelligence. References Boden, M. 1992. The Creative Mind. London: Abacus. Bohan, A. O’Donoghue, D.P. 2000. A Model for Geometric Analogies using Attribute Matching, Proc AICS, Galway, Ireland, pp 110-119. Brown, T.L. 2003. Making Truth: Metaphor in Science. University of Illinois Press, New York, USA. Clement, J. 2008. Creative Model Construction in Scientists and Students: The Role of Imagery, Analogy, and Mental Simulation. Dordrecht: Springer. Davies, J. Goel, A.K. Yaner P.W. 2008. Proteus:Visuospatial analogy in problem-solving, KnowledgeBased Systems 21(7): 636-654. Dunbar, K. and Blanchette, I. 2001. The in vivo/in vitro approach to cognition: The case of analogy, Trends in Cognitive Sciences, 5(8): 334–339. Falkenhainer, B. 1990 A Unified Approach to Explanation and Theory Formation, in Shrager, J. & Langley, P. (eds.) Computational Models of Scientific Discovery and Theory Formation, Morgan Kaufman: CA. pp 157-196. Forbus, K. Gentner, D. Law K. 1995. MAC/FAC: A Model of Similarity-based Retrieval, Cognitive Science, 19(2): 141-205. Forbus, K. Usher, J. Lovett, A Lockwood, K. Wetzel J. 2011. CogSketch: Sketch Understanding for Cognitive Science Research and for Education, Topics in Cognitive Science, 3(4): 648–666. Gentner, D. Kurtz, J. 2006. Relations, Objects, and the Composition of Analogies, Cognitive Science, 30(4): 609– 642. Gentner, D. 1983. Structure-mapping: A theoretical framework for analogy, Cognitive Science, 7(2): 155–170. Gomes P, Seco N, Pereira FC, Paiva P, Carreiro P, Ferreira JL, Bento C. 2006. The importance of retrieval in creative design analogies, Knowledge-Based Systems, 19(7): 480– 488. Holyoak K. J. Novick L. Melz E. 1994. Component Processes in Analogical Transfer: Mapping, Pattern Completion and Adaptation, in Analogy, Metaphor and Reminding, Eds. Barnden and Holyoak, Ablex, Norwood, NJ. Keane, M.T. 1985. On Drawing Analogies When Solving Problem, British Journal of Psychology, 76: 449-458. Keane, M.T., Bradshaw, M. 1988. The Incremental Analogical machine: In D. Sleeman (Ed.) 3rd European Working Session on Machine Learning, Kaufmann CA: 53–62. Keane, M.T., Ledgeway, T. Duff, S. (1994). Constraints on analogical mapping, Cognitive Science, 18: 387-438. O'Donoghue, D.P. 2007. Statistical Evaluation of ProcessCentric Computational Creativity: 4th International Joint Workshop on Computational Creativity (IJWCC), Goldsmiths, University of London, 17-19 June. O'Donoghue, D.P, Bohan, A. Keane M.T, 2006. Seeing Things: Inventive Reasoning with Geometric Analogies and Topographic Maps, New Generation Computing, Ohmsha Ltd. and Springer 24(3): 267-288. O'Donoghue, D.P. Crean, B. 2002. Searching for Serendipitous Analogies, ECAI - Workshop on Creative Systems, Lyon, France, 21-26 July. Pazzani, M. 1991. A Computational Theory of Learning Causal Relationships, Cognitive Science, 15(3): 401-424. Plate T. 1998. Structured Operations with Distributed Vector Representations, in Advances in Analogy Research, New Bulgarian University, Sofia, Bulgaria. Ritchie G. 2007. Some Empirical Criteria for Attributing Creativity to a Computer Program, Minds & Machines, 17: 67–99. Ritchie G. 2001. Assessing Creativity, Proc. AISB Symposium on AI and Creativity, York, March. Thagard, P. Holyoak K. J. Nelson, G. Gochfeld, D. 1990. Analogue Retrieval by Constraint Satisfaction, Artificial Intelligence, 46: 259-10. Veale, T. 1995. Metaphor, Memory and Meaning, PhD Thesis, Trinity College, Dublin, Ireland. Wolverton M. Hayes-Roth, B. 1994. Retrieving Semantically Distant Analogies with Knowledge-Directed Spreading Activation, Proceedings AAAI, pp 56-61. Yaner, P.W. Goel A.K. 2006 Visual analogy: Viewing analogical retrieval and mapping as constraint satisfaction problems, Applied Intelligence, 25(1): 91-105. International Conference on Computational Creativity 2012 24