<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Department of Knowledge Technologies</title>
	<atom:link href="https://kt.ijs.si/feed/" rel="self" type="application/rss+xml" />
	<link>https://kt.ijs.si</link>
	<description></description>
	<lastBuildDate>Thu, 28 May 2026 10:10:25 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://kt.ijs.si/wp-content/uploads/2021/04/cropped-Knowledge-tehnologies_favicon_s-32x32.png</url>
	<title>Department of Knowledge Technologies</title>
	<link>https://kt.ijs.si</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Member of our department recieved Best Poster Award at the Future of Integrated Assessment Models: Pathways Towards Carbon Neutrality for Climate, Environment, Health and Socio‐economic Co‐benefits</title>
		<link>https://kt.ijs.si/news/member-of-our-department-recieved-best-paper-award-at-the-future-of-integrated-assessment-models-pathways-towards-carbon-neutrality-for-climate-environment-health-and-socio%e2%80%90economic-co/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Thu, 28 May 2026 09:19:40 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7441</guid>

					<description><![CDATA[A member of our department, Sabin Roman, received the award for best poster at the conference Future of Integrated Assessment Models: Pathways Towards Carbon Neutrality for Climate, Environment, Health and Socio‐economic Co‐benefits with his poster titled: Modelling the collapse of complex societies. Congratulations!]]></description>
										<content:encoded><![CDATA[<p>A member of our department, Sabin Roman, received the award for best poster at the conference <a href="https://www.we-heraeus-stiftung.de/veranstaltungen/future-of-integrated-assessment-models-pathways-towards-carbon-neutrality-for-climate-environment-health-and-socio-economic-co-benefits"><em>Future of Integrated Assessment Models: Pathways Towards Carbon Neutrality for Climate, Environment, Health and Socio‐economic Co‐benefits</em></a> with his poster titled: <a href="https://kt.ijs.si/wp-content/uploads/2026/05/Poster___Modelling_of_societal_growth_and_collapse.pdf"><em>Modelling the collapse of complex societies</em></a>.</p>
<p>Congratulations!</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Taja Kuzman Pungeršek successfully defended her doctorate thesis</title>
		<link>https://kt.ijs.si/news/taja-kuzman-pungersek-successfully-defended-her-doctorate-thesis/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Fri, 15 May 2026 11:34:48 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7438</guid>

					<description><![CDATA[Taja Kuzman Pungeršek successfully defended her doctorate thesis titled Robust Multilingual Automatic Genre Identification in Texts. Congratulations! Abstract: Collecting texts from the web has significantly accelerated the creation of large text datasets which are essential for the development of advanced language technologies, including large language models. However, because these texts are gathered automatically, their linguistic [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>Taja Kuzman Pungeršek successfully defended her doctorate thesis titled<em> Robust Multilingual Automatic Genre Identification in Texts.</em></p>
<p>Congratulations!</p>
<p>Abstract:</p>
<p>Collecting texts from the web has significantly accelerated the creation of large text datasets<br />
which are essential for the development of advanced language technologies, including large<br />
language models. However, because these texts are gathered automatically, their linguistic<br />
and functional characteristics are largely unknown, limiting their reliable use in research<br />
and applications. Automatic genre identification, a text classification task that categorizes<br />
texts into specific genre categories, provides key insights into such large-scale text<br />
collections and enables their filtering for applications in language technology and linguistic<br />
research.<br />
This thesis advances automatic genre identification for web-scale multilingual text data<br />
through the creation of novel genre schemata, the development of manually-annotated<br />
genre datasets, and the exploration of robust machine learning methods. First, the<br />
study introduces new genre schemata designed to improve annotation reliability and<br />
cross-schema comparability, thereby addressing a key limitation of prior work where<br />
incompatible schemata hindered meaningful comparison across studies. We demonstrate<br />
that high-quality manual annotation with acceptable inter-annotator agreement can<br />
be achieved through a carefully designed genre schema, detailed guidelines, and the<br />
employment of expert annotators.<br />
To address the lack of high-coverage genre datasets in our target languages, we develop<br />
high-quality, manually-annotated genre datasets in Slovenian and English, as well as<br />
multilingual test collections spanning eleven typologically diverse languages and different<br />
scripts. Building on the test datasets, we establish the AGILE benchmark, which enables<br />
standardized, reproducible cross-dataset and cross-lingual evaluation of genre classifiers.<br />
The benchmark also supports a systematic comparison of modern large language models<br />
across languages.<br />
Using this infrastructure, we develop genre classifiers that achieve robust performance<br />
across various languages, datasets, and evaluation settings. To ascertain which machine<br />
learning methodology offers the most robust generalization, we experiment with a range<br />
of text classification techniques, ranging from traditional non-neural machine learning<br />
methodologies to cutting-edge approaches based on large language models. Our findings<br />
indicate that a BERT-like model, fine-tuned on our newly developed training dataset,<br />
achieves state-of-the-art performance across various datasets and languages, including languages<br />
using non-Latin scripts and languages not closely related to the fine-tuning languages.<br />
The resulting model is publicly released, providing a practical tool for large-scale<br />
automatic genre annotation of multilingual web text collections.<br />
Furthermore, we propose the LLM Teacher-Student Framework, a novel approach for<br />
training text classifiers without manually-annotated data. By leveraging a large language<br />
model to generate training labels, this method enables scalable, cost-efficient development<br />
of genre classifiers, particularly for low-resource languages and specialized domains, and is<br />
applicable beyond genre identification to other text classification tasks.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>AI for Materials Science: Tuning Laser-Induced Graphene Production</title>
		<link>https://kt.ijs.si/aijsi-seminar-upcoming/ai-for-materials-science-tuning-laser-induced-graphene-production/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Wed, 08 Apr 2026 13:42:34 +0000</pubDate>
				<category><![CDATA[AI@JSI Seminar Upcoming]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7422</guid>

					<description><![CDATA[23 Apr 2025 at 13:00]]></description>
										<content:encoded><![CDATA[<p>AI and machine learning have advanced the state of the art in many application domains. We present an application to materials science; in particular, we use surrogate models with Bayesian optimization for automated parameter tuning to optimize the fabrication of laser-induced graphene. This process allows to create thin conductive lines in thin layers of insulating material, enabling the development of next-generation nano-circuits. This is of interest for example for in-space manufacturing. We are able to achieve improvements of up to a factor of two compared to existing approaches in the literature and to what human experts are able to achieve, in a reproducible manner. Our implementation is based on the open-source mlr and mlrMBO frameworks and generalizes to other applications.</p>
<p>Zoom link: <a href="https://zoom.us/j/92010489234?pwd=TnP9ZtXTfT0aZayMJEHaNBlInprgNU.1" target="_blank" rel="noopener">Click me.</a></p>
<p><strong>Note:</strong> The lecture room is different this time: instead of the usual E-lecture room, the lecture will be held in the JSI Physics Seminar Room (Jamova 39, Building A, Ground Floor, Room 106).</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>From Understanding People to Securing AI: A Human-Centered Research Journey Through Large Language Models</title>
		<link>https://kt.ijs.si/aijsi-seminar/from-understanding-people-to-securing-ai-a-human-centered-research-journey-through-large-language-models/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Tue, 10 Mar 2026 08:38:53 +0000</pubDate>
				<category><![CDATA[AI@JSI Seminar]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7387</guid>

					<description><![CDATA[The talk presents a connected view of recent research on large language models (LLMs) through a human-centered lens, moving from what these systems can infer about people to how they interact with them, where they present security risks, and why such vulnerabilities matter in socially consequential settings. It begins with work on how LLMs can [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>The talk presents a connected view of recent research on large language models (LLMs) through a human-centered lens, moving from what these systems can infer about people to how they interact with them, where they present security risks, and why such vulnerabilities matter in socially consequential settings. It begins with work on how LLMs can infer personality from short texts and on the role of communication style in shaping user experience and task outcomes, showing both the potential of LLMs for personalization and the importance of designing interaction carefully across contexts. It then broadens to questions of trustworthiness and toxicity, covering security risks in prompt-based interaction, attack surfaces, and the growing challenge posed by multilingual, multimodal, and autonomous jailbreaks. Further, it examines representational harms through methods for measuring gender bias in gendered and under-resourced languages. The talk is concluded with a high-stakes application of mental health crisis response, where clinically informed evaluation reveals that increasing model capability does not automatically translate into safe, appropriate, or context-aware behavior. Across these topics, the unifying theme is that progress in LLMs should be matched by rigorous work on evaluation, safety, fairness, and responsible deployment.</p>
<p>Zoom link: <a href="https://zoom.us/j/91818829507?pwd=4vBcLWxNYJBaMHPlG80l0NzVnXrNa7.1" target="_blank" rel="noopener">Click me. (Audio issues from last time are being actively addressed.)</a></p>
<p>The seminar is also organized under <a href="https://kt.ijs.si/project/elliot/">the ELLIOT project</a>.<br />
<img decoding="async" class="alignnone size-medium wp-image-7401" src="https://kt.ijs.si/wp-content/uploads/2026/03/ELLIOT_blue-01-300x134.jpg" alt="" width="300" height="134" /></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>AI-enabled Mammography</title>
		<link>https://kt.ijs.si/aijsi-seminar/ai-enabled-mammography/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Thu, 05 Mar 2026 12:18:05 +0000</pubDate>
				<category><![CDATA[AI@JSI Seminar]]></category>
		<category><![CDATA[AI@JSI]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7382</guid>

					<description><![CDATA[Breast cancer is the most common malignancy and leading cause of cancer deaths in women around the world. The preferred diagnostic and prevention method used to fight breast cancer is mammography, being effective, cheap, reliable and suitable for screening large populations. The analysis of mammograms requires considerable effort from qualified radiologists, who have been increasingly [&#8230;]]]></description>
										<content:encoded><![CDATA[<p style="font-weight: 400;">Breast cancer is the most common malignancy and leading cause of cancer deaths in women around the world. The preferred diagnostic and prevention method used to fight breast cancer is mammography, being effective, cheap, reliable and suitable for screening large populations. The analysis of mammograms requires considerable effort from qualified radiologists, who have been increasingly relying on computer-aided diagnosis and the development of tools that require little supervision could save the lives of hundreds of thousands of women across the world.</p>
<p style="font-weight: 400;">In recent years, the integration of Artificial Intelligence (AI) in the field of medicine has brought about a significant transformation in healthcare delivery. Computer vision technology finds itself at the forefront of this proces. Its applications to mamography data represent an active research field, stimulated by the fact that among the clinical imaging modalities, mammography stands out in terms of high spatial resolution (full-field digital mammography systems usually produce images at resolutions ranging from 1920&#215;2304 to 4708&#215;5844 pixels.</p>
<p style="font-weight: 400;">I will discuss research work stemming from projects conducted at the Institute for Artificial Intelligence Research and Development of Serbia, aimed at implementing AI-assisted mamography screening at the national level. Viewed through the lens of medical applications, the talk will cover the evolution of state-of-the-art research in the domain of computer vision and multimodal AI over the last 5 years, from the applications of “classical” convolutional neural networks and visual transformer models, explainable AI extensions, generative (diffusion) models, to self supervised and multiple instance learning approaches to address specific challenges and (downstream) tasks such as: image classification, lesion detection and synthetic mammogram generation.</p>
<p>Zoom link: <a href="https://zoom.us/j/95911473672?pwd=aaOhXi58vSa5eTvkdavW1SwQvccISu.1" target="_blank" rel="noopener">Click me. (Audio issues from last time are being actively addressed.)</a></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>SLaLoM 2026 workshop and EMMA meeting</title>
		<link>https://kt.ijs.si/events/slalom-2026-workshop-and-emma-meeting/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Tue, 17 Feb 2026 09:10:33 +0000</pubDate>
				<category><![CDATA[Events]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7376</guid>

					<description><![CDATA[On the 12th and 13th of February 2026, the SLaLoM 2026 workshop (2nd Slovenian workshop on Large Language Models Techniques and Applications) and an intensive EMMA (EMMA: Embeddings-based techniques for Media Monitoring Applications) meeting were held in Kranjska Gora, Slovenia. The SLaLoM 2026 proceedings will be made available on EMMA website.]]></description>
										<content:encoded><![CDATA[<p>On the 12th and 13th of February 2026, the <strong>SLaLoM 2026 workshop </strong>(2nd Slovenian workshop on Large Language Models Techniques and Applications<strong>)</strong> and an intensive <strong>EMMA</strong> (EMMA: Embeddings-based techniques for Media Monitoring Applications) meeting were held in Kranjska Gora, Slovenia. The SLaLoM 2026 proceedings will be made available on <a href="https://emma.ijs.si/en/2026/02/17/slalom-2026-workshop-and-emma-meeting/">EMMA website</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Grand opening of the Slovenian Artificial Intelligence Factory (SLAIF)</title>
		<link>https://kt.ijs.si/events/slaif/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Tue, 10 Feb 2026 13:18:35 +0000</pubDate>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7368</guid>

					<description><![CDATA[Prof. Sašo Džeroski, PhD, the technical coordinator of the SLAIF project, presented the project with a total value of EUR 135 million, co-funded by the Republic of Slovenia and the European programme of the EuroHPC Joint Undertaking. The SLAIF project is part of the broader European EuroHPC initiative and brings together the expertise of leading [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>Prof. Sašo Džeroski, PhD, the technical coordinator of the SLAIF project, presented the project with a total value of EUR 135 million, co-funded by the Republic of Slovenia and the European programme of the EuroHPC Joint Undertaking. The SLAIF project is part of the broader European EuroHPC initiative and brings together the expertise of leading research and educational institutions and connects them with the needs of industry.</p>
<p>👉The project’s development and activities focus on four key thematic areas: artificial intelligence for the green transition, for health and biotechnology, for the digital society, and for science. The goal is to actively promote collaboration between industry, academia, and research institutions, create opportunities for joint projects, and enable the transfer of knowledge and technologies into practice.</p>
<p><a href="https://lnkd.in/dVAhR-R8">Video</a></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Diffusion Language Models: Problem Solving and Reasoning</title>
		<link>https://kt.ijs.si/aijsi-seminar/diffusion-language-models-problem-solving-and-reasoning/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Tue, 03 Feb 2026 13:08:09 +0000</pubDate>
				<category><![CDATA[AI@JSI Seminar]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7353</guid>

					<description><![CDATA[Masked diffusion models (MDMs) offer a compelling alternative to traditional autoregressive language models. They generate strings by iteratively refining partially masked inputs in parallel. This makes them efficient, but their computational capabilities and the limitations inherent to the parallel generation process remain largely unexplored. In this talk, I will talk about what types of reasoning [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>Masked diffusion models (MDMs) offer a compelling alternative to traditional autoregressive language models. They generate strings by iteratively refining partially masked inputs in parallel. This makes them efficient, but their computational capabilities and the limitations inherent to the parallel generation process remain largely unexplored.<br />
In this talk, I will talk about what types of reasoning problems MDMs can provably solve and how efficiently they can do it. We will describe the relationship between MDMs and the well-understood reasoning frameworks of chain of thought (CoT) and padded looped transformers (LTs): We will see that MDMs and polynomially padded LTs are, in fact, equivalent, and that MDMs can solve all problems that CoT-augmented transformers can. Moreover, we will showcase classes of problems (including regular languages) for which MDMs are inherently more efficient than CoT transformers, where parallel generation allows for substantially faster reasoning.</p>
<p>Zoom link: <a href="https://zoom.us/j/91819954058?pwd=DbP4LO3XvymqNW92WDxU9Dm6aO9Tbr.1" target="_blank" rel="noopener">Click me.</a></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Learning Non-Stationary Dynamics: Oscillations, Chronotaxicity, and Time-Varying Structure in Complex Systems</title>
		<link>https://kt.ijs.si/aijsi-seminar/learning-non-stationary-dynamics-oscillations-chronotaxicity-and-time-varying-structure-in-complex-systems-2/</link>
		
		<dc:creator><![CDATA[Nina Omejc]]></dc:creator>
		<pubDate>Mon, 26 Jan 2026 10:09:48 +0000</pubDate>
				<category><![CDATA[AI@JSI Seminar]]></category>
		<category><![CDATA[AI@JSI]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7320</guid>

					<description><![CDATA[Many real-world systems—from physiology and neuroscience to climate and finance—are inherently non-stationary, operating far from equilibrium with structure that evolves in time. This talk presents a physical and data-driven framework for understanding such systems through time-varying oscillatory dynamics. Central to this approach is the concept of chronotaxic systems: non-autonomous oscillators with dynamically stable, time-dependent characteristic [&#8230;]]]></description>
										<content:encoded><![CDATA[<div>Many real-world systems—from physiology and neuroscience to climate and finance—are inherently non-stationary, operating far from equilibrium with structure that evolves in time. This talk presents a physical and data-driven framework for understanding such systems through time-varying oscillatory dynamics. Central to this approach is the concept of chronotaxic systems: non-autonomous oscillators with dynamically stable, time-dependent characteristic frequencies, providing a principled way to model robustness and adaptability without assuming stationarity or time-asymptotic behaviour.</div>
<div>I will discuss how instantaneous frequency, nonlinear mode decomposition, and multiscale ridge extraction enable the learning of interpretable latent dynamics from complex, noisy time series where classical spectral methods fail. These methods form the basis of the MODA toolbox and support reliable separation, tracking, and interpretation of interacting components with evolving amplitudes and frequencies.</div>
<div>The framework further enables inference of time-varying interactions and causal structure using wavelet-based coherence, information-theoretic measures, and Bayesian dynamical inference, including in networks with non-stationary coupling. Applications will be drawn primarily from physiology and neuroscience—such as neurovascular coupling, anaesthesia-induced phase transitions, and ageing—but the methods generalise across domains, including physical, engineered, and socio-economic systems.</div>
<div>The talk highlights how physically grounded models of non-stationary dynamics complement machine learning by providing interpretable representations, principled constraints, and reliable inference in settings where purely data-driven approaches struggle.</div>
<p>Zoom link: <a href="https://zoom.us/j/96145053296?pwd=JtaZ2z82Zz3kYaCmTAd4LF2Ab07k3K.1" target="_blank" rel="noopener">Click me.</a></p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Members of our department had a lecture on JSI colloquia</title>
		<link>https://kt.ijs.si/events/members-of-our-department-had-a-lecture-on-jsi-colloquia/</link>
		
		<dc:creator><![CDATA[Anja Glusic]]></dc:creator>
		<pubDate>Wed, 10 Dec 2025 15:15:02 +0000</pubDate>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">https://kt.ijs.si/?p=7257</guid>

					<description><![CDATA[JSI colloquia are prestigious scientific events at the “Jožef Stefan” Institute, where top lecturers present their research achievements. Colloquia have a long tradition and an international reputation. Recording is already available. Methods for semi-automated hypothesis generation from scientific literature: an open science approach The rapid growth of scientific publications makes it difficult to manually review [&#8230;]]]></description>
										<content:encoded><![CDATA[<p><a href="https://kolokviji.ijs.si/">JSI colloquia</a> are prestigious scientific events at the “Jožef Stefan” Institute, where top lecturers present their research achievements. Colloquia have a long tradition and an international reputation.</p>
<p><a href="https://video.arnes.si/watch/jl2ywq3dv3g1">Recording</a> is already available.</p>
<h4><strong>Methods for semi-automated hypothesis generation from scientific literature: an open science approach </strong></h4>
<p>The rapid growth of scientific publications makes it difficult to manually review and keep up to date with new research findings. Literature-based discovery (LBD) is a field of artificial intelligence at the intersection of natural language processing and machine learning, which enables semi-automated hypothesis generation by discovering new associations between previously unconnected scientific sources. In the lecture, we will present a selection of approaches and methods from our recently published monograph <em>Bisociative Literature-Based Discovery: Methods with Tutorials in Python</em> (Springer, 2025). We will present also a collection of Python notebooks that facilitate the reproducibility of procedures for data acquisition, text processing, hypothesis generation and their evaluation, in alignment with the principles of open science.</p>
<p><strong>About the lecturers</strong>: Nada Lavrač is a scientific councilor at the Jožef Stefan Institute and a full professor at the Jožef Stefan International Postgraduate School. Bojan Cestnik is the director of the computer engineering company Temida, a researcher at the Jožef Stefan Institute and a full professor at the School of Engineering and Management of the University of Nova Gorica. Andrej Kastrin is a researcher at the Institute for Biostatistics and Medical Informatics at the Faculty of Medicine, University of Ljubljana, and a lecturer at this faculty.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
