Author: Kai

  • Human-Centred Agentic Science: New Paradigm for Scientific Discovery

    Human-Centred Agentic Science: New Paradigm for Scientific Discovery

    Key technologies: Front end: React or Flutter and JavaScript-based visualisation such as d3.js and Observable. Back end: vector database (chroma). Machine learning: prompt engineering, agent orchestration framework (such as LangChain/LangGraph), and MCP. Background AI has been behind some of the recent scientific breakthroughs. One of the most prominent examples is the Nobel Prize work of…

  • Machine Learning for Automated Trading

    Machine Learning for Automated Trading

    Key technologies: Front end: React or Flutter. Back end: (online) relational database, especially for time-series data. Machine learning: deep learning, anomaly detection, model tuning and performance tracking (such as MLFlow), LLM, and agent orchestration framework (such as LangChain/LangGraph). The source code is here (for front end). Background The main idea is to use machine learning…

  • Human-AI Collaboration – LLM for Qualitative Analysis

    Human-AI Collaboration – LLM for Qualitative Analysis

    The recent breakthrough in Large Language Models (LLMs), such as chatGPT, and generative models, such as Stable Diffusion, can be tremendously valuable in supporting analysis and creative tasks. While powerful, such models can be difficult to use, especially for domain experts, such as qualitative researchers and visual artists, who are not experts in machine learning.…

  • Human-AI Collaboration: Generative Models for Artistic Creation

    Human-AI Collaboration: Generative Models for Artistic Creation

    The recent breakthrough in Large Language Models (LLMs), such as chatGPT, and generative models, such as Stable Diffusion, can be tremendously valuable in supporting analysis and creative tasks. While powerful, such models can be difficult to use, especially for domain experts, such as qualitative researchers and visual artists, who are not experts in machine learning.…

  • Vitality 2: Chat with Your Papers

    Vitality 2: Chat with Your Papers

    Key technologies: Front end: React or Flutter. Back end: vector database (chroma). Machine learning: LLM, RAG (such as RAGFlow), agent orchestration framework (such as LangChain/LangGraph), and MCP. You can see all the source code here. Background There are many types of qualitative analyses for academic publications. These are analyses that do not involve numerical computation…

  • Visual Analytic for Sensemaking

    Visual Analytic for Sensemaking

    What is Visual Analytics? Visual Analytics is closely related to Data Visualisation, which is also known as Information Visualisation, that helps users discover insights (from data) by presenting it visually. This takes advantage of the visual cognition system (brain is part of it), which is a very powerful computer for detecting certain patterns (we became…

  • Provenance for Sensemaking

    Provenance for Sensemaking

    What is Provenance? The word provenance originally was mainly used for art work, refers to ‘the history of ownership of a valued object or work of art or literature’. For a painting, this includes the information of who painted it when and how the painted changed hands over time. More recently, the concept is expanded…

  • Data Science Provenance

    Data Science Provenance

    Background While there are some efforts to automate the process of build machine learning model (commonly known as ‘AutoML‘), there are many tasks in the different stages of a data science pipeline (see the diagram below) cannot be fully automated. For example, for ‘Data Wrangling’ a decisions have to be made on which dataset to…

  • 1st IEEE Workshop on Visualization and Provenance Across Domains

    1st IEEE Workshop on Visualization and Provenance Across Domains

    October 22nd, 2023 at IEEE VIS in Melbourne, Australia https://visxprov.github.io/ Our ambition is to build this into a series of workshops, targeting a different research community outside visualization each year, and eventually create a provenance research network that connects all the relevant communities. For this year, we will target the database community, which is one…

  • Dagstuhl Seminar: Human-Centered Approaches for Provenance in Automated Data Science

    Dagstuhl Seminar: Human-Centered Approaches for Provenance in Automated Data Science

    Sep 10 – Sep 15, 2023, Dagstuhl, Germany https://www.dagstuhl.de/23372 This Dagstuhl Seminar aims to bring together an interdisciplinary group of researchers and practitioners, spanning Data Science (DS) and Machine Learning (ML), Visualization and Human-Computer Interactions (HCI), and Provenance; to tackle the challenges in automated data science (AutoDS). We specifically focus on ways that methods from…