Categories
Interactive HPC, Supercomputing, UCloud, Tutorial, Webinars and tutorials - video, Workshop

Webinar recording: Fine-Tuning and Deploying Large Language Models

In this video we will guide you through the complete pipeline of fine-tuning large language models (LLMs) for specialised tasks such as medical question-answering using NeMo Framework and Triton Inference Server.

  • Prepare and preprocess open-source datasets for fine-tuning.
  • Apply Parameter-Efficient Fine-Tuning (PEFT) using LoRA with NVIDIA NeMo Framework.
  • Deploy optimised LLMs using NVIDIA Triton Inference Server and TensorRT-LLM.
  • Generate a synthetic Q&A dataset using Label Studio connected to a live inference backend.
  • Fine-tune and evaluate your customised LLM for domain-specific applications.
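
To make the LoRA idea from the list above more concrete, here is a minimal, framework-free sketch. It is not the NeMo Framework API and not the workshop's notebook code; it only illustrates the general technique, in which a frozen weight matrix W is combined with a trainable low-rank product B·A scaled by alpha/r. All function names, shapes and values are assumptions chosen for illustration.

```python
import random


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_forward(x, w, a, b, alpha):
    """Compute x @ W^T + (alpha / r) * x @ A^T @ B^T without merging weights.

    x: batch of inputs, shape (n, d_in)
    w: frozen pretrained weight, shape (d_out, d_in)
    a: trainable down-projection, shape (r, d_in)
    b: trainable up-projection, shape (d_out, r)
    """
    scale = alpha / len(a)  # len(a) is the rank r
    base = matmul(x, transpose(w))  # frozen path
    low_rank = matmul(matmul(x, transpose(a)), transpose(b))  # adapter path
    return [[p + scale * q for p, q in zip(br, lr)] for br, lr in zip(base, low_rank)]


def count_params(d_out, d_in, r):
    """Return (full fine-tuning params, LoRA trainable params) for one layer."""
    return d_out * d_in, r * (d_in + d_out)


random.seed(0)
d_in, d_out, r = 8, 6, 2  # illustrative sizes only
w = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
a = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]
b = [[0.0] * r for _ in range(d_out)]  # B starts at zero, so the adapter is a no-op at first
x = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(3)]

y = lora_forward(x, w, a, b, alpha=4)
full, lora = count_params(4096, 4096, 8)
print(f"full: {full:,} trainable params, LoRA (r=8): {lora:,}")
```

Because only A and B are trained, the number of trainable parameters per layer drops from d_out × d_in to r × (d_in + d_out), which is the main reason the approach is called parameter-efficient.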

All workflows will be executed inside a UCloud project environment with access to GPU resources.

Target audience: Machine learning practitioners, researchers, and engineers interested in LLM customisation, domain adaptation, or scalable model deployment.

Technical Level: Intermediate to Advanced.

Notebooks: https://github.com/emolinaro/ucloud-workshop-28-05-2025

Categories
UCloud, Teaching, Tutorial, Webinars and tutorials - video, Workshop

Webinar recording: Introduction to hosting courses on UCloud

In this video, we introduce the new UCloud Courses app, a tool for hosting and managing university courses in UCloud.

Dr. Federica Lo Verso guides you through the concept and background of the app.
You get a practical demonstration of how to access and use the tool, explore its integration with GitHub, and hear directly from Dr. Himanshu Khandelia, who shares his experiences of using UCloud Courses in real teaching situations.

We also guide you through the application process, show you the technical and financial requirements, and give you a preview of an upcoming hands-on course.

Timestamps:

00:00 – Introduction by Dr. Federica Lo Verso
01:23 – Background: The launch of the new UCloud Courses app
04:12 – Walkthrough: Where to find the Courses app and how to use it
07:00 – Summary of the UCloud Courses app demonstration
07:56 – Walkthrough: How the GitHub repository works
10:07 – Benefits of using Courses on UCloud
11:12 – Introduction to the use case: Dr. Himanshu Khandelia
11:58 – Dr. Khandelia shares his experiences of using UCloud Courses
14:00 – Live demo: Dr. Khandelia shows his use of Courses and the GitHub integration
23:00 – How to apply: The application process for UCloud Courses
24:21 – Resources required to run a UCloud course
25:28 – How to reuse and update existing UCloud Courses
26:20 – Financial model: Costs and available support
27:12 – Outro and teaser for the upcoming hands-on course

Categories
Supercomputing, UCloud, Teaching, Tutorial, Workshop

Workshop 11 June: UCloud Courses hands-on

Developing your own UCloud course app with a newly developed template-based approach

Join a hands-on workshop where we guide you through every step of developing a UCloud course app using our newly developed template-based approach. The concept is to have a dedicated app on UCloud for your university course, which students can use, for example, in exercise/lab sessions and/or at home. An introduction to the approach can be found in this webinar recording.

In this workshop, you will learn how to:

  • Translate your course structure into a structure that is compatible with a UCloud course app.
  • Set up the course development environment, which involves cloning the GitHub repository and running a course setup script.
  • Adapt the provided templates to build a UCloud course app that includes all the necessary components: software dependencies, scripts, datasets, and more.
  • Test the course on your own computer during development to make sure everything works correctly. We also look at what the finished course app looks like once it has been deployed on UCloud.

Git/GitHub and Docker are essential tools in the development of UCloud course apps. In the workshop, we give short introductions to both tools, aimed primarily at participants with no prior experience. Participants may benefit from reviewing introductory material on Git and Docker in advance, but this is not a requirement.

Date: 11 June 2025

Time: 12:30–14:30 (CET)

Location: Online, via Zoom (link to follow)

Target audience: Researchers and teachers from all departments at all Danish universities

Technical level: Basic to advanced

Sign up for this workshop

Categories
Tutorial, Webinars and tutorials - video, Workshop

Workshop recording: AI Applications on DeiC Interactive HPC UCloud – Harnessing Hardware & Tools for AI Development

Timestamps:

00:00 – Introduction and welcome
00:50 – Introduction to UCloud
05:27 – DeiC Interactive HPC website
06:07 – UCloud: Log in
07:06 – UCloud: HPC providers on the UCloud platform
11:08 – UCloud: Initial resource allocations in “My workspace”
12:27 – UCloud: Storage in “My workspace”
13:28 – UCloud: Resources and applications for new resource allocations
13:50 – UCloud: Completing the resource application
21:05 – UCloud: Resource pools and limits (what does it cost?)
23:05 – UCloud: Apps, the app store, applications index in UCloud docs
26:00 – UCloud: Advanced use cases and integration patterns

27:45 – UCloud: Transcriber: Intro and resource needs
30:08 – UCloud: Transcriber: Uploading files inside a project
31:20 – UCloud: Transcriber: Finding and launching Transcriber
32:00 – UCloud: Transcriber: Run Transcriber for the first time (Completing the app launch screen)
34:28 – UCloud: Transcriber: Running multiple Transcriber jobs simultaneously
35:00 – UCloud: Transcriber: Import previous Transcriber job parameters
35:35 – UCloud: Transcriber: Opening running jobs from the “Recent runs” pane
37:15 – UCloud: Transcriber: Transcriber output directories (Jobs folder)
38:26 – UCloud: Transcriber: Transcriber outputs in “Recent runs”
39:54 – UCloud: Transcriber: Output inspection and data download of zip file

41:36 – UCloud: Chat UI: Introduction
43:27 – UCloud: Chat UI: Run Chat UI for the first time (Completing the app launch screen)
48:40 – UCloud: Chat UI: First look at the Chat UI interface (disable new sign-ups and download a model)
52:36 – UCloud: Chat UI: Including documents to support Retrieval Augmented Generation (RAG) (i.e. supplementing the model with an additional document)
57:18 – UCloud: Chat UI: Extend the job time on any UCloud job (if needed)
58:28 – UCloud: Chat UI: Text-to-image generation (Stable Diffusion with a standard LLM)
1:01:45 – UCloud: Chat UI: RAG for (best guess) document summarization (beware of model hallucinations)

1:04:45 – UCloud: Label Studio: Introduction
1:05:15 – UCloud: Label Studio: Run Label Studio for the first time (Completing the app launch screen)
1:07:55 – UCloud: Label Studio: First look at the Label Studio interface
1:09:40 – UCloud: Label Studio: Brief view of the Label Studio documentation
1:10:52 – UCloud: Label Studio: Introduction to the coming “Speech Analyser” application
1:15:45 – UCloud: Label Studio: Documentation

1:16:08 – Conclusion

Categories
Interactive HPC, Supercomputing, UCloud, Workshop

Workshops on AI applications

Join us for three new, free online workshops to discover the AI applications on DeiC Interactive HPC – UCloud and explore how these tools can transform your work.

Workshop 1:

Transcribing and editing audio transcriptions with Transcriber and Speech Analyzer apps

Date: 22 May 2025

Time: 13:00 – 15:00 (CET)

Location: Online, via Zoom (link TBA)

Join us for a hands-on workshop where we guide you through the complete pipeline of transcribing audio files from speech to text and editing and classifying transcription segments.

In this session, you’ll learn how to:

  • Use Transcriber for transcribing audio/video files. Transcriber is based on OpenAI’s Whisper speech recognition model. The app can transcribe speech audio to text in various formats and uses the WhisperX package to perform speaker recognition (identifying who speaks when).
  • Navigate the new, simple drag-and-drop Transcriber user interface, which makes it easier to use AI to transcribe audio files.
  • Edit and classify the transcriptions with Speech Analyzer. Speech Analyzer is an application built on top of Label Studio, specifically optimized for dialogue analysis. It enables you to label, edit, and annotate transcriptions generated using Transcriber.
  • Perform a comprehensive dialogue analysis on UCloud involving transcribing audio files using Transcriber, followed by transcription analysis with Speech Analyzer.
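
The idea of writing speaker-labelled transcription segments to a text format can be pictured with a small sketch. This is not Transcriber's actual code or output schema: the segment fields (`start`, `end`, `speaker`, `text`) and the speaker labels are hypothetical, chosen only to show how timed segments could be rendered as SubRip (SRT) subtitles.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render a list of segment dicts as SRT text with speaker labels."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"[{seg['speaker']}] {seg['text']}\n"
        )
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


# Hypothetical segments; the field names are assumptions for illustration.
segments = [
    {"start": 0.0, "end": 2.5, "speaker": "SPEAKER_00", "text": "Welcome to the interview."},
    {"start": 2.5, "end": 5.0, "speaker": "SPEAKER_01", "text": "Thank you for having me."},
]
print(to_srt(segments))
```

Keeping the speaker label inside the text line keeps the output valid for ordinary subtitle tools while preserving who said what for later annotation.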

All workflows will be executed inside a UCloud project environment with access to GPU resources.

Target audience: Researchers across all departments, particularly digital humanities and social science, as well as students and anyone interested in AI.

Technical Level: Basic to Intermediate.

Sign up for this workshop

Workshop 2:

Chat UI and CVAT pipelines

Date: 27 May 2025

Time: 13:00 – 15:00 (CET)

Location: Online, via Zoom (link TBA)

Join us for a hands-on workshop where we guide you through two different AI-based workflows involving the Chat UI and CVAT apps.

In this session, you’ll learn how to:

  • Use Chat UI as a flexible interface for hosting various LLMs, and interact with them via chat or an API.
  • Use Chat UI for semantic search in a knowledge base.
  • Use CVAT as a powerful annotation tool for image classification, object detection, semantic and instance segmentation, and video/3D annotation.
  • Use advanced CVAT features including auto-annotation, algorithmic assistance, management and analytics.
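
As a rough illustration of what searching a knowledge base by similarity involves, the sketch below ranks documents by cosine similarity to a query. It is not Chat UI's implementation: the `embed` function is a toy bag-of-words stand-in that only matches shared words, whereas a real semantic search would replace it with an embedding model that maps text to dense vectors. The knowledge base entries are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def search(query, documents, top_k=2):
    """Return the top_k (score, document) pairs ranked by similarity."""
    q = embed(query)
    scored = [(cosine(q, embed(doc)), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:top_k]


# Hypothetical knowledge base entries.
knowledge_base = [
    "How to apply for GPU resources in a project",
    "Uploading files to a project folder",
    "Extending the time limit of a running job",
]
for score, doc in search("apply for GPU resources", knowledge_base):
    print(f"{score:.2f}  {doc}")
```

The ranking step stays the same whichever embedding is used; only the quality of `embed` decides whether paraphrases and synonyms are matched.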

All workflows will be executed inside a UCloud project environment with access to GPU resources.

Target audience: Researchers across all fields, particularly transport, robotics, digital humanities, social sciences, and machine learning, as well as students.

Technical Level: Basic to Intermediate.

Sign up for this workshop

Workshop 3:

Fine-Tuning and Deploying Large Language Models with NeMo Framework and Triton Inference Server

Date: 28 May 2025

Time: 13:00 – 15:00 (CET)

Location: Online, via Zoom (link TBA)

Join us for a hands-on workshop where we guide you through the complete pipeline of fine-tuning large language models (LLMs) for specialized tasks such as medical question-answering!

In this session, you’ll learn how to:

  • Prepare and preprocess open-source datasets for fine-tuning.
  • Apply Parameter-Efficient Fine-Tuning (PEFT) using LoRA with NVIDIA NeMo Framework.
  • Deploy optimized LLMs using NVIDIA Triton Inference Server and TensorRT-LLM.
  • Generate a synthetic Q&A dataset using Label Studio connected to a live inference backend.
  • Fine-tune and evaluate your customized LLM for domain-specific applications.

All workflows will be executed inside a UCloud project environment with access to GPU resources.

Target audience: Machine learning practitioners, researchers, and engineers interested in LLM customization, domain adaptation, or scalable model deployment.

Technical Level: Intermediate to Advanced.

Sign up for this workshop


DeiC Interactive HPC provides researchers at Danish universities with access to a variety of AI applications on UCloud that enable them to accelerate their research through powerful and secure computational tools.

Through online workshops the DeiC Interactive HPC Consortium will introduce both new and experienced users to DeiC Interactive HPC/UCloud’s AI app portfolio.

The sessions are designed to equip researchers and students with the knowledge and skills needed to effectively harness DeiC Interactive HPC/UCloud’s AI tools for their research.

Feel free to share with colleagues and peers who might benefit. See you there!

Categories
Supercomputing, Workshop

Interactive HPC Consortium workshop

Yesterday we were at First Hotel Grand Odense for the first DeiC Interactive HPC workshop of 2025.

It was time to update each other on developments and work since our last consortium workshop in Rebild Bakker in autumn 2024. Key topics included:

  • Service development
  • Branding and outreach
  • AI, teaching, and UCloud workshops

Thanks to SDU eScience Center for hosting the well-run event, and heartfelt thanks to all the participants who took part in discussing the key topics.

Special thanks go to our guest speaker, Anne Sofie Fink, Head of DeiC Data Management, for her presentation on the FAIR principles and for exploring possible synergies between the DeiC DM services and the DeiC Interactive HPC service.

We look forward to continuing these important conversations and collaborations in the future. Our next consortium workshop will take place in autumn 2025 and will be organised by Center for Humanities Computing.

Categories
UCloud, Workshop

Workshops on UCloud AI applications

Join us for two free online workshops this December to discover the AI applications on DeiC Interactive HPC – UCloud and explore how these tools can transform your work.

Workshop 1:
Introduction to AI Tools on DeiC Interactive HPC – UCloud

Date: December 10th

Time: 12:30–14:00

This beginner-friendly session will introduce you to the basics of AI tools with a focus on transcription and text annotation. Learn how to use these tools to streamline tasks like analyzing text and creating datasets.

Highlights:

  • Intro to fundamental AI tools
  • Hands-on demonstrations
  • Live Q&A

Whether you’re familiar with DeiC Interactive HPC – UCloud or exploring it for the first time, this workshop will provide practical skills to get started.

Workshop 2:
Advanced AI Tool Development on DeiC Interactive HPC – UCloud

Date: December 11th

Time: 12:30–14:00

For experienced users ready to go deeper! Discover advanced tools like NVIDIA NeMo and Triton to design and develop custom AI solutions.

Highlights:

  • Advanced AI application design
  • Tool showcases
  • Live Q&A

This workshop will focus on more advanced AI tools, offering researchers insights into designing and developing their own AI solutions.
It will showcase the NVIDIA apps NeMo and Triton, available on DeiC Interactive HPC/UCloud, both specifically designed to support these efforts.


DeiC Interactive HPC provides researchers at Danish universities with access to a variety of AI applications on UCloud that enable them to accelerate their research through powerful and secure computational tools.

During two online workshops on December 10th and 11th, the DeiC Interactive HPC Consortium will introduce both new and experienced users to DeiC Interactive HPC/UCloud’s AI app portfolio.

The sessions are designed to equip researchers and students with the knowledge and skills needed to effectively harness DeiC Interactive HPC/UCloud’s AI tools for their research.

Feel free to share with colleagues and peers who might benefit. See you there!

Categories
Workshop

Interactive HPC Consortium workshop

A few days ago, Center for Humanities Computing at Aarhus University had the pleasure of inviting Interactive HPC Consortium colleagues from CLAAUDIA, Aalborg University and the SDU eScience Center to a workshop day at the wonderful Moesgaard Museum.

“We have these collaborative workshops every six months to stay informed and improve the Interactive HPC service, but also to stay connected with what we consider to be close colleagues from the other two universities.
Meeting up in person adds an essential layer to this cross-university collaboration, positively impacting the operation and development of the DeiC Interactive HPC facility. Through these gatherings, innovations are cultivated collectively among partners, each contributing their unique perspective. Given the consistently productive outcomes of these workshops resulting in several working groups that tackle delegated tasks in the coming months, we are considering expanding to a two-day workshop next time to facilitate more in-depth discussions and collaboration,” says Kristoffer Nielbo, Director of Center for Humanities Computing.

This recent workshop hosted no fewer than 25 people who convened to discuss the general status, infrastructure, education, and branding of DeiC Interactive HPC. The day concluded perfectly with a visit to the exhibitions at Moesgaard Museum and its grounds. Everyone looks forward to the next Interactive HPC workshop hosted by CLAAUDIA in wonderful Aalborg.

Categories
Teaching, Workshop

CodeRefinery workshop March 21-23 and 28-30, 2023

Course goals

In this course, you will become familiar with tools and best practices for scientific software development. This course will not teach a programming language, but we teach the tools you need to do programming well and avoid common inefficiency traps. The tools we teach are practically a requirement for any scientist who needs to write code. The main focus is on using Git for efficiently writing and maintaining research software.

Audience

If you identify with any of the statements below, this course is for you:

  • You write scripts to process data.
  • You change scripts written by your colleagues.
  • You write code that is used in research by you or others.
  • You wish you could re-run your own code after a few months.
  • You wish you could reproduce your own results better.
  • You wish you could automate your work better.
  • You, or your group, can’t share or reuse code.
  • You overall want to become more efficient at your work, by using the best possible tools.

Registration

The workshop will be held on March 21-23 and 28-30, 2023.

Go to the CodeRefinery workshop webpage for more information and registration.

About CodeRefinery

CodeRefinery acts as a hub for FAIR (Findable, Accessible, Interoperable, and Reusable) software practices. It currently focuses on the Nordic/Baltic countries, but aims to expand beyond this region. CodeRefinery aims to operate as a community project with support from academic organisations.

CodeRefinery is a project within the Nordic e-Infrastructure Collaboration (NeIC). NeIC is a joint initiative between the Nordic countries, and the NeIC Board is appointed based on nominations by the national e-infrastructure provider organisations. These strategic partner organisations are CSC (Finland), SNIC (Sweden), Sigma2 (Norway), DeiC (Denmark), RH Net (Iceland) and ETAIS (Estonia).

Categories
Workshop

Participant reflections on data(Tinget) and using UCloud

At the end of 2021, students and staff from Aarhus University and the University of Copenhagen with an interest in digital methods, data wrangling, and text and data mining were once again invited to join the annual datasprint organised by The University Libraries at The Royal Danish Library (Det Kgl. Bibliotek).

With the purpose of developing competencies within the field of digital humanities, the datasprint focused on the importance of open political data and the potential of text and data mining in this context.

Large historical datasets were made available to the participants as raw material to explore using the cloud-based Interactive High Performance Computing service, UCloud, developed for Danish universities. A hybrid group of staff from Center for Humanities Computing Aarhus (CHCAA) and students from Information Science, Aarhus University participated in the datasprint in Aarhus (November 18th and 19th) and gained experience with applying UCloud in their work with large datasets.

Benefits of UCloud

High Performance Computing (HPC) systems, colloquially referred to as ‘supercomputers’, are characterised by an immense amount of computing power that far surpasses the abilities of regular desktop computers.

With the cloud-based service UCloud, however, complex HPC systems are made accessible to researchers and students, even when they work with large datasets on laptops.

According to the participants from CHCAA, Aarhus University, one main advantage of working with UCloud at the datasprint was efficiency, as UCloud provides more computing power and works faster than similar systems. The ability to process large amounts of data in a relatively short time was also described as a significant feature of UCloud, alongside its intuitive interface and easy error recovery.

The value of UCloud in the datasprint

UCloud was an important tool at the datasprint in Aarhus, as the topic of the datasprint involved a considerable amount of data, namely the complete collection of Folketinget’s proceedings from 1953 to 2021.

A notable challenge in working with the large dataset from the Danish parliament was that only contemporary data from the 2000s onwards had already been categorised into subjects, a challenge that the participants from our hybrid group sought to solve in order to improve the conditions for analysing the dataset.

Creating a new classifier for the older data that lacks subject categories will make the dataset more accessible and available for further analyses: “We’re working with only 20 subjects, so it is very generic … like economy, labour, foreign affairs.”

– Jan Kostkan, Center for Humanities Computing, Aarhus University

A broader comprehension of the dataset from Folketinget can thus be gained, and the group found a way to categorise the proceedings making them available for further analyses by experts with subject-matter knowledge, for example historians.
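
The source does not describe how the group built its classifier, so the following is only a generic sketch of the underlying idea: learn from texts that already have a subject label, then assign subjects to unlabelled texts. It uses a small multinomial naive Bayes model with Laplace smoothing; the training snippets, subject names and function names are invented for illustration and are not taken from the Folketinget data.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-zæøå]+", text.lower())


def train(labelled_texts):
    """Count words per subject from (text, subject) pairs."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    for text, subject in labelled_texts:
        word_counts[subject].update(tokenize(text))
        doc_counts[subject] += 1
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, doc_counts, vocab


def classify(text, model):
    """Return the subject with the highest log-probability (Laplace smoothing)."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_subject, best_score = None, -math.inf
    for subject, counts in word_counts.items():
        score = math.log(doc_counts[subject] / total_docs)  # class prior
        denom = sum(counts.values()) + len(vocab)
        for word in tokenize(text):
            score += math.log((counts[word] + 1) / denom)
        if score > best_score:
            best_subject, best_score = subject, score
    return best_subject


# Hypothetical labelled examples standing in for already-categorised proceedings.
labelled = [
    ("the budget and taxes for next year", "economy"),
    ("inflation and the state budget", "economy"),
    ("wages and working hours for employees", "labour"),
    ("unions negotiate wages and employment", "labour"),
    ("treaty negotiations with neighbouring countries", "foreign affairs"),
    ("embassy and diplomatic relations abroad", "foreign affairs"),
]
model = train(labelled)
print(classify("a debate about taxes and the budget", model))
```

With only a few broad subjects, as in the quote above, even a simple model of this kind can give a useful first categorisation that subject-matter experts then refine.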

Evaluating the datasprint

UCloud thus served as a valuable tool at the datasprint in Aarhus this November. All four participants agree that UCloud offers significant advantages when it comes to working with large datasets, as in the datasprint, mainly because UCloud has more computing power and works faster than other systems.

One specific quality of UCloud that is emphasised by the participants is its ability to support the collaborative working process, as the system makes it easy to work with others, even at a distance. Apart from minor issues in the user interface, UCloud is generally commended for its usability, even for beginners, and both students and staff from the group stress the potential of including UCloud in teaching.

Read more about the Data(Tinget) datasprint