Open-source enthusiast, LLM expert, co-founder and CTO of ellamind, and co-founder of DiscoResearch, our open-source research and development community. Come chat with me in our Discord!
- LeoLM: German LLM: I used large-scale continued pretraining to transfer the English-language capabilities of Llama-2 to German. Together with LAION and Hessian.AI, we released LeoLM: Linguistically Enhanced Open Language Model at different model scales. Check out our blog post for more info: https://laion.ai/blog/leo-lm/
- Vision-Language Explanations: Transformers are hard to interpret, but they are great at producing text. Why not have them explain their own decisions? A large research project investigating natural language explanations for multimodal transformer applications. Currently under review. arXiv preprint: https://arxiv.org/abs/2212.04231
- KOSMOS-1 Reimplementation: The KOSMOS-1 paper (multimodal foundation model) was super interesting to me at the time, but there was no code to be found anywhere. This is a very rudimentary reimplementation of the core aspects.
- Tagesschau: Simple scrape of Tagesschau news articles.
In my repositories you'll find some projects:
- DiscoveredWeekly contains the source code for my website discoveredweekly.com, where users can log in with their Spotify account and have their new Discover Weekly playlist copied automatically every Monday, so that no valuable song suggestions are ever lost.
- AutoObjectRemoval combines instance segmentation using Detectron2 with Flow-Guided Video Completion to create a system which can automatically mask and remove objects from videos.
- VideoSilenceRemover is a tool for automatically cutting segments of silence out of a video. I created this tool for a friend to take over the boring parts of his job.
- DirectoryStats is a Python CLI for efficiently counting large numbers of files and subdirectories. I needed this to keep track of directory size during creation of the dataset for my thesis project.
- PaypalTransactionVisualizer is a Jupyter notebook which shows you some interesting information about your past spending with PayPal. I implemented this project mostly to gain some insight into my own spending habits, but also to practice using Jupyter and some interesting Python features.
- YoutubeHistoryVisualizer is a notebook along similar lines which shows you some stats about the YouTube videos you've watched in the past. It works with data from Google Takeout.
- ColorFlow is an Android game written in Java, which was a cool side project. The repo is not well maintained and is used primarily as my own VCS. Check out the game in the Play Store.
See my IEEE author profile for an updated list of publications.
B. Plüster, C. Weber, L. Qu and S. Wermter, "Hearing Faces: Target Speaker Text-to-Speech Synthesis from a Face," 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, pp. 757-764, doi: 10.1109/ASRU51503.2021.9687866.
The best way to reach me is via e-mail: [email protected].