Jaisidh Singh

me.jpg

I am a master’s student at the University of Tübingen studying machine learning and a fellow at Zuse School ELIZA. I’m also a guest researcher at the Max Planck Institute for Intelligent Systems Tübingen advised by Antonio Orvieto. Currently, I’m writing my master’s thesis on the scaling behaviour of LLMs with hybrid attention with Aaron Klein from OpenEuroLLM. Previously, I was an intern at Bosch Research India during my undergrad at IIT Jodhpur.

Coming from an engineering background, my approach to deep learning research seeks to harmonise applied and foundational research. My interests primarily align with, but are not limited to, scaling behaviour of foundation models, particularly how architecture design influences hyper-parameters as we scale up model size, as well as depth-scaling.

If you’re interested in collaborating or chatting about these topics, reach out to me via bluesky or email. I like getting messages! For more information about my publications or work experience, you can check my curriculum vitae.



Blog posts