Student Assistant for Legal Document Querying and Visual-Plot Question Answering
We are looking for a student assistant to join an exciting project, carried out together with
our partner Hochtief / Nexplore, that aims to improve data accessibility through natural
language querying of multi-modal data (tables, text, images, …).
Project Description:
Extracting information from large collections of documents is a difficult task, especially when
the required information is scattered throughout these documents in texts, images, diagrams
(plots) and tables. Together with our partner Hochtief / Nexplore we aim to build a system
that allows querying multi-modal data extracted from their large PDF collections using simple
natural language queries. In particular, we build on CAESURA
(https://arxiv.org/abs/2308.03424), a system that uses Large Language Models to translate
natural language queries over multi-modal data into a sequence of processing steps that
answer the query. However, some exciting challenges remain. For instance, CAESURA
currently relies on external off-the-shelf machine learning tools from Hugging Face to
extract information from modalities other than text.
Task:
Extend CAESURA with tools for querying plots in addition to the other modalities. To this
end, existing PlotQA models should be fine-tuned or new PlotQA models developed.
Qualifications:
- Proficiency in machine learning, computer vision, and NLP concepts.
- Hands-on experience with deep learning frameworks such as PyTorch and
TensorFlow.
If you're passionate about advancing AI and data accessibility, we encourage you to apply.
Your contributions will be instrumental in solving real-world problems and making information
more accessible than ever before.
If you are interested, please send your application documents (CV, transcript) to
matthias.urban@cs.tu-darmstadt.de and carsten.binnig@cs.tu-darmstadt.de