Symphony

Retrieval-augmented language models using multi-modal data lakes.

Multi-modal data lakes, which contain datasets in different formats such as text, tables, and knowledge graphs, have become increasingly popular for many organizations.

Large language models, as generative models, cannot ensure the correctness of generative data.

Given any natural language query, Symphony will first retrieve (possibly multiple) datasets from data lakes, which are then used for reasoning to answer the given query.

References

2023

  1. Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes
    Zui Chen, Zihui Gu, Lei Cao, Ju Fan, and 2 more authors
    In 13th Conference on Innovative Data Systems Research, CIDR 2023, Amsterdam, The Netherlands, January 8-11, 2023, 2023