YC 9f8ca75a81 fixing a bug of handling header row when parsing xls file, and tune xls/xlsx parsing result to be more structured (#3600) 1 year ago
..
blod 0737e930cb chore: remove Langchain tools import (#3407) 1 year ago
entity 5b953c1ef2 Fix some RAG bugs (#2570) 1 year ago
unstructured b5204111da Add UNSTRUCTURED_API_KEY env support (#4369) 1 year ago
csv_extractor.py 58db719a2c dep: bump pandas from 1.x to 2.x (#4820) 1 year ago
excel_extractor.py 9f8ca75a81 fixing a bug of handling header row when parsing xls file, and tune xls/xlsx parsing result to be more structured (#3600) 1 year ago
extract_processor.py 176d91937d fix 'NoneType' and new ContentType supported. (#4818) 1 year ago
extractor_base.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
helpers.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
html_extractor.py 5b953c1ef2 Fix some RAG bugs (#2570) 1 year ago
markdown_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
notion_extractor.py 026175c8f7 feat: update notion extractor (#3898) 1 year ago
pdf_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
text_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
word_extractor.py 233c4150d1 support images and tables extract from docx (#4619) 1 year ago