Bowen Liang 39c14ec7c1 improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965) 1 year ago
blod f976740b57 improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 1 year ago
entity 12c815c597 fix: ExtractSetting optional value missing None as default val (#5238) 1 year ago
firecrawl ba5f8afaa8 Feat/firecrawl data source (#5232) 1 year ago
unstructured b5204111da Add UNSTRUCTURED_API_KEY env support (#4369) 1 year ago
csv_extractor.py 58db719a2c dep: bump pandas from 1.x to 2.x (#4820) 1 year ago
excel_extractor.py 39c14ec7c1 improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965) 1 year ago
extract_processor.py ba5f8afaa8 Feat/firecrawl data source (#5232) 1 year ago
extractor_base.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
helpers.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
html_extractor.py 5b953c1ef2 Fix some RAG bugs (#2570) 1 year ago
markdown_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
notion_extractor.py ba5f8afaa8 Feat/firecrawl data source (#5232) 1 year ago
pdf_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
text_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
word_extractor.py 3b60c28b3a deal the external image when extract docx image (#5024) 1 year ago