|  Jyong | 6454e1d644
							
							chunk-overlap None check (#2781) | 1 year ago | 
				
					
						|  Jyong | 31070ffbca
							
							fix qa index processor  tenant id is None error (#2713) | 1 year ago | 
				
					
						|  Charlie.Wei | fa7ba30ba3
							
							Fix rebuild index&csv parsing (#2705) | 1 year ago | 
				
					
						|  Jyong | 5b953c1ef2
							
							Fix some RAG bugs (#2570) | 1 year ago | 
				
					
						|  Jyong | 0620fa3094
							
							Feat/vdb migrate command (#2562) | 1 year ago | 
				
					
						|  Jyong | 4be3087642
							
							Fix/new RAG bugs (#2547) | 1 year ago | 
				
					
						|  Jyong | 91ea6fe4ee
							
							Fix/langchain document schema (#2539) | 1 year ago | 
				
					
						|  Jyong | 6c4e6bf1d6
							
							Feat/dify rag (#2528) | 1 year ago | 
				
					
						|  Jyong | 97fe817186
							
							Fix/upload limit (#2521) | 1 year ago | 
				
					
						|  Bowen Liang | 063191889d
							
							chore: apply ruff's pyupgrade linter rules to modernize Python code with targeted version (#2419) | 1 year ago | 
				
					
						|  crazywoola | 243ca5b1e2
							
							fix: typo in package path of core.splitter (#2411) | 1 year ago | 
				
					
						|  Bowen Liang | 843280f82b
							
							enhancement: introduce Ruff for Python linter for reordering and removing unused imports with automated pre-commit and sytle check (#2366) | 1 year ago | 
				
					
						|  takatost | 9f637ead38
							
							bump version to 0.5.3 (#2306) | 1 year ago | 
				
					
						|  KVOJJJin | 89fcf4ea7c
							
							Feat: chunk overlap supported (#2209) | 1 year ago | 
				
					
						|  takatost | 6cf93379b3
							
							fix: split chunks return empty strings (#2197) | 1 year ago | 
				
					
						|  Jyong | 869690c485
							
							fix notion estimate (#2090) | 1 year ago | 
				
					
						|  Jyong | cb7a608d75
							
							ascii filter Unicode  U+FFFE (#2038) | 1 year ago | 
				
					
						|  Jyong | a63a9c7d45
							
							text spliter length method use default embedding model tokenizer (#2011) | 1 year ago | 
				
					
						|  Bowen Liang | cc9e74123c
							
							improve: introduce isort for linting Python imports (#1983) | 1 year ago | 
				
					
						|  Jyong | 24bdedf802
							
							fix get embedding model provider in empty dataset (#1986) | 1 year ago | 
				
					
						|  Jyong | 4a3d15b6de
							
							fix customer spliter  character (#1915) | 1 year ago | 
				
					
						|  takatost | a938e1f184
							
							fix: notion_indexing_estimate embedding_model_instance NPE (#1907) | 1 year ago | 
				
					
						|  Yeuoly | 9134849744
							
							fix: remove tiktoken from text splitter (#1876) | 1 year ago | 
				
					
						|  takatost | d069c668f8
							
							Model Runtime (#1858) | 1 year ago | 
				
					
						|  Jyong | df1509983c
							
							ppt & pptx improve (#1790) | 1 year ago | 
				
					
						|  Jyong | 5e34f938c1
							
							Feat/add unstructured support (#1780) | 1 year ago | 
				
					
						|  crazywoola | 994fceece3
							
							fix: qa regex (#1738) | 1 year ago | 
				
					
						|  Pascal M | bc54cdc537
							
							refactor: typo in dataset docstore (#1711) | 1 year ago | 
				
					
						|  Pascal M | 5d10cf0fe6
							
							fix: error Class 'builtins.list' is not mapped (#1710) | 1 year ago | 
				
					
						|  Jyong | 4588831bff
							
							Feat/add retriever rerank (#1560) | 1 year ago |