- Removed multiple failed document entries from `documents.json`. - Added a new document entry with updated metadata and changed the index name to `regulations_dense_1024_v2`. - Updated architecture documentation to reflect changes in the Milvus collection name. - Adjusted requirements by removing the sqlalchemy dependency. - Modified test cases to align with new document structure and naming conventions. - Introduced a new test file for Milvus vector index runtime recovery and error handling. - Updated assertions in various test files to ensure compatibility with the new schema.
45 lines
703 B
Plaintext
45 lines
703 B
Plaintext
# AI+合规智能中枢 - 法律法规文档解析入库
|
|
# MVP核心依赖包
|
|
|
|
# 向量数据库
|
|
pymilvus>=2.4.0
|
|
|
|
# API框架
|
|
fastapi>=0.100.0
|
|
uvicorn[standard]>=0.23.0
|
|
python-multipart>=0.0.6
|
|
|
|
# RAG框架
|
|
langchain>=0.1.0
|
|
langchain-milvus>=0.1.0
|
|
|
|
# PDF解析
|
|
pymupdf>=1.24.0
|
|
|
|
# Word文档解析
|
|
python-docx>=0.8.11
|
|
|
|
# 阿里云文档解析
|
|
alibabacloud-docmind-api20220711>=1.0.6
|
|
alibabacloud-tea-openapi>=0.3.11
|
|
alibabacloud-tea-util>=0.3.13
|
|
|
|
# 任务队列
|
|
celery>=5.3.0
|
|
redis>=4.5.0
|
|
|
|
# 对象存储
|
|
minio>=7.1.0
|
|
|
|
# 数据库
|
|
psycopg2-binary>=2.9.0
|
|
# mysql-connector-python>=8.0.0
|
|
|
|
# 工具库
|
|
pydantic>=2.0.0
|
|
pydantic-settings>=2.0.0
|
|
python-dotenv>=1.0.0
|
|
loguru>=0.7.0
|
|
tenacity>=8.2.0
|
|
httpx>=0.24.0
|