pip install magic-pdf
cp magic-pdf.template.json ~/magic-pdf.json
magic-pdf pdf-command --pdf "pdf_path" --model "model_json_path"
Images, tables, titles, text, and so on; inline formulas and display formulas.
conda create -n pdfpipeline python=3.10
git clone https://github.com/opendatalab/PDF-Extract-Kit.git
pip3 install -r requirements+cpu.txt
pip install https://github.com/opendatalab/PDF-Extract-Kit/raw/main/assets/whl/detectron2-0.6-cp310-cp310-macosx_11_0_arm64.whl
PDF-Extract-Kit/configs/model_configs.yaml:2
PDF-Extract-Kit/modules/layoutlmv3/layoutlmv3_base_inference.yaml:72
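The `:2` and `:72` suffixes on the two paths above look like line numbers in those config files. As a minimal sketch under that assumption, the helper below prints the referenced lines so they can be inspected before editing. `read_line` is a hypothetical name introduced here, and the script assumes it is run from the directory that contains the `PDF-Extract-Kit` checkout.

```python
from pathlib import Path


def read_line(path: str, line_no: int) -> str:
    """Return line `line_no` (1-indexed) of a text file, without its newline."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    if not 1 <= line_no <= len(lines):
        raise IndexError(f"{path} has {len(lines)} lines, cannot read line {line_no}")
    return lines[line_no - 1]


if __name__ == "__main__":
    # Paths and line numbers copied from the article; adjust if your checkout differs.
    targets = [
        ("PDF-Extract-Kit/configs/model_configs.yaml", 2),
        ("PDF-Extract-Kit/modules/layoutlmv3/layoutlmv3_base_inference.yaml", 72),
    ]
    for path, line_no in targets:
        if Path(path).exists():
            print(f"{path}:{line_no}: {read_line(path, line_no)}")
        else:
            print(f"{path} not found; run this from the parent of PDF-Extract-Kit")
```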
git lfs clone https://huggingface.co/wanderkid/PDF-Extract-Kit
python pdf_extract.py --pdf data/pdfs/ocr_1.pdf
File "/Users/linyu/ai/PDF-Extract-Kit/modules/layoutlmv3/layoutlmft/models/layoutlmv3/modeling_layoutlmv3.py", line 71, in forward
position_embedding = F.interpolate(position_embedding, size=(Hp, Wp), mode='bicubic')
NotImplementedError: The operator 'aten::upsample_bicubic2d.out' is not currently implemented for the MPS device. If you want this op to be added in priority during the prototype phase of this feature, please comment on https://github.com/pytorch/pytorch/issues/77764. As a temporary fix, you can set the environment variable `PYTORCH_ENABLE_MPS_FALLBACK=1` to use the CPU as a fallback for this op. WARNING: this will be slower than running natively on MPS.
export PYTORCH_ENABLE_MPS_FALLBACK=1
Alternatively, add the following to the Python script you run:
import os
os.environ['PYTORCH_ENABLE_MPS_FALLBACK'] = '1'
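A minimal sketch of how the in-script variant might look in practice. It assumes the variable has to be set before `torch` is first imported, which is the usual advice but is not stated in the error message. `pick_device` is a hypothetical helper, not part of PDF-Extract-Kit.

```python
import os

# Set the flag before torch is imported (assumption: PyTorch reads it at start-up).
os.environ["PYTORCH_ENABLE_MPS_FALLBACK"] = "1"


def pick_device() -> str:
    """Return "mps" if PyTorch reports a usable MPS backend, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # torch is not installed in this environment; fall back to CPU.
        return "cpu"
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


if __name__ == "__main__":
    print(f"fallback flag: {os.environ['PYTORCH_ENABLE_MPS_FALLBACK']}")
    print(f"selected device: {pick_device()}")
```

With the flag set, operators that MPS lacks (such as the bicubic upsampling in the traceback) run on the CPU instead, which the error message itself warns is slower than native MPS.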
RuntimeError: Expected repeatBuffer && cumsumBuffer && resultBuffer to be true, but got false.
magic-pdf pdf-command --pdf "/Users/linyu/ai/pdf储能政策知识库/《“十四五”能源领域科技创新规划》2021.pdf" --model ""
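Paths with spaces, quotation marks, or non-ASCII characters, like the one above, are easy to mis-quote in a shell. As a sketch, the same command can be launched from Python with an argument list, which avoids shell quoting altogether. It uses only the subcommand and flags shown in this article; `build_magic_pdf_command` and `run_magic_pdf` are hypothetical names, and `magic-pdf` is assumed to be on `PATH`.

```python
import os
import shlex
import subprocess


def build_magic_pdf_command(pdf_path: str, model_json_path: str = "") -> list[str]:
    """Build the argument list for the magic-pdf call shown in the article."""
    return [
        "magic-pdf",
        "pdf-command",
        "--pdf",
        os.path.expanduser(pdf_path),  # expand a leading "~" if present
        "--model",
        model_json_path,  # the article passes an empty string here
    ]


def run_magic_pdf(pdf_path: str, model_json_path: str = "") -> subprocess.CompletedProcess:
    """Run magic-pdf without a shell, so the path needs no manual quoting."""
    cmd = build_magic_pdf_command(pdf_path, model_json_path)
    return subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Print the command for inspection; call run_magic_pdf(...) to execute it.
    cmd = build_magic_pdf_command("data/pdfs/ocr_1.pdf")
    print(shlex.join(cmd))
```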