详解C#如何提取PDF文档中的图片_Asp.net

当 pdf 文件中包含有价值的图片，如艺术画作、设计素材、报告图表等，提取图片可以将这些图像资源进行单独保存，方便后续在不同的项目中使用，避免每次都要从 pdf 中查找。本文将介绍如何使用c#通过代码从pdf文档中提取图片，包含以下两个示例：

提取pdf图片需要用到 spire.pdf for .net 库。可以通过此链接下载产品包后手动添加引用，或者直接通过nuget安装。

c# 提取指定 pdf 页面中的图片

pdfimagehelper 类可用于帮助用户管理 pdf 文档中的图像，要从某个指定的pdf页面中提取图片，参考以下步骤：

使用 pdfdocument 类的 loadfromfile() 方法加载 pdf 文件。

通过 pdfdocument 类的 pages[index] 属性获取指定页面。

创建 pdfimagehelper 对象，然后使用其 getimagesinfo() 方法获取页面中图像信息集合。

遍历图像信息集合，并使用 pdfimageinfo.image.save() 方法将每一张图片以png格式储存到指定文件路径。

c# 代码：

using spire.pdf;
using spire.pdf.utilities;
using system.drawing;

namespace extractimagesfromspecificpage
{
    class program
    {
        static void main(string[] args)
        {
            // 加载pdf文档
            pdfdocument pdf = new pdfdocument();
            pdf.loadfromfile("e:\\pythonpdf\\ai.pdf");

            // 获取第一页
            pdfpagebase page = pdf.pages[0];

            // 创建pdfimagehelper对象
            pdfimagehelper imagehelper = new pdfimagehelper();

            // 获取页面上的图片信息 
            pdfimageinfo[] imageinfos = imagehelper.getimagesinfo(page);

            // 遍历图片信息
            for (int i = 0; i < imageinfos.length; i++)
            {
                // 获取某个指定图片信息
                pdfimageinfo imageinfo = imageinfos[i];

                // 获取指定图片
                image image = imageinfo.image;

                // 将图片保存为png格式
                image.save("图片\\图-" + i + ".png");
            }

            pdf.dispose();
        }
    }
}

c# 提取pdf 文档中所有图片

要获取整个pdf文档中的图片，就需要遍历每一页然后再提取，具体参考以下步骤：

使用 pdfdocument 类的 loadfromfile() 方法加载 pdf 文件。
创建 pdfimagehelper 对象。
遍历文档中的每一个页面。
通过 pdfdocument 类的 pages[index] 属性获取指定页面。
使用 pdfimagehelper.getimagesinfo() 方法获取页面中图像信息集合。
遍历图像信息集合，并使用 **pdfimageinfo.image.save()**方法将每一张图片以png格式储存到指定文件路径。

c# 代码：

using spire.pdf;
using spire.pdf.utilities;
using system.drawing;

namespace extractallimages
    {
        class program
        {
            static void main(string[] args)
            {
                // 加载pdf文档
                pdfdocument pdf = new pdfdocument();
                pdf.loadfromfile("e:\\pythonpdf\\ai.pdf");

                // 创建pdfimagehelper对象
                pdfimagehelper imagehelper = new pdfimagehelper();

                int m = 0;
                // 遍历pdf页面
                for (int i = 0; i < pdf.pages.count; i++)
                {
                    // 获取指定页面
                    pdfpagebase page = pdf.pages[i];

                    // 获取页面上的图片信息 
                    pdfimageinfo[] imageinfos = imagehelper.getimagesinfo(page);

                    // 遍历图片信息
                    for (int j = 0; j < imageinfos.length; j++)
                    {
                        // 获取某个指定图片信息
                        pdfimageinfo imageinfo = imageinfos[j];

                        // 获取指定图片
                        image image = imageinfo.image;

                        // 将图片保存为png格式
                        image.save("pdf图片\\图-" + m + ".png");
                        m++;
                    }

                }

                pdf.dispose();
            }
        }
    }

到此这篇关于详解c#如何提取pdf文档中的图片的文章就介绍到这了,更多相关c#提取pdf图片内容请搜索代码网以前的文章或继续浏览下面的相关文章希望大家以后多多支持代码网！

C#中DrawCurve的用法小结

drawcurve方法在 c# 中通常用于绘制一条平滑的曲线通过一系列给定的点。不过，需要注意的是drawcurve并不是 c# 语言本身的一部分，而是在 .n... [阅读全文]

C#中CompareTo的用法小结

在c#中，compareto方法通常用于比较当前对象与另一个对象的顺序。这个方法广泛应用于实现了icomparable<t>或者icomparer&... [阅读全文]

C#中EventWaitHandle的用法小结

eventwaithandle是 c# 中用于线程间同步的一个类，它提供了对共享资源的访问控制，以及线程间的同步机制。eventwaithandle类位于sys... [阅读全文]

C#TextBox设置提示文本方式(SetHintText)

c#textbox设置提示文本效果展示核心代码[dllimport("user32.dll", charset = charset.auto)]private ... [阅读全文]

C#随机数(Random)生成与应用实战之从基础到高级详解

在当今的软件开发中，随机数的应用无处不在。无论是游戏开发中的随机事件生成，还是数据处理中的随机抽样，亦或是用户界面中的随机元素展示，随机数都扮演着不可或缺的角色... [阅读全文]

C#中async await异步关键字用法和异步的底层原理全解析

c#异步编程一、异步编程基础异步编程是啥玩意儿就是让程序在干等着某些耗时操作（比如等网络响应、读写文件啥的）的时候，能把线程腾出来干别的活儿，这样程序就能更灵敏... [阅读全文]


验证码：

验证码：

详解C#如何提取PDF文档中的图片

2025年04月03日 • Asp.net •我要评论

相关文章:

发表评论