Using HDFS from Spring Boot (Java)

1. Add the Maven dependencies

<dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-client-api</artifactId>
    <version>3.3.6</version>
</dependency>
<dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-client-runtime</artifactId>
    <version>3.3.6</version>
</dependency>

2. Set up the Configuration object

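1) Method 1: Place the two HDFS configuration files (hdfs-site.xml and core-site.xml) in the resources folder. A Configuration created with true loads the default resources from the classpath, so these files are picked up automatically. Individual settings can still be overridden with conf.set("key", "value").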

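import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
// Create the configuration; true means core-site.xml and hdfs-site.xml on the classpath are used
Configuration conf = new Configuration(true);
// Open the file system with the NameNode URI, the configuration, and the user to connect as.
// mycluster is assumed to be the nameservice configured in hdfs-site.xml.
FileSystem fs = FileSystem.get(new URI("hdfs://mycluster"), conf, "hadoop");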

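2) Method 2: Create the Configuration with false so that no default configuration files are loaded, and pass the NameNode's IP address and port directly, for example hdfs://192.168.132.101:8081 instead of mycluster.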

Configuration conf = new Configuration(false);
FileSystem fs = FileSystem.get(new URI("hdfs://192.168.132.101:8081"), conf, "hadoop");
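
The URI forms used in the two methods are easier to compare by asking java.net.URI how it parses them. The sketch below uses only the JDK, no Hadoop classes; the class name HdfsUriDemo and the describe helper are illustrative names, not part of any library. It shows that a bare name such as mycluster carries no scheme or host, while the hdfs:// forms carry both.

```java
import java.net.URI;

class HdfsUriDemo {
    // Summarizes the URI parts that a file-system client typically inspects.
    static String describe(String text) {
        URI uri = URI.create(text);
        return "scheme=" + uri.getScheme()
                + ", host=" + uri.getHost()
                + ", port=" + uri.getPort();
    }

    public static void main(String[] args) {
        // A bare name is parsed as a relative path: no scheme, no host.
        System.out.println(describe("mycluster"));
        // A nameservice-style URI: scheme and host, but no port (-1).
        System.out.println(describe("hdfs://mycluster"));
        // A direct NameNode address: scheme, host, and port.
        System.out.println(describe("hdfs://192.168.132.101:8081"));
    }
}
```

A URI without scheme and authority typically leaves the choice of file system to the fs.defaultFS setting from the loaded configuration, so the scheme-less form only makes sense together with the configuration files from method 1.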

3. Basic HDFS operations from Spring Boot

1) Check whether a file exists

boolean exists = fs.exists(new Path("/out.txt"));

2) Create a folder

fs.mkdirs(new Path("/dir1"));

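3) Create a folder with explicit permissions: read and write for the owner, read and write for the group, read-only for everyone else

// FsPermission and FsAction are in the package org.apache.hadoop.fs.permission
fs.mkdirs(new Path("/dir2"), new FsPermission(FsAction.READ_WRITE, FsAction.READ_WRITE, FsAction.READ));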
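
The three FsAction values correspond to the familiar Unix mode bits. The stand-alone sketch below (plain JDK, no Hadoop classes; PermissionDemo, bits, and toOctal are made-up names for illustration) computes the octal mode for the owner/group/other combination used above, which can help when comparing against chmod-style notation.

```java
class PermissionDemo {
    // Converts one "rwx"-style triple (for example "rw-") to its value 0..7.
    static int bits(String triple) {
        int value = 0;
        if (triple.charAt(0) == 'r') value |= 4;
        if (triple.charAt(1) == 'w') value |= 2;
        if (triple.charAt(2) == 'x') value |= 1;
        return value;
    }

    // Combines the owner, group, and other triples into an octal string.
    static String toOctal(String owner, String group, String other) {
        return "" + bits(owner) + bits(group) + bits(other);
    }

    public static void main(String[] args) {
        // READ_WRITE, READ_WRITE, READ corresponds to rw- rw- r--
        System.out.println(toOctal("rw-", "rw-", "r--")); // prints 664
    }
}
```

Two caveats worth checking against your cluster: HDFS may apply the configured umask to the requested permission, so the resulting mode can be stricter than 664, and a directory without the execute bit cannot be traversed by non-superusers.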

4) Delete a folder

// true = delete recursively, including the folder's contents
fs.delete(new Path("/dir1"), true);

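5) Create a file and write text to it
An existing file is overwritten by default; the second parameter controls this. The third parameter sets the buffer size in bytes.

// try-with-resources closes the stream even if the write fails
try (FSDataOutputStream out = fs.create(new Path("/test.txt"), true, 4096)) {
    out.write("hello hadoop!".getBytes(StandardCharsets.UTF_8));
    out.flush();
}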

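6) Read text

// Decode only the bytes that were actually read, not the whole 1024-byte buffer
try (FSDataInputStream inputStream = fs.open(new Path("/test.txt"))) {
    byte[] contextBytes = new byte[1024];
    int length = inputStream.read(contextBytes); // -1 if the file is empty
    String context = length > 0 ? new String(contextBytes, 0, length, StandardCharsets.UTF_8) : "";
    System.out.println(context);
}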
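
A single read call is not guaranteed to fill the buffer or to return the whole file, so files larger than the buffer need a loop. Because FSDataInputStream is a java.io.InputStream, the usual read-until-EOF pattern applies. The sketch below demonstrates it on an in-memory stream so that it runs without a cluster; ReadAllDemo and readFully are illustrative names, not Hadoop APIs.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

class ReadAllDemo {
    // Reads the stream to EOF and decodes exactly the bytes that were read.
    static String readFully(InputStream in) {
        ByteArrayOutputStream collected = new ByteArrayOutputStream();
        byte[] buffer = new byte[1024];
        try {
            int n;
            while ((n = in.read(buffer)) != -1) {
                collected.write(buffer, 0, n); // keep only the n valid bytes
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new String(collected.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] data = "hello hadoop!".getBytes(StandardCharsets.UTF_8);
        // With HDFS, the argument would be the stream returned by fs.open(...)
        String text = readFully(new ByteArrayInputStream(data));
        System.out.println(text);
        System.out.println(text.length());
    }
}
```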

7) Rename a file

boolean result = fs.rename(new Path("/test.txt"), new Path("/testnew.txt"));

8) Upload a file

fs.copyFromLocalFile(new Path("./data/hello.txt"), new Path("/hdfshello.txt"));

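9) Download a file

// First argument false = keep the source file; last argument true = use the raw local file system
fs.copyToLocalFile(false, new Path("/hdfshello.txt"), new Path("./data/testdata.txt"), true);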

10) List all files and folders in a directory

FileStatus[] statuses = fs.listStatus(new Path("/"));
for (FileStatus fileStatus : statuses) {
    System.out.println(fileStatus.toString());
}

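11) Recursively list all files under a directory. Unlike listStatus, listFiles returns only files (no directories), and each LocatedFileStatus carries the file's block locations in addition to its length, replication factor, and block size.

RemoteIterator<LocatedFileStatus> files = fs.listFiles(new Path("/"), true);
while (files.hasNext()) {
    System.out.println(files.next());
}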

12) Query a file's block information

FileStatus fileStatus = fs.getFileStatus(new Path("/user/master01/data.txt"));
BlockLocation[] blocks = fs.getFileBlockLocations(fileStatus, 0, fileStatus.getLen());
for (BlockLocation block : blocks) {
    System.out.println(block);
}

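13) Query a file's block information and seek before reading

FileStatus fileStatus = fs.getFileStatus(new Path("/user/master01/data.txt"));
BlockLocation[] blocks = fs.getFileBlockLocations(fileStatus, 0, fileStatus.getLen());
FSDataInputStream input = fs.open(new Path("/user/master01/data.txt"));
// Jump to the start of the second block; this assumes the file spans at least two blocks
input.seek(blocks[1].getOffset());
// input.seek(0) would move the pointer back to the start of the file
// readLine() is inherited from DataInputStream and is deprecated, but is enough for a quick check
System.out.println(input.readLine());
input.close();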
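
For a file written with a fixed block size, the offsets returned by getOffset() follow from the file length and the block size: block i starts at i * blockSize. The stand-alone sketch below computes them with plain JDK code. BlockOffsetDemo and blockOffsets are illustrative names, and the 128 MB block size is an assumption for the example; the real value comes from the cluster configuration or from FileStatus.getBlockSize(). It also shows why blocks[1] exists only for files larger than one block.

```java
import java.util.ArrayList;
import java.util.List;

class BlockOffsetDemo {
    // Returns the starting offset of every block of a file with the given length.
    static List<Long> blockOffsets(long fileLength, long blockSize) {
        if (blockSize <= 0) {
            throw new IllegalArgumentException("blockSize must be positive");
        }
        List<Long> offsets = new ArrayList<>();
        for (long start = 0; start < fileLength; start += blockSize) {
            offsets.add(start);
        }
        return offsets;
    }

    public static void main(String[] args) {
        long blockSize = 128L * 1024 * 1024; // assumed block size of 128 MB
        // A 300 MB file spans three blocks, so a second block exists.
        System.out.println(blockOffsets(300L * 1024 * 1024, blockSize)); // [0, 134217728, 268435456]
        // A 10 MB file fits in one block, so there is no blocks[1] to seek to.
        System.out.println(blockOffsets(10L * 1024 * 1024, blockSize)); // [0]
    }
}
```

Checking blocks.length before indexing avoids an ArrayIndexOutOfBoundsException on small files.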

Published May 18, 2024 • Java
