Initializing the HDFS
Function
Hadoop distributed file system (HDFS) initialization is a prerequisite for using application programming interfaces (APIs) provided by the HDFS. The process of initializing the HDFS is
- Load the HDFS service configuration file.
- Instantiate a FileSystem.
Configuration File Description
Table 1 lists the configuration files to be used during the login to the HDFS. These files already imported to the conf directory of the hadoop-examples project.
File |
Function |
Path |
---|---|---|
core-site.xml |
Configures HDFS parameters. |
FusionInsight_Cluster_<Cluster ID>_Services_ClientConfig\HDFS\config\core-site.xml |
hdfs-site.xml |
Configures HDFS parameters. |
FusionInsight_Cluster_<Cluster ID>_Services_ClientConfig\HDFS\config\hdfs-site.xml |
smallfs-site.xml |
Configures SmallFS parameters. |
FusionInsight_Cluster_<Cluster ID>_Services_ClientConfig\SmallFS\config\smallfs-site.xml |
- The configuration file smallfs-site.xml is necessary for running SmallFS example code, but not for running HDFS example code.
- The log4j.properties file under the conf directory can be configured based on your needs.
Example Codes
The following is code snippets. For complete codes, see the HdfsExample class in com.huawei.bigdata.hdfs.examples.
//initialization /**confLoad(); // Creating a sample project HdfsExample hdfs_examples = new HdfsExample("/user/hdfs-examples", "test.txt"); /** * * If the application is running in the Linux OS, the path of core-site.xml, hdfs-site.xml must be modified to the absolute path of the client file in the Linux OS. * * */ private static void confLoad() throws IOException { conf = new Configuration(); // conf file conf.addResource(new Path(PATH_TO_HDFS_SITE_XML)); conf.addResource(new Path(PATH_TO_CORE_SITE_XML)); // conf.addResource(new Path(PATH_TO_SMALL_SITE_XML)); } /** *Create a sample project. */ public HdfsExample(String path, String fileName) throws IOException { this.DEST_PATH = path; this.FILE_NAME = fileName; instanceBuild(); } private void instanceBuild() throws IOException { fSystem = FileSystem.get(conf); }
- In the preceding example code,
//conf.addResource(new Path(PATH_TO_SMALL_SITE_XML));
is the SmallFS example code that is commented out. To run SmallFS, it is essential to add this code.
- (Optional) Specify a user to run the example code. To run the example code related to the Colocation operation, the user must be a member of the supergroup group. The following describes two ways to specify the user who runs the example code:
Add the environment variable HADOOP_USER_NAME: For operation details in a Windows-based environment, see 2 in section For operation details in a Linux-based environment, see 4 in section or 4.in section .
Modify the code: If HADOOP_USER_NAME is not specified, modify "USER" in the code to the actual user name:System.setProperty("HADOOP_USER_NAME", USER);
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.