Reading HDFS from Matlab - what toolboxes do I need?

Anna Dunblad on 8 Sep 2017
Answered: Chad Greene on 11 Sep 2017
We're planning to implement Hadoop at my work, and I need a way to retrieve the data from the Hadoop clusters in the data lake and get it into Matlab. What toolboxes do I need for this? Note that I'm only reading the data from HDFS files.
Additionally, would I need other toolboxes to be able to read data?

Answers (2)

Brandon Eidson on 11 Sep 2017
Hadoop Sequence Files can be read directly in base MATLAB.
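As a minimal sketch of what that could look like, assuming the sequence files were written by MATLAB's own mapreduce and that a Hadoop client is installed on the machine running MATLAB. The install path, server name, port and folder below are placeholders, not values from this thread.

```matlab
% Tell MATLAB where the local Hadoop installation lives (placeholder path).
setenv('HADOOP_HOME', '/path/to/hadoop/install');

% Point a key-value datastore at a folder on HDFS
% (placeholder server name, port and folder).
ds = datastore('hdfs://myserver:7867/data/', 'Type', 'keyvalue');

% Read the key-value pairs chunk by chunk until the datastore is exhausted.
while hasdata(ds)
    kv = read(ds);   % table with the variables Key and Value
    disp(kv);
end
```

For delimited text files on HDFS, the same datastore call without the 'Type' option is usually enough, since datastore picks a type from the file extension.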
If you want to run "mapreduce" on a Hadoop cluster, then you need licenses for the Parallel Computing Toolbox and MATLAB Distributed Computing Server. Documentation on how to configure a Hadoop cluster and run "mapreduce" on it is linked to below.
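A rough sketch of the cluster setup, assuming both products are licensed and installed. The install folder, HDFS locations and the two functions countMapper and countReducer are hypothetical and only illustrate the shape of the calls.

```matlab
% Describe the Hadoop cluster (placeholder install folder).
cluster = parallel.cluster.Hadoop('HadoopInstallFolder', '/path/to/hadoop/install');

% Make mapreduce run on that cluster instead of locally.
mr = mapreducer(cluster);

% Input data and output folder on HDFS (placeholder locations).
ds = datastore('hdfs://myserver:7867/data/records.csv');
outFolder = 'hdfs://myserver:7867/results';

% Count the rows in the input with a hypothetical mapper and reducer.
result = mapreduce(ds, @countMapper, @countReducer, mr, 'OutputFolder', outFolder);
disp(readall(result));

% Hypothetical mapper: emit the number of rows in each chunk.
% Save as countMapper.m on the MATLAB path if local functions are not an option.
function countMapper(data, ~, intermKVStore)
    add(intermKVStore, 'rows', height(data));
end

% Hypothetical reducer: sum the per-chunk row counts.
% Save as countReducer.m on the MATLAB path if local functions are not an option.
function countReducer(key, valueIter, outKVStore)
    total = 0;
    while hasnext(valueIter)
        total = total + getnext(valueIter);
    end
    add(outKVStore, key, total);
end
```

Calling mapreducer(0) instead runs the same mapper and reducer in the local MATLAB session, which is a convenient way to test them before sending work to the cluster.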

Chad Greene on 11 Sep 2017
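The h5read function has come standard since MATLAB release R2011a, and requires no special toolboxes. Note that h5read reads HDF5 files, a file format that is distinct from HDFS, the Hadoop Distributed File System.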
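For HDF5 files, a small self-contained sketch of the round trip in base MATLAB. The file name and dataset name are made up for illustration.

```matlab
% Create a small HDF5 file in the temp folder (hypothetical file name).
fname = fullfile(tempdir, 'h5read_example.h5');
if exist(fname, 'file')
    delete(fname);
end

% Write a 3-by-4 matrix of doubles to the dataset /measurements.
data = reshape(1:12, 3, 4);
h5create(fname, '/measurements', size(data));
h5write(fname, '/measurements', data);

% Read it back and confirm that the contents match.
readBack = h5read(fname, '/measurements');
disp(isequal(data, readBack));
```

h5disp(fname) lists the groups and datasets in a file, which helps when the dataset names are not known in advance.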
