Deploying Big Data Applications to Spark
Import three years of New York City taxi fare data from a Hadoop® Distributed File System (HDFS) into a MATLAB®tall table. Then use Statistics and Machine Learning Toolbox™ to predict fares based on location and time of day in the city. The algorithm scales automatically to multiple cores on your local computer or a compute cluster using Parallel Computing Toolbox™. With MATLAB Compiler™, the algorithm is deployed to an Apache Spark™ cluster with minimal changes needed to the code.
You can also select a web site from the following list:
How to Get Best Site Performance
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.