{"data":{"markdownRemark":{"html":"<p>The Simudyne SDK can export all simulation data to <a href=\"https://parquet.apache.org/\">Parquet</a> files for further analysis. For example, these Parquet files can be used as an input into a Jupyter Notebook to allow the user to perform further exploratory analysis of their simulation data.</p>\n<h2 id=\"working-with-parquet-on-windows\"><a href=\"#working-with-parquet-on-windows\" aria-hidden=\"true\" class=\"anchor\"><svg aria-hidden=\"true\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>Working with Parquet on Windows</h2>\n<p>You will need a file named <code class=\"language-text\">winutils.exe</code> to be able to use Parquet on Windows.\n<br />\nYou can find it in the <code class=\"language-text\">hadoop-winutils</code> directory <a href=\"http://content.simudyne.com/$web/hadoop-winutils-master.zip\">here</a>,\nor you can copy-paste the following URL into your browser : <code class=\"language-text\">http://content.simudyne.com/$web/hadoop-winutils-master.zip</code>.</p>\n<p>Once you have downloaded the <code class=\"language-text\">hadoop-winutils</code> , run the <code class=\"language-text\">Winutils_setup.bat</code> batch file to set your environment variable accordingly.</p>\n<p>If you already have an installed version of Hadoop and just lack the <code class=\"language-text\">winutils.exe</code>, you can add it to your <code class=\"language-text\">C:\\hadoop-x.x.x\\bin</code> directory manually.</p>\n<p>When using parquet on Windows, the system will try to access <code class=\"language-text\">...\\hadoop-winutils\\bin</code> (or <code class=\"language-text\">...\\hadoop-x.x.x\\bin</code> if you already had hadoop installed) to find the file <code class=\"language-text\">winutils.exe</code>.\nIf you are getting error messages like <code class=\"language-text\">Shell Failed to locate the winutils binary in the hadoop binary path</code>,\ncheck that your <code class=\"language-text\">HADOOP_HOME</code> environment variable is set and that your <code class=\"language-text\">winutils.exe</code> is located in the <code class=\"language-text\">bin</code> directory inside the directory of the <code class=\"language-text\">HADOOP_HOME</code> destination.\nFor instance, if the location of your <code class=\"language-text\">hadoop-winutils</code> directory is <code class=\"language-text\">C:\\hadoop-winutils</code>, then <code class=\"language-text\">HADOOP_HOME</code> must be <code class=\"language-text\">C:\\hadoop-winutils</code>.</p>\n<h2 id=\"enabling-parquet-output\"><a href=\"#enabling-parquet-output\" aria-hidden=\"true\" class=\"anchor\"><svg aria-hidden=\"true\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>Enabling Parquet Output</h2>\n<p>The Simudyne SDK will not export files to Parquet by default. To enable, set the value of the config field <code class=\"language-text\">core.parquet-export.enabled</code> in the <code class=\"language-text\">simudyneSDK.properties</code> file to true. (<a href=\":version/reference/modelling/model-configuration\">More about Model Config</a>.)</p>\n<p>The path to create the Parquet files in should be provided in the config field <code class=\"language-text\">core.export-path</code>. This can be an HDFS path, or a local file system path. If no value is specified for <code class=\"language-text\">core.export-path</code>, the Parquet files will be dumped to a tmp directory, or the HDFS home if running with spark.</p>\n<p>Furthermore, there are two additional details both for local Parquet and Hive output that a user may wish to change. These are <code class=\"language-text\">core.data-export.generic-flush</code> and <code class=\"language-text\">core.data-export.values-flush</code>. These typically would be the same values (the option to change is left to the user for altering default export or custom channels) and refers to how many records will be outputting to a single file, or in the case of Hive how many entries are sent in a single query.</p>","headings":[{"value":"Working with Parquet on Windows","depth":2},{"value":"Enabling Parquet Output","depth":2}],"frontmatter":{"title":"Parquet","toc":null,"experimental":null}},"site":{"siteMetadata":{"title":"Simudyne Docs","latestVersion":"2.6"}}},"pageContext":{"absolutePath":"/home/vsts/work/1/s/content/2.6/reference/data_export/parquet.md","versioned":false,"version":"2.6","kind":"reference","pagePath":"/reference/data_export/parquet","chronology":{"prev":{"name":"Data Output","path":"/reference/data_export"},"next":{"name":"JSON","path":"/reference/data_export/json"}},"lastUpdated":"2026-04-21T13:56:54.868Z"}}