docs: update website synapse info and fabric installation (#2000)

This commit is contained in:
JessicaXYWang 2023-06-28 16:53:47 -07:00 коммит произвёл GitHub
Родитель 63491dd8f1
Коммит aa2a0b202d
Не найден ключ, соответствующий данной подписи
Идентификатор ключа GPG: 4AEE18F83AFDEB23
1 изменённых файлов: 38 добавлений и 3 удалений

Просмотреть файл

@ -249,15 +249,16 @@ function Home() {
and cloud native. and cloud native.
</p> </p>
<p> <p>
Note: SynapseML will be built-in for{" "} Note: SynapseML is built-in for{" "}
<a href="https://docs.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-3-runtime"> <a href="https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-33-runtime">
Azure Synapse soon. Azure Synapse Analytics.
</a> </a>
</p> </p>
<Tabs <Tabs
defaultValue="Synapse" defaultValue="Synapse"
values={[ values={[
{ label: "Synapse", value: "Synapse" }, { label: "Synapse", value: "Synapse" },
{ label: "Fabric", value: "Fabric" },
{ label: "Spark Packages", value: "Spark Packages" }, { label: "Spark Packages", value: "Spark Packages" },
{ label: "Databricks", value: "Databricks" }, { label: "Databricks", value: "Databricks" },
{ label: "Docker", value: "Docker" }, { label: "Docker", value: "Docker" },
@ -296,6 +297,40 @@ function Home() {
"spark.sql.parquet.enableVectorizedReader": "false", "spark.sql.parquet.enableVectorizedReader": "false",
"spark.sql.legacy.replaceDatabricksSparkAvro.enabled": "true" "spark.sql.legacy.replaceDatabricksSparkAvro.enabled": "true"
} }
}`}
lang="bash"
></CodeSnippet>
</TabItem>
<TabItem value="Fabric">
<p>SynapseML is preinstalled on Fabric. To install a different version, adding the following to the first cell of a notebook:</p>
For Spark3.3 pool:
<CodeSnippet
snippet={`%%configure -f
{
"name": "synapseml",
"conf": {
"spark.jars.packages": "com.microsoft.azure:synapseml_2.12:0.11.1-spark3.3",
"spark.jars.repositories": "https://mmlspark.azureedge.net/maven",
"spark.jars.excludes": "org.scala-lang:scala-reflect,org.apache.spark:spark-tags_2.12,org.scalactic:scalactic_2.12,org.scalatest:scalatest_2.12,com.fasterxml.jackson.core:jackson-databind",
"spark.yarn.user.classpath.first": "true",
"spark.sql.parquet.enableVectorizedReader": "false"
}
}`}
lang="bash"
></CodeSnippet>
For Spark3.2 pool:
<CodeSnippet
snippet={`%%configure -f
{
"name": "synapseml",
"conf": {
"spark.jars.packages": "com.microsoft.azure:synapseml_2.12:0.11.1,org.apache.spark:spark-avro_2.12:3.3.1",
"spark.jars.repositories": "https://mmlspark.azureedge.net/maven",
"spark.jars.excludes": "org.scala-lang:scala-reflect,org.apache.spark:spark-tags_2.12,org.scalactic:scalactic_2.12,org.scalatest:scalatest_2.12,com.fasterxml.jackson.core:jackson-databind",
"spark.yarn.user.classpath.first": "true",
"spark.sql.parquet.enableVectorizedReader": "false",
"spark.sql.legacy.replaceDatabricksSparkAvro.enabled": "true"
}
}`} }`}
lang="bash" lang="bash"
></CodeSnippet> ></CodeSnippet>