如何使用Calcite优化SQL并将其从一个数据库引擎重写到另一个数据库引擎-解网

问：

我正在尝试开发一个通用系统，该系统公开和 API，如下所示，

val outputSql: String = SQLRewriter
  .inputSql(query = " <query> ", engine = Engine.SparkSQL, connectionHook = < connection >)
  .rewriteTo(engine = Engine.MySQL)

这个系统应该，

能够读取一些受支持引擎的输入 SQL 查询字符串
优化目标引擎的查询（项目下推）
将目标引擎的优化查询重写为字符串

我认为 Apache Calcite 非常适合这一点，或者它已经具备了这些功能。为此，我尝试在代码中浏览文档、博客和文档字符串，但我觉得我在兜圈子。

我想知道，如果，

方解石已经具备了这些能力
如果它非常适合此用例
您可以向我指出的任何代码示例

一些专家可以帮助我吗？谢谢。

java sql apache-calcite

import org.apache.calcite.adapter.enumerable.EnumerableConvention
import org.apache.calcite.jdbc.CalciteSchema
import org.apache.calcite.plan.volcano.VolcanoPlanner
import org.apache.calcite.rel.RelNode
import org.apache.calcite.rel.core.JoinRelType
import org.apache.calcite.rel.externalize.RelWriterImpl
import org.apache.calcite.rel.rel2sql.RelToSqlConverter
import org.apache.calcite.sql.dialect.*
import org.apache.calcite.test.CalciteAssert
import org.apache.calcite.tools.{Frameworks, RelBuilder}

import java.io.PrintWriter

   
val rootSchema = CalciteSchema.createRootSchema(true).plus()
val config = Frameworks
  .newConfigBuilder()
  .defaultSchema(CalciteAssert.addSchema(rootSchema, CalciteAssert.SchemaSpec.HR))
  .build()
val builder = RelBuilder.create(config)

// Create a example plan using calcite, this should be replaced with real business logic
val opTree: RelNode = builder
  .scan("emps") // scan table 1
  .scan("depts") // scan table 2
  .join(JoinRelType.INNER, "deptno") // inner join between the 2 tables on deptno
  .filter(builder.equals(builder.field("empid"), builder.literal(100))) // filter on empid
  .build

val rw = new RelWriterImpl(new PrintWriter(System.out, true))

// Print basic Logical Plan
opTree.explain(rw)

val cluster = opTree.getCluster
val planner = cluster.getPlanner.asInstanceOf[VolcanoPlanner]

val desiredTraits = cluster.traitSet.replace(EnumerableConvention.INSTANCE)
val newRoot = planner.changeTraits(opTree, desiredTraits)
planner.setRoot(newRoot)

val optimized: RelNode = planner.findBestExp

// Print optimized Logical Plan
// filter happens before join to reduce the amount of data joined. Rules can be configured.
optimized.explain(rw)

// Rewrite Logical Plan as SQL queries based on different dialects
// Each of these dialects can be configured, with UDFs/ procedures etc.
val sqlDialects = Seq(
  SparkSqlDialect.DEFAULT,
  MysqlSqlDialect.DEFAULT,
  PostgresqlSqlDialect.DEFAULT,
  SnowflakeSqlDialect.DEFAULT,
  TeradataSqlDialect.DEFAULT,
  RedshiftSqlDialect.DEFAULT,
  HiveSqlDialect.DEFAULT)

sqlDialects.foreach(dialect => {
  // print name of dialect
  println(dialect)

  // print SQL as per the dialect. dialect parser can be heavily configured.
  val conv = RelToSqlConverter(dialect)
  println(conv.visitRoot(optimized).asQueryOrValues().toString)
})

上一个：java.sql.SQLSyntaxErrorException：意外标记：ON

下一个：Hibernate 变量，引用一个表或另一个表，具体取决于列的值

如何使用Calcite优化SQL并将其从一个数据库引擎重写到另一个数据库引擎

How to use Calcite to optimise and rewrite SQL from one DB engine to another

评论