Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Efficient XSLT Processing in Relational Database System Zhen Hua Liu Anguel Novoselsky Oracle Corporation VLDB 2006 Agenda XML Processing Languages Overview XSLT Processing - Coprocessor Vs Integrated Approach XSLT to XQuery Rewrite Translation Technique Performance Evaluation Conclusion Q&A XML Processing Languages XQuery/XPath XSLT SQL/XML –integration XML and XQuery/XPath into SQL (http://www.sqlx.org) SQL/XML (Oracle XMLDB) – XMLTransform – Oracle XMLDB extension operator: embedding XSLT in SQL SELECT XMLTransform(emp.resume, ‘XSLT code’) FROM emp; // resume is of XMLType column of table emp XQuery, XSLT, SQL/XML Comparison All are Declarative Languages ! (Central Dogma in declarative query processing) Share common XQuery Data Model SQL/XML XMLType is based on XQuery Data Model XQuery & SQL/XML More database centric Share the same paradigm – Selection, Projection (XML Construction), Join, Order XSLT Database Foreign - Template Rule Matching based Execution Model XLST is more declarative than XQuery Impedance Mismatch with XQuery/SQL Processing Model, how to run XSLT in RDBMS ? Multi-Coprocessors Approach Embed off-the-shelf Xquery/XSLT processors into a SQL engine XSLT(Transform) XSLT Engine XQuery(XMLQue ry) SQL Engine XQuery Data Model Instances Input XQuery DM instances output XQuery Engine Issues & Challenges with Coprocessor Approach Fully composable in SQL/XML: Can we optimize XSLT, XQuery, XPath as one language ? Cross Language Optimization Feasible ? XML is stored & Indexed: Can XSLT processing leverage index on XML in RDBMS? How to make XSLT template rule matching based execution model “fit” into RDBMS processing Model ? XQuery/XSLT/SQL/XML Integrated Architecture XSLT to XQuery Rewrite SQL/XML XQuery XSLT Common algebraic operator tree – storage independent optimization XMLType Abstraction XML Storage/Index dependent Optimization OR Storage Binary XML/ XMLIndex SQLX View/Relational Data Extended/Hybrid Model XSLT to XQuery Rewrite Translation General XSLT to XQuery Translate Technique Fokoue etc “Compiling XSLT 2.0 into XQuery 1.0” paper at WWW 2005 Translate XSLT template into XQuery Function Translate XSLT instruction into corresponding XQuery construct Translate <xsl:apply-template> into XQuery function calls with large XQuery conditional expression matching XSLT pattern Issues Resultant XQuery is cumbersome and requires aggressive optimization Where to Add Intelligence in the translation ? XSLT to XQuery Rewrite example XMLType view over relational data CREATE VIEW dept_emp AS SELECT XMLElement("dept", XMLElement("dname", dname), XMLElement("loc", loc), XMLElement("employees", (SELECT XMLAgg(XMLElement("emp", XMLElement("empno", empno), XMLElement("ename", ename), XMLElement("sal", sal))) FROM emp WHERE emp.deptno = dept.deptno))) as dept_content FROM dept Example contd.. Result of XMLType View ============================================================ <dept> dname>ACCOUNTING</dname> <loc>NEW YORK</loc> <employees> <emp> <empno>7782</empno> <ename>CLARK</ename> <sal>2450</sal> </emp> <emp> <empno>7934</empno> <ename>MILLER</ename> <sal>1300</sal> </emp> </employees> </dept> Example- XSLT on XMLType -1 SELECT XMLTransform(dept_emp.dept_content, '<?xml version="1.0"?><xsl:stylesheet version="1.0“ xmlns:xsl="http://www.w3.org/1999/XSL/Transform" > <xsl:template match="dept"> <H1>HIGHLY PAID DEPT EMPLOYEES</H1> <xsl:apply-templates/> </xsl:template> <xsl:template match="dname"> <H2>Department name: <xsl:value-of select="."/></H2> </xsl:template> <xsl:template match="loc"> <H2>Department location: <xsl:value-of select="."/></H2> </xsl:template> Example- XSLT on XMLType -2 <xsl:template match="employees"> <H2>Employees Table</H2> <table border="2"> <td><b>EmpNo</b></td> <td><b>Name</b></td> <td><b>Weekly Salary</b></td> <xsl:apply-templates select="emp[sal > 2000]"/> </table> </xsl:template> <xsl:template match = "emp"> <tr> <td><xsl:value-of select="empno"/></td> <td><xsl:value-of select="ename"/></td> <td><xsl:value-of select="sal"/></td> </tr> </xsl:template> Example- XSLT on XMLType -3 <xsl:template match="text()"> <xsl:value-of select="."/> </xsl:template> </xsl:stylesheet>') FROM dept_emp; Input XML Structural Info with XSLT template Analysis roo t dept template dept default template dnam e employees loc dname template loc template emp text text Employees template emp template text Text template XSLT Template Invocation Graph root Default template dept dept template dname dname template text Text template loc loc template employees Employees template text Text template Emp[sal>2000] emp template text Text template Example- XQuery from XSLT SELECT XMLQuery( 'declare variable $var000 := .; (: builtin template :) ( let $var002 := $var000/dept return (: <xsl:template match="dept"> :) ( <H1>HIGHLY PAID DEPT EMPLOYEES</H1>, ( let $var003 := $var002/dname return (: <xsl:template match="dname"> :) <H2>{fn:concat("Department name: ", fn:string($var003))}</H2>, let $var003 := $var002/loc return (: <xsl:template match="loc"> :) <H2>{fn:concat("Department location: ",fn:string($var003))}</H2>, Example- XQuery from XSLT let $var003 := $var002/employees return (: <xsl:template match="employees"> :) ( <H2>Employees Table</H2>, <table border="2"> { <td><b>EmpNo</b></td>, <td><b>Name</b></td>, <td><b>Weekly Salary</b></td>, ( Example- XQuery from XSLT Rewrite for $var005 in ($var003/emp[sal > 2000]) return (: <xsl:template match="emp"> :) <tr> <td>{fn:string($var005/empno)}</td> <td>{fn:string($var005/ename)}</td> <td>{fn:string($var005/sal)}</td> </tr> ) } </table> ) ) ) )' PASSING dept_emp.dept_content RETURNING CONTENT) FROM DEPT FROM dept_emp Final Optimized SQL/XML Query SELECT XMLConcat( XMLElement( "H1",'HIGHLY PAID DEPT EMPLOYEES'), XMLElement( "H2",'Department name: ' ||"SYS_ALIAS_4"."DNAME"), XMLELement( "H2",'Department location:’ ||"SYS_ALIAS_4"."LOC"), XMLELement( "H2",'Employees Table'), XMLElement( "table",XMLAttributes('2' AS "border"), XMLElement( "td", XMLElement( "b",'EmpNo')), XMLElement( "td",XMLElement( "b",'Name')), XMLElement( "td",XMLElement( "b",'Weekly Salary')), (SELECT XMLAGG( XMLElement( "tr", XMLElement( "td","EMP"."EMPNO"), XMLElement( "td","EMP"."ENAME"), XMLElement( "td","EMP"."SAL"))) FROM EMP WHERE SAL > 2000 AND DEPTNO=DEPT.DEPTNO))) FROM DEPT XSLT to XQuery Rewrite Key Leverge XML structural information to generate Template Invocation Graph XML Schema, DTD, SQL/XML construction functions Inline Template with caller This generates compact XQuery amendable for further optimization Cancellation with XML view / relational data Path/Value Index for binary XML Partial Evaluation Partial Evaluation to obtain template invocation graph Application computation is described as F(X,Y), X changes less frequently than Y and significant part of F’s computation depends on X. Optimize F statically by holding X as constant Key observation – let F be the XSLT stylesheet, X be the input XML structural information, Y be the actual XML instance document content Comparison with Related Work Fokoue - “Compiling XSLT 2.0 into XQuery 1.0” WWW 2005 Not leveraging Input XML Structure Information for optimizing XQuery generation process Concluded context sensitive flow analysis & function specialization for static optimization Our work – Optimization of XSLT based on input XML structure Leverage Partial Evaluation for obtaining template invocation graph Comparison with Related Work Moerkotte - “Incorporating XSL Processing Into Database Engines” VLDB 2002 XSLT into internal algebra and integrate with RDBMS Concluded future research in combined optimizations of XSLT with XML construction Our work: XSLT into XQuery Combined optimizations of XSLT with XML input Comparison with Related Work Jain & Li etc - “Translating XSLT Programs to Efficient SQL Queries” WWW 2002 & “Composing XSL Transformations with XML Publishing Views” SIGMOD 2003 Our work: XQuery as intermediate language Work with any XML storage/Index Model Performance Evaluation XSLT Mark Db-one row query – Index Probe (Table Scan Vs Index Scan) Rewrite – Integrated Approach No-Rewrite – Coprocessor Approach 7000 6000 5000 4000 Rewrite 3000 No-Rewrite 2000 1000 0 8M 16M 32M 64M XSLT Mark Avts, metric: XML construction Chart, total: XSLT uses xquery aggregate functions: count/sum 1200 1000 800 600 No-Rewrite 400 Rewrite 200 0 avts case chart case metric case total case Conclusions Efficient XSLT processing in RDBMS is feasible despite the template rule based XSLT language Use XQuery as intermediate language to which XSLT is translated Leverage XML input structural information to get efficient & compact XQuery Index Probing, pull based execution model, parallel aggregation, sort applicable to XSLT in RDMS engine XSLT is native to RDBMS, just as XQuery,SQL/XML Questions