> For the complete documentation index, see [llms.txt](https://docs.appstrategy.com/apprules-r-documentation/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.appstrategy.com/apprules-r-documentation/platform/platform-features/system-settings/data-sources/sql-compliance/bigdata/hdfs.md). # Apache HDFS ### SELECT Statements A SELECT statement can consist of the following basic clauses. * SELECT * INTO * FROM * JOIN * WHERE * GROUP BY * HAVING * UNION * ORDER BY * LIMIT ### SELECT Syntax The following syntax diagram outlines the syntax supported by the SQL engine of the provider:
\|

SELECT {

\[ TOP \ | DISTINCT ]

{

*

| {

\ \[ \[ AS ] \ ]

| { \ | \ } .*

} \[ , ... ]

}

\[ INTO csv:// \[ filename= ] \ \[ ;delimiter=tab ] ]

{

FROM \ \[ \[ AS ] \ ]

} \[ , ... ]

\[ \[

INNER | { { LEFT | RIGHT | FULL } \[ OUTER ] }

] JOIN \ \[ ON \ ] \[ \[ AS ] \ ]

] \[ ... ]

\[ WHERE \ ]

\[ GROUP BY \ \[ , ... ]

\[ HAVING \ ]

\[ UNION \[ ALL ] \ ]

\[

ORDER BY

\ \[ ASC | DESC ] \[ NULLS FIRST | NULLS LAST ]

]

\[

LIMIT \

\[

{ OFFSET | , }

\

]

}

\ ::=

| \

| @ \

| ?

| COUNT( \* | { \[ DISTINCT ] \ } )

| { AVG | MAX | MIN | SUM | COUNT } ( \ )

| NULLIF ( \ , \ )

| COALESCE ( \ , ... )

| CASE \

WHEN { \ | \ } THEN { \ | NULL } \[ ... ]

\[ ELSE { \ | NULL } ]

END

| \

\ ::=

{

\ { = | > | < | >= | <= | <> | != | LIKE | NOT LIKE | IN | NOT IN | IS NULL | IS NOT NULL | AND | OR | CONTAINS | BETWEEN } \[ \ ]

} \[ { AND | OR } ... ]

| \| ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | #### Examples 1. Return all columns:
| `SELECT * FROM Files` | | --------------------- | 2. Rename a column:
| `SELECT [ChildrenNum] AS MY_ChildrenNum FROM Files` | | --------------------------------------------------- | 3. Cast a column's data as a different data type:
| `SELECT CAST(Length AS VARCHAR) AS Str_Length FROM Files` | | --------------------------------------------------------- | 4. Search data:
| `SELECT * FROM Files WHERE FileId = '119116'` | | --------------------------------------------- | 5. Return the number of items matching the query criteria:
| `SELECT COUNT(*) AS MyCount FROM Files` | | --------------------------------------- | 6. Return the number of unique items matching the query criteria:
| `SELECT COUNT(DISTINCT ChildrenNum) FROM Files` | | ----------------------------------------------- | 7. Return the unique items matching the query criteria:
| `SELECT DISTINCT ChildrenNum FROM Files` | | ---------------------------------------- | 8. Summarize data:
| `SELECT ChildrenNum, MAX(Length) FROM Files GROUP BY ChildrenNum` | | ----------------------------------------------------------------- | See Aggregate Functions below for details. 9. Retrieve data from multiple tables.
| `SELECT c.Owner, o.OwnerRead, o.OwnerWrite, o.OwnerExecute FROM Files c INNER JOIN Permissions o ON c.FullPath = o.FullPath` | | ---------------------------------------------------------------------------------------------------------------------------- | See JOIN Queries below for details. 10. Sort a result set in ascending order:
| `SELECT FileId, ChildrenNum FROM Files ORDER BY ChildrenNum ASC` | | ----------------------------------------------------------------- | 11. Restrict a result set to the specified number of rows:
| `SELECT FileId, ChildrenNum FROM Files LIMIT 10` | | ------------------------------------------------ | 12. Parameterize a query to pass in inputs at execution time. This enables you to create prepared statements and mitigate SQL injection attacks.
| `SELECT * FROM Files WHERE FileId = @param` | | ------------------------------------------- | ### Aggregate Functions #### COUNT Returns the number of rows matching the query criteria.
| `SELECT COUNT(*) FROM Files WHERE FileId = '119116'` | | ---------------------------------------------------- | #### COUNT(DISTINCT) Returns the number of distinct, non-null field values matching the query criteria.
| `SELECT COUNT(DISTINCT FileId) AS DistinctValues FROM Files WHERE FileId = '119116'` | | ------------------------------------------------------------------------------------ | #### AVG Returns the average of the column values.
| `SELECT ChildrenNum, AVG(Length) FROM Files WHERE FileId = '119116'` `GROUP BY ChildrenNum` | | -------------------------------------------------------------------------------------------- | #### MIN Returns the minimum column value.
| `SELECT MIN(Length), ChildrenNum FROM Files WHERE FileId = '119116'` `GROUP BY ChildrenNum` | | ------------------------------------------------------------------------------------------- | #### MAX Returns the maximum column value.
| `SELECT ChildrenNum, MAX(Length) FROM Files WHERE FileId = '119116'` `GROUP BY ChildrenNum` | | ------------------------------------------------------------------------------------------- | #### SUM Returns the total sum of the column values.
| `SELECT SUM(Length) FROM Files WHERE FileId = '119116'` | | ------------------------------------------------------- | ### JOIN Queries The Provider for HDFS supports standard SQL joins like the following examples. #### Inner Join An inner join selects only rows from both tables that match the join condition:
| `SELECT c.Owner, o.OwnerRead, o.OwnerWrite, o.OwnerExecute FROM Files c INNER JOIN Permissions o ON c.FullPath = o.FullPath` | | ---------------------------------------------------------------------------------------------------------------------------- | #### Left Join A left join selects all rows in the FROM table and only matching rows in the JOIN table:
| `SELECT c.Group, o.GroupRead, o.GroupWrite, o.GroupExecute FROM Files c LEFT JOIN Permissions o ON c.FullPath = o.FullPath` | | --------------------------------------------------------------------------------------------------------------------------- | ### Date Literal Functions The following date literal functions can be used to filter date fields using relative intervals. Note that while the <, >, and = operators are supported for these functions, <= and >= are not. #### L\_TODAY() The current day.
| `SELECT * FROM MyTable WHERE MyDateField = L_TODAY()` | | ----------------------------------------------------- | #### L\_YESTERDAY() The previous day.
| `SELECT * FROM MyTable WHERE MyDateField = L_YESTERDAY()` | | --------------------------------------------------------- | #### L\_TOMORROW() The following day.
| `SELECT * FROM MyTable WHERE MyDateField = L_TOMORROW()` | | -------------------------------------------------------- | #### L\_LAST\_WEEK() Every day in the preceding week.
| `SELECT * FROM MyTable WHERE MyDateField = L_LAST_WEEK()` | | --------------------------------------------------------- | #### L\_THIS\_WEEK() Every day in the current week.
| `SELECT * FROM MyTable WHERE MyDateField = L_THIS_WEEK()` | | --------------------------------------------------------- | #### L\_NEXT\_WEEK() Every day in the following week.
| `SELECT * FROM MyTable WHERE MyDateField = L_NEXT_WEEK()` | | --------------------------------------------------------- | Also available: * L\_LAST/L\_THIS/L\_NEXT MONTH * L\_LAST/L\_THIS/L\_NEXT QUARTER * L\_LAST/L\_THIS/L\_NEXT YEAR #### L\_LAST\_N\_DAYS(n) The previous n days, excluding the current day.
| `SELECT * FROM MyTable WHERE MyDateField = L_LAST_N_DAYS(3)` | | ------------------------------------------------------------ | #### L\_NEXT\_N\_DAYS(n) The following n days, including the current day.
| `SELECT * FROM MyTable WHERE MyDateField = L_NEXT_N_DAYS(3)` | | ------------------------------------------------------------ | Also available: * L\_LAST/L\_NEXT\_90\_DAYS #### L\_LAST\_N\_WEEKS(n) Every day in every week, starting n weeks before current week, and ending in the previous week.
| `SELECT * FROM MyTable WHERE MyDateField = L_LAST_N_WEEKS(3)` | | ------------------------------------------------------------- | #### L\_NEXT\_N\_WEEKS(n) Every day in every week, starting the following week, and ending n weeks in the future.
| `SELECT * FROM MyTable WHERE MyDateField = L_NEXT_N_WEEKS(3)` | | ------------------------------------------------------------- | Also available: * L\_LAST/L\_NEXT\_N\_MONTHS(n) * L\_LAST/L\_NEXT\_N\_QUARTERS(n) * L\_LAST/L\_NEXT\_N\_YEARS(n) --- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter: ``` GET https://docs.appstrategy.com/apprules-r-documentation/platform/platform-features/system-settings/data-sources/sql-compliance/bigdata/hdfs.md?ask=&goal= ``` `ask` is the immediate question: it should be specific, self-contained, and written in natural language. `goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.