Skip to content

Commit

Permalink
[SPARK-28800][DOC][SQL] Document REPAIR TABLE statement in SQL Reference
Browse files Browse the repository at this point in the history
### What changes were proposed in this pull request?
Document REPAIR TABLE statement in SQL Reference.

### Why are the changes needed?
To complete SQL reference.

### Does this PR introduce any user-facing change?
Yes.

After the change, we will have the following
![image](https://user-images.githubusercontent.com/13592258/66271480-461f7480-e813-11e9-9b40-cbffec1221ae.png)

![image](https://user-images.githubusercontent.com/13592258/66261968-4fb1c980-e78c-11e9-9db0-fcd6f458fd39.png)

### How was this patch tested?
Tested using jykyll build --serve

Closes #25884 from huaxingao/spark-28800.

Authored-by: Huaxin Gao <huaxing@us.ibm.com>
Signed-off-by: Sean Owen <sean.owen@databricks.com>
  • Loading branch information
huaxingao authored and srowen committed Oct 6, 2019
1 parent 77510c6 commit 5a512e8
Showing 1 changed file with 42 additions and 1 deletion.
43 changes: 42 additions & 1 deletion docs/sql-ref-syntax-ddl-repair-table.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,4 +19,45 @@ license: |
limitations under the License.
---

**This page is under construction**
### Description
`MSCK REPAIR TABLE` recovers all the partitions in the directory of a table and updates the Hive metastore. When creating a table using `PARTITIONED BY` clause, partitions are generated and registered in the Hive metastore. However, if the partitioned table is created from existing data, partitions are not registered automatically in the Hive metastore. User needs to run `MSCK REPAIR TABLE` to register the partitions. `MSCK REPAIR TABLE` on a non-existent table or a table without partitions throws an exception. Another way to recover partitions is to use `ALTER TABLE RECOVER PARTITIONS`.

### Syntax
{% highlight sql %}
MSCK REPAIR TABLE table_name
{% endhighlight %}

### Parameters
<dl>
<dt><code><em>table_name</em></code></dt>
<dd>Specifies the name of the table to be repaired.</dd>
</dl>

### Examples
{% highlight sql %}
-- create a partitioned table from existing data /tmp/namesAndAges.parquet
CREATE TABLE t1 (name STRING, age INT) USING parquet PARTITIONED BY (age)
location "/tmp/namesAndAges.parquet";

-- SELECT * FROM t1 does not return results
SELECT * FROM t1;

-- run MSCK REPAIR TABLE to recovers all the partitions
MSCK REPAIR TABLE t1;

-- SELECT * FROM t1 returns results
SELECT * FROM t1;

+ -------------- + ------+
| name | age |
+ -------------- + ------+
| Michael | 20 |
+ -------------- + ------+
| Justin | 19 |
+ -------------- + ----- +
| Andy | 30 |
+ -------------- + ----- +

{% endhighlight %}
### Related Statements
* [ALTER TABLE](sql-ref-syntax-ddl-alter-table.html)

0 comments on commit 5a512e8

Please sign in to comment.