# DELETE

DELETE FROM [db.]table WHERE expr

DELETE FROM removes rows from the table [db.]table that match the expression expr. The deleted rows are marked as deleted immediately and will be automatically filtered out of all subsequent queries. Cleanup of data happens asynchronously in the background.

For example, the following query deletes all rows from the hits table where the Title column contains the text hello:

DELETE FROM hits WHERE Title LIKE '%hello%';

In MyScale (and ClickHouse), this operation is referred to as a lightweight delete. It involves significantly less overhead compared to the ALTER TABLE ... DELETE (opens new window) query. By default, lightweight deletes are asynchronous. However, we have configured mutations_sync (opens new window) to 1, which means the client will wait for one replica to process the statement.

Please note that we have disabled the ALTER TABLE ... DELETE query on tables with vector indexes due to its inefficiency. Instead, we recommend using DELETE FROM on all tables as it is much faster.

# How to Update Data

In ClickHouse, users can update data using the ALTER TABLE ... UPDATE (opens new window) command. However, it is not recommended to use this command for vector update scenarios in MyScale. A better approach would be to use DELETE and INSERT instead.

For instance, the following queries demonstrate how to update a row in the test_vector table where the id column equals 100.

DELETE FROM test_vector WHERE id = 100;
INSERT INTO test_vector values (100, [-0.045589335, ..., 0.026581138]);

Please note that MyScale is an OLAP database and is not optimized for frequent insert and delete operations. For optimal performance, it's crucial to execute multiple data inserts and deletes within a single statement, rather than processing each row individually. The following example demonstrates how to efficiently perform batch insert and delete operations in a single statement:

DELETE FROM test_vector where id in (2, 3, 5, 7, 11, ...);
INSERT INTO test_vector values (2, [-0.045589335, ..., 0.026581138]), (3, [-0.051993933, ..., 0.031703997]), (5, [-0.038116932, ..., -0.116080105]), ...;

Using ALTER TABLE ... UPDATE is not recommended.

ALTER TABLE test_vector UPDATE vector = [-0.045589335, ..., 0.026581138] WHERE id = 100;
Last Updated: Fri Nov 01 2024 09:38:04 GMT+0000