2022-07-31 16:42:53 +00:00
# SQL
2023-08-05 18:56:25 +00:00
Previously in my fresher software developer time, I rarely write SQL, I always
use ORM to wrap SQL. But time past and too much abstraction bites me. So I
decide to only write SQL from now as much as possible, no more ORM for me. But
if there is any cool ORM for Go, I guess I try.
2022-07-31 16:42:53 +00:00
2023-08-05 18:56:25 +00:00
This guide is not kind of guide which cover all cases. Just my little tricks
when I work with SQL.
2022-07-31 16:42:53 +00:00
2022-11-20 09:35:49 +00:00
## Stay away from database unique id
2022-09-06 17:10:02 +00:00
2023-08-05 18:56:25 +00:00
Use UUID instead. If you can, and you should, choose UUID type which can be
sortable.
2022-09-06 17:10:02 +00:00
2022-11-20 09:35:49 +00:00
## Stay away from database timestamp
2022-07-31 16:42:53 +00:00
2023-08-05 18:56:25 +00:00
Stay away from all kind of database timestamp (MySQL timestmap, SQLite
timestamp, ...) Just use int64 then pass the timestamp in service layer not
database layer.
2022-07-31 16:42:53 +00:00
2023-08-05 18:56:25 +00:00
Why? Because time and date and location are too much complex to handle. In my
business, I use timestamp in milliseconds. Then I save timestamp as int64 value
to database. Each time I get timestamp from database, I parse to time struct in
Go with location or format I want. No more hassle!
2022-07-31 16:42:53 +00:00
2022-11-20 09:35:49 +00:00
It looks like this:
```txt
[Business] time, data -> convert to unix timestamp milliseconds -> [Database] int64
```
2023-04-04 10:25:22 +00:00
## Extra field for extra things
2023-08-05 18:56:25 +00:00
Create new column in database is scary, so I suggest avoid it if you can. How to
avoid, first design table with extra field. It is black hole, put everything in
there if you want.
2023-04-04 10:25:22 +00:00
I always use MySQL json data type for extra field.
2023-04-08 10:10:10 +00:00
JSON data type also used for dumping request, response data.
2022-11-20 09:35:49 +00:00
## Use index!!!
2022-07-31 16:42:53 +00:00
2023-08-05 18:56:25 +00:00
You should use index for faster query, but not too much. Don't create index for
every fields in table. Choose wisely!
2022-07-31 16:42:53 +00:00
For example, create index in MySQL:
```sql
2023-06-15 19:31:21 +00:00
CREATE INDEX idx_user_id
ON user_upload (user_id);
```
2023-08-05 18:56:25 +00:00
If create index inside `CREATE TABLE` ,
[prefer `INDEX` to `KEY` ](https://stackoverflow.com/a/1401615 ):
2023-06-15 19:31:21 +00:00
```sql
CREATE TABLE user_upload
(
id int(11) NOT NULL,
user_id int(11) NULL DEFAULT NULL,
PRIMARY KEY (id),
INDEX idx_user_id (user_id)
);
2022-07-31 16:42:53 +00:00
```
2023-04-17 10:14:54 +00:00
Use `EXPLAIN` to check if index is used or not:
- [For MySQL 5.7 ](https://dev.mysql.com/doc/refman/5.7/en/explain-output.html )
- [For MySQL 8.0 ](https://dev.mysql.com/doc/refman/8.0/en/explain-output.html )
2022-11-20 09:35:49 +00:00
## Be careful with NULL
2022-09-06 17:10:02 +00:00
If compare with field which can be NULL, remember to check NULL for safety.
```sql
-- field_something can be NULL
2022-11-20 09:35:49 +00:00
-- Bad
2022-09-06 17:10:02 +00:00
SELECT *
FROM table
WHERE field_something != 1
2022-11-20 09:35:49 +00:00
-- Good
2022-09-06 17:10:02 +00:00
SELECT *
FROM table
WHERE (field_something IS NULL OR field_something != 1)
```
2022-11-20 09:35:49 +00:00
Need clarify why this happpen? Idk :(
## `VARCHAR` or `TEXT`
2023-08-05 18:56:25 +00:00
Prefer `VARCHAR` if you need to query and of course use index, and make sure
size of value will never hit the limit. Prefer `TEXT` if you don't care, just
want to store something.
2022-11-20 09:35:49 +00:00
2023-06-15 19:31:21 +00:00
## `LIMIT`
Prefer `LIMIT 10 OFFSET 5` to `LIMIT 5, 10` to avoid misunderstanding.
2022-11-20 09:35:49 +00:00
## Be super careful when migrate, update database on production and online!!!
2023-08-05 18:56:25 +00:00
Plase read docs about online ddl operations before do anything online (keep
database running the same time update it, for example create index, ...)
2022-11-20 09:35:49 +00:00
2023-08-05 18:56:25 +00:00
- [For MySQL 5.7 ](https://dev.mysql.com/doc/refman/5.7/en/innodb-online-ddl-operations.html ),
[Limitations ](https://dev.mysql.com/doc/refman/5.7/en/innodb-online-ddl-limitations.html )
- [For MySQL 8.0 ](https://dev.mysql.com/doc/refman/8.0/en/innodb-online-ddl-operations.html ),
[Limitations ](https://dev.mysql.com/doc/refman/8.0/en/innodb-online-ddl-limitations.html )
2022-11-20 09:35:49 +00:00
2023-08-17 09:40:38 +00:00
## Heathcheck
Use `SELECT 1` to check if database failed yet.
2022-11-20 09:35:49 +00:00
## Tools
2023-08-05 18:56:25 +00:00
- Use [sqlfluff/sqlfluff ](https://github.com/sqlfluff/sqlfluff ) to check your
SQL.
- Use [k1LoW/tbls ](https://github.com/k1LoW/tbls ) to grasp your database reality
:)
2022-11-20 09:35:49 +00:00
## Thanks
2022-07-31 16:42:53 +00:00
- [Use The Index, Luke ](https://use-the-index-luke.com/ )
- [Essential elements of high performance applications: SQL indexes ](https://www.foxhound.systems/blog/essential-elements-of-high-performance-sql-indexes/ )
- [Things You Should Know About Databases ](https://architecturenotes.co/things-you-should-know-about-databases/ )
2023-04-08 10:10:10 +00:00
- [When to use JSON data type in database schema design? ](https://shekhargulati.com/2022/01/08/when-to-use-json-data-type-in-database-schema-design/ )
- [My Notes on GitLab Postgres Schema Design ](https://shekhargulati.com/2022/07/08/my-notes-on-gitlabs-postgres-schema-design/ )
2023-04-17 10:14:54 +00:00
- [How to read MySQL EXPLAINs ](https://planetscale.com/blog/how-read-mysql-explains )
2023-08-17 09:40:38 +00:00
- [Honest health checks that hit the database ](https://brandur.org/fragments/database-health-check )