Optimizing SQL query on table of 10 million rows: neverending query

debugcn Published at Dev

CorneliaS

I have two tables:

CREATE TABLE routing
 (
  id integer NOT NULL,
  link_geom geometry,
  source integer,
  target integer,
  traveltime_min double precision,
  CONSTRAINT routing_pkey PRIMARY KEY (id)
  )
  WITH (
  OIDS=FALSE
  );

 CREATE INDEX routing_id_idx
    ON routing
    USING btree
    (id);

CREATE INDEX routing_link_geom_gidx
 ON routing
 USING gist
 (link_geom);

CREATE INDEX routing_source_idx
 ON routing
 USING btree
 (source);

CREATE INDEX routing_target_idx
 ON routing
 USING btree
 (target);

and

CREATE TABLE test
(
 link_id character varying,
 link_geom geometry,
 id integer NOT NULL,
 .. (some more attributes here)
 traveltime_min double precision,
 CONSTRAINT id PRIMARY KEY (id),
 CONSTRAINT test_link_id_key UNIQUE (link_id)
  )
 WITH (
  OIDS=FALSE
  );
 ALTER TABLE test
 OWNER TO postgres;

and I am trying to appy the follwing query:

update routing
set  traveltime_min = t2.traveltime_min
from test t2
where t2.id = routing.id

Both tables have near 10 millions rows. The problem is that this query runs neverending. Here what 'EXPLAIN' shows:

Update on routing  (cost=601725.94..1804772.15 rows=9712264 width=208)
 ->  Hash Join  (cost=601725.94..1804772.15 rows=9712264 width=208)
       Hash Cond: (routing.id = t2.id)"
        ->  Seq Scan on routing  (cost=0.00..366200.23 rows=9798223 width=194)"
        ->  Hash  (cost=423414.64..423414.64 rows=9712264 width=18)"
            ->  Seq Scan on test t2  (cost=0.00..423414.64 rows=9712264 width=18)"

I cannot understand what might cause the problem of such a slow response. Is it possible to be a problem caused from the server settings? The thing is that i use the default postgrSQL 9.3 settings.

Laurenz Albe

Drop all indexes on routing before you run the UPDATE and add them again afterwards. That will bring a huge improvement.

Set work_mem high in the session where you run the UPDATE. That will help with the hash.
Set shared_buffers to ¼ of the available memory, but not more than 1GB.

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2021-07-21

Comments

0 comments

From Dev

Related Related

Article

Optimizing SQL query on table of 10 million rows: neverending query

Optimizing SQL query on table of 10 million rows: neverending query

Query optimization on a Table with 1 Million rows

Query large table with 50 million rows

Very slow SELECT query in MySQL in a 10 Million rows table (single table)

Optimizing SQL query

Optimal MYSQL query for longest prefix matching in a table with 5 million rows

Slow Query on Medium MySQL Table (1 Million Rows)

Optimal MYSQL query for longest prefix matching in a table with 5 million rows

Slow Query on Medium MySQL Table (1 Million Rows)

SQL - Poor Performance SELECT Query on 377 million table

Optimizing SQL Server Query with #temp Table and group by Condition

Optimizing MySQL table index/query

Optimizing percentage calculation SQL query

Oracle - optimizing query with v$sql

Optimizing SQL group by and join query

Optimizing WHERE clause SQL query

Optimizing SQL Server 2012 Query

Optimizing Slick generated SQL query

Optimizing a SQL Query with Complex Filtering

Optimizing SQL group by and join query

Optimizing the Django-SQL Query

Optimizing MS-SQL Query

Optimizing SQL Server 2012 Query

Need Help Optimizing SQL query

How to query against 18million rows?

Optimizing a query, duplicate rows that have oldest date

Optimizing summing/grouping query with millions of rows on MYSQL

Self referencing table SQL Query to rows

Optimizing the mysql query - Avoid creation of temporary table?

composite index on large table, optimizing aggregate query