Memory problems when compressing and transferring a large number of small files (1TB in total)

oshirowanen

I have 5 million files which take up about 1TB of storage space. I need to transfer these files to a third party.

What's the best way to do this? I have tried reducing the size using .tar.gz, but even though my computer has 8GB RAM, I get an "out of system memory" error.

Is the best solution to snail-mail the files over?

Celada

Additional information provided in the comments reveals that the OP is using a GUI method to create the .tar.gz file.

GUI software often includes a lot more bloat than the equivalent command line equivalent software, or performs additional unnecessary tasks for the sake of some "extra" feature such as a progress bar. It wouldn't surprise me if the GUI software is trying to collect a list of all the filenames in memory. It's unnecessary to do that in order to create an archive. The dedicated tools tar and gzip are defintely designed to work with streaming input and output which means that they can deal with input and output a lot bigger than memory.

If you avoid the GUI program, you can most likely generate this archive using a completely normal everyday tar invocation like this:

tar czf foo.tar.gz foo

where foo is the directory that contains all your 5 million files.

The other answers to this question give you a couple of additional alternative tar commands to try in case you want to split the result into multiple pieces, etc...

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at
0

Comments

0 comments
Login to comment

Related

From Dev

Python fastest way to read a large number of small files into memory?

From Dev

Python fastest way to read a large number of small files into memory?

From Dev

Compressing large image to small format

From Dev

Transferring large files using scp with CPU and memory considerations

From Dev

Python memory errors when hashing large number of files in sequence

From Dev

rsync on Windows not transferring large files

From Dev

Out of memory when creating large number of relationships

From Dev

External HDD makes clicking sound only when transferring lots of small files

From Dev

Can a large number of (small) files degrade the performance of a filesystem?

From Dev

What's the bottleneck in transfer of a large number of small files?

From Dev

Transferring large files over TCP in Qt

From Dev

Transferring large (8 GB) files over ssh

From Dev

Performance problems when sending small files through Java Sockets

From Dev

FileZilla times out when transferring large file

From Dev

Memory keeps on growing after writing large number of files

From Dev

Memory Keeps Growing Parsing a Large Number of XML Files in Swift

From Dev

Count number of pages in large set of pdf files : Out of memory

From Dev

Unary operator expected when compressing static files

From Dev

Memory problems for multiple large arrays

From Dev

Out of Memory:Transferring Large Data from Amazon Redshift to Pandas

From Dev

C# transferring lots of small files over a network

From Dev

Email vs FTP Server for Transferring many Small Text Files

From Dev

Paging Large Queries: total number

From Dev

A large table or a large number of small table?

From Dev

Memory leak when comparing large files (under ARC)

From Dev

Out of Memory Exception when handling large files in C#

From Dev

How to prevent memory leaks in RubyMotion when reading large files in loops

From Dev

Out of memory exception when using xlsx module with large files

From Dev

Encrypting/Decrypting(Using AES) Large files and transferring over HTTP

Related Related

  1. 1

    Python fastest way to read a large number of small files into memory?

  2. 2

    Python fastest way to read a large number of small files into memory?

  3. 3

    Compressing large image to small format

  4. 4

    Transferring large files using scp with CPU and memory considerations

  5. 5

    Python memory errors when hashing large number of files in sequence

  6. 6

    rsync on Windows not transferring large files

  7. 7

    Out of memory when creating large number of relationships

  8. 8

    External HDD makes clicking sound only when transferring lots of small files

  9. 9

    Can a large number of (small) files degrade the performance of a filesystem?

  10. 10

    What's the bottleneck in transfer of a large number of small files?

  11. 11

    Transferring large files over TCP in Qt

  12. 12

    Transferring large (8 GB) files over ssh

  13. 13

    Performance problems when sending small files through Java Sockets

  14. 14

    FileZilla times out when transferring large file

  15. 15

    Memory keeps on growing after writing large number of files

  16. 16

    Memory Keeps Growing Parsing a Large Number of XML Files in Swift

  17. 17

    Count number of pages in large set of pdf files : Out of memory

  18. 18

    Unary operator expected when compressing static files

  19. 19

    Memory problems for multiple large arrays

  20. 20

    Out of Memory:Transferring Large Data from Amazon Redshift to Pandas

  21. 21

    C# transferring lots of small files over a network

  22. 22

    Email vs FTP Server for Transferring many Small Text Files

  23. 23

    Paging Large Queries: total number

  24. 24

    A large table or a large number of small table?

  25. 25

    Memory leak when comparing large files (under ARC)

  26. 26

    Out of Memory Exception when handling large files in C#

  27. 27

    How to prevent memory leaks in RubyMotion when reading large files in loops

  28. 28

    Out of memory exception when using xlsx module with large files

  29. 29

    Encrypting/Decrypting(Using AES) Large files and transferring over HTTP

HotTag

Archive