我需要解析大约25 MB大小的XLXS,它有大约100万条记录。我阅读了很多节点模块,包括下面的内容
https://github.com/trevordixon/excel.js
https://github.com/dkiyatkin/node-office我也尝试过在Roo中使用Ruby
https://github.com/Empact/roo但他们被绞死了。是否有任何建议来这样做,或者我需要最终将文件分割成多个小块?
同时按照"carlosramireziii“建议使用"oxcelix”!
“https://github.com/gbiczo/oxcelix”
2.0.0-p247 :001 > require 'oxcelix'
=> true
2.0.0-p247 :002 > s = Oxcelix::Workbook.new("/var/www/fullcontact/current/public/uploads/fileupload/filename/Book1.xlsx")
Killed
root@createresume:/var/www/fullcontact/current/public/uploads# irb
2.0.0-p247 :001 > require 'oxcelix'
=> true
2.0.0-p247 :002 > s = Oxcelix::Workbook.new("/var/www/fullcontact/current/public/uploads/fileupload/filename/Book1.xlsx")
Errno::EEXIST: File exists - /var/www/fullcontact/shared/uploads/tmp
from /usr/local/rvm/rubies/ruby-2.0.0-p247/lib/ruby/2.0.0/fileutils.rb:245:in `mkdir'
from /usr/local/rvm/rubies/ruby-2.0.0-p247/lib/ruby/2.0.0/fileutils.rb:245:in `fu_mkdir'
from /usr/local/rvm/rubies/ruby-2.0.0-p247/lib/ruby/2.0.0/fileutils.rb:174:in `block in mkdir'
from /usr/local/rvm/rubies/ruby-2.0.0-p247/lib/ruby/2.0.0/fileutils.rb:173:in `each'
from /usr/local/rvm/rubies/ruby-2.0.0-p247/lib/ruby/2.0.0/fileutils.rb:173:in `mkdir'
from /usr/local/rvm/gems/ruby-2.0.0-p247/gems/oxcelix-0.3.2/lib/oxcelix/workbook.rb:52:in `initialize'
from (irb):2:in `new'
from (irb):2
from /usr/local/rvm/rubies/ruby-2.0.0-p247/bin/irb:13:in `<main>'
2.0.0-p247 :003 > exit
root@createresume:/var/www/fullcontact/current/public/uploads# rm -rf tmp/
root@createresume:/var/www/fullcontact/current/public/uploads# irb
2.0.0-p247 :001 > require 'oxcelix'
=> true
2.0.0-p247 :002 > s = Oxcelix::Workbook.new("/var/www/fullcontact/current/public/uploads/fileupload/filename/Book1.xlsx")
Killed
root@createresume:/var/www/fullcontact/current/public/uploads# 发布于 2014-02-26 13:07:55
对于一个非常大的xml文件,我也遇到了类似的问题。就性能而言,最好将其“削减”为较小的垃圾,并分别处理它们。
https://stackoverflow.com/questions/22042350
复制相似问题