Dear colleagues,
I have some troubles with Greenstone when processing some pdfs. The error log
says this:
import.pl> Exception in thread "main" java.lang.OutOfMemoryError: Java heap
space<br /> at java.nio.CharBuffer.wrap(Unknown Source)<br /> at
java.nio.CharBuffer.wrap(Unknown Source)<br /> at
java.lang.StringCoding$StringDecoder.decode(Unknown Source)<br /> at
java.lang.StringCoding.decode(Unknown Source)<br /> at
java.lang.String.<init>(Unknown Source)<br /> at
java.lang.String.<init>(Unknown Source)<br /> at
org.apache.fontbox.cmap.CMapParser.createStringFromBytes(CMapParser.java:618)<br
/> at org.apache.fontbox.cmap.CMapParser.parse(CMapParser.java:224)<br /> at
org.apache.pdfbox.pdmodel.font.PDFont.parseCmap(PDFont.java:603)<br /> at
org.apache.pdfbox.pdmodel.font.PDSimpleFont.extractToUnicodeEncoding(PDSimpleFont.java:458)<br
/> at
org.apache.pdfbox.pdmodel.font.PDSimpleFont.determineEncoding(PDSimpleFont.java:426)<br
/> at org.apache.pdfbox.pdmodel.font.PDFont.<init>(PDFont.java:194)<br /> at
org.apache.pdfbox.pdmodel.font.PDSimpleFont.<init>(PDSimpleFont.java:88)<br />
at org.apache.pdfbox.pdmodel.font.PDType0Font.<init>(PDType0Font.java:65)<br />
at
org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:108)<br
/> at org.apache.pdfbox.pdmodel.PDResources.getFonts(PDResources.java:203)<br
/> at
org.apache.pdfbox.util.PDFStreamEngine.getFonts(PDFStreamEngine.java:604)<br />
at org.apache.pdfbox.util.operator.SetTextFont.process(SetTextFont.java:54)<br
/> at
org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:554)<br
/> at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:268)<br
/> at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:235)<br
/> at
org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:215)<br
/> at
org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:455)<br
/> at
org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:379)<br
/> at
org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:335)<br
/> at org.apache.pdfbox.ExtractText.startExtraction(ExtractText.java:275)<br />
at org.apache.pdfbox.ExtractText.main(ExtractText.java:85)<br />ADVIRTIENDO:
Ningún plugin podrá ser procesado 0568157-MAIN.pdf
Could anybody please help me?
Thanks a lot!
Ignacio Fernandez Sarasola