Hi,
I hope this is a problem that has a solution, because otherwise I will sadly have lost a lot of work. I am working on a project and went back into its history to see an earlier moment in the project. When I tried to return to the latest step, it told me I couldn’t due to a java.lang.IndexOutOfBoundsException. I had only gone back a few steps, so I decided to deal with it and redo the previous few actions. However, multiple columns on the project had been duplicated, resulting in an abundance of columns with the same name (which shouldn’t be possible) and which were anachronistic to the stage of the project I was on. I deleted them and it resulted in my records being split up into multiple records, and some of my data being erased. I tried to go back even further in the project’s history to before qny of this happened, but now it won’t let me go back, with an error saying java.lang.NullPointerException. I just want to return to a stage in the project before any of this confusion occured. Is there a way to do so? I have dealt with these error messages showing up when moving through history before, but the results have never been as problematic for me as now.
Thanks,
Ella
The error message has changed, and is giving a slightly better picture of the problem now. It says java.lang.IndexOutOfBoundsException: Needed to remove row 7157, but only 7084 rows were available.
Hi Ella. Sorry to hear that you're having trouble. First, before you do anything else, make a backup of your project (export it) and, ideally, your entire OpenRefine workspace. In the future, do this at the first hint that anything might be wrong. Data security is our absolute top priority, but sometimes bugs still creep in.
Is this different from the problem that you reported back in July with a similar error message? It looks like Rory attempted to follow up with you on that iteration.
Some additional information would be useful in helping to understand the problem better, especially the version of OpenRefine that you are using and the complete text of the error messages (including all lines for multiline messages).
If the data is not sensitive and you'd like me to take a look at recovering the project, feel free to send it to me. Please send the earliest / least damaged version of the project that you have.
Best,
Tom
Hello! Thanks so much for the response. Yes, this error is very similar, if not identical, to the issue I reported in July- sorry for the duplication of posts, I had forgotten posting that first one and never noticed Rory’s response. This is occurring on version 3.9.3 of OpenRefine, and the full error is
Exception caught (30ms)
java.lang.IndexOutOfBoundsException: Needed to remove row 7157, but only 7084 rows were available.
at org.freeyourmetadata.ner.operations.NERChange.deleteRows(NERChange.java:316)
at org.freeyourmetadata.ner.operations.NERChange.revert(NERChange.java:83)
at com.google.refine.history.HistoryEntry.revert(HistoryEntry.java:175)
at com.google.refine.history.History.undo(History.java:243)
at com.google.refine.history.History.undoRedo(History.java:183)
at com.google.refine.history.HistoryProcess.performImmediate(HistoryProcess.java:86)
at com.google.refine.process.ProcessManager.queueProcess(ProcessManager.java:96)
at com.google.refine.commands.history.UndoRedoCommand.doPost(UndoRedoCommand.java:73)
at com.google.refine.RefineServlet.service(RefineServlet.java:187)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:750)
at org.eclipse.jetty.servlet.ServletHolder$NotAsync.service(ServletHolder.java:1410)
at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:764)
at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:529)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:131)
at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:578)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:122)
at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:223)
at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:1570)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:131)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:122)
at org.eclipse.jetty.server.handler.gzip.GzipHandler.handle(GzipHandler.java:790)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:122)
at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:223)
at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1384)
at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:176)
at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:484)
at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:1543)
at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:174)
at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1306)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:129)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:122)
at com.google.refine.ValidateHostHandler.handle(ValidateHostHandler.java:93)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:122)
at org.eclipse.jetty.server.Server.handle(Server.java:563)
at org.eclipse.jetty.server.HttpChannel$RequestDispatchable.dispatch(HttpChannel.java:1598)
at org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:753)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:501)
at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:282)
at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:314)
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:100)
at org.eclipse.jetty.io.SelectableChannelEndPoint$1.run(SelectableChannelEndPoint.java:53)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
at java.base/java.lang.Thread.run(Unknown Source)
I am using the NER Extension and I wonder if that is potentially part of the issue? The problem seems to be related to the number of rows in the project, and potentially the creation of new rows within a record that occurred when extracting Named Entities threw it off somehow? The data is not sensitive, so I would very much appreciate your looking at the project. I will send it to you and see if you can get anywhere. Thanks so much for all of your help.
Best,
Ella