cogcomp-nlp
NER Gazetteer not downloading
When I run training on the demo data for the first time with:
java -Xmx8g -cp target/classes:target/dependency/* edu.illinois.cs.cogcomp.ner.NerTagger -train test/Test/0224.txt test/Test/0228.txt config/ner.properties
I get the following error:
log4j:WARN No appenders could be found for logger (edu.illinois.cs.cogcomp.ner.LbjTagger.Parameters).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Downloading the folder from datastore . . .
GroupId: readonly.org.cogcomp.gazetteers
ArtifactId: 1.6/gazetteers.zip
augmentedGroupId: readonly.org.cogcomp.gazetteers
versionedFileName: 1.6/gazetteers.zip
zippedFileName: /home/himanshu/.cogcomp-datastore/tmp/1.6/gazetteers.zip
java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:589)
at com.squareup.okhttp.internal.Platform.connectSocket(Platform.java:101)
at com.squareup.okhttp.internal.io.RealConnection.connectSocket(RealConnection.java:137)
at com.squareup.okhttp.internal.io.RealConnection.connect(RealConnection.java:108)
at com.squareup.okhttp.internal.http.StreamAllocation.findConnection(StreamAllocation.java:184)
at com.squareup.okhttp.internal.http.StreamAllocation.findHealthyConnection(StreamAllocation.java:126)
at com.squareup.okhttp.internal.http.StreamAllocation.newStream(StreamAllocation.java:95)
at com.squareup.okhttp.internal.http.HttpEngine.connect(HttpEngine.java:281)
at com.squareup.okhttp.internal.http.HttpEngine.sendRequest(HttpEngine.java:224)
at com.squareup.okhttp.Call.getResponse(Call.java:286)
at com.squareup.okhttp.Call$ApplicationInterceptorChain.proceed(Call.java:243)
at com.squareup.okhttp.Call.getResponseWithInterceptorChain(Call.java:205)
at com.squareup.okhttp.Call.execute(Call.java:80)
at io.minio.MinioClient.execute(MinioClient.java:826)
at io.minio.MinioClient.executeHead(MinioClient.java:1018)
at io.minio.MinioClient.statObject(MinioClient.java:1154)
at io.minio.MinioClient.getObject(MinioClient.java:1343)
at org.cogcomp.Datastore.getDirectory(Datastore.java:556)
at edu.illinois.cs.cogcomp.ner.ExpressiveFeatures.TreeGazetteers.init(TreeGazetteers.java:71)
at edu.illinois.cs.cogcomp.ner.ExpressiveFeatures.TreeGazetteers.
I have tried deleting the folder and retrying several times, but the same error appears every time. Has the Gazetteer-1.6.zip file been moved elsewhere? Is there an updated URL for it?
I am currently encountering a similar problem: the gazetteers cannot be downloaded because a connection to the server cannot be established (Failed to connect to smaug.cs.illinois.edu/192.17.58.151:8080). This problem has persisted for at least a week. Is there any other way I can get the gazetteers?
Sorry about these issues.
We were having issues
Since we transitioned to UPenn, some of our infrastructure machines have moved here as well. As a result, the server smaug.cs.illinois.edu no longer exists; its replacement is http://macniece.seas.upenn.edu:4008. This issue should be fixed in the main repository if you clone the source code here.
FYI @HeglerTissot
Thank you, I was able to download from the UPenn server. For other users who encounter this problem while using the Maven plugin from http://cogcomp.org/m2repo/: you can change the ResourceConfigurator endpoint to the UPenn server before calling the annotator:
ResourceConfigurator.ENDPOINT.value = "http://macniece.seas.upenn.edu:4008";
I'm getting a similar problem with the new address:
java.net.ConnectException: Failed to connect to macniece.seas.upenn.edu/158.130.57.77:4008
at com.squareup.okhttp.internal.io.RealConnection.connectSocket(RealConnection.java:139)
at com.squareup.okhttp.internal.io.RealConnection.connect(RealConnection.java:108)
at com.squareup.okhttp.internal.http.StreamAllocation.findConnection(StreamAllocation.java:184)
at com.squareup.okhttp.internal.http.StreamAllocation.findHealthyConnection(StreamAllocation.java:126)
at com.squareup.okhttp.internal.http.StreamAllocation.newStream(StreamAllocation.java:95)
at com.squareup.okhttp.internal.http.HttpEngine.connect(HttpEngine.java:281)
at com.squareup.okhttp.internal.http.HttpEngine.sendRequest(HttpEngine.java:224)
at com.squareup.okhttp.Call.getResponse(Call.java:286)
at com.squareup.okhttp.Call$ApplicationInterceptorChain.proceed(Call.java:243)
at com.squareup.okhttp.Call.getResponseWithInterceptorChain(Call.java:205)
at com.squareup.okhttp.Call.execute(Call.java:80)
at io.minio.MinioClient.execute(MinioClient.java:826)
at io.minio.MinioClient.executeHead(MinioClient.java:1018)
at io.minio.MinioClient.statObject(MinioClient.java:1154)
at io.minio.MinioClient.getObject(MinioClient.java:1343)
at org.cogcomp.Datastore.getDirectory(Datastore.java:556)
at edu.illinois.cs.cogcomp.ner.ExpressiveFeatures.TreeGazetteers.init(TreeGazetteers.java:71)
at edu.illinois.cs.cogcomp.ner.ExpressiveFeatures.TreeGazetteers.<init>(TreeGazetteers.java:50)
at edu.illinois.cs.cogcomp.ner.ExpressiveFeatures.GazetteersFactory.get(GazetteersFactory.java:50)
at edu.illinois.cs.cogcomp.ner.LbjTagger.Parameters.readAndLoadConfig(Parameters.java:265)
at edu.illinois.cs.cogcomp.ner.LbjTagger.Parameters.readConfigAndLoadExternalData(Parameters.java:91)
at edu.illinois.cs.cogcomp.ner.NerTagger.main(NerTagger.java:38)
Is there something wrong with the server? Or my build?
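One way to tell a server outage from a build problem is to test the TCP connection directly, outside the NER code. The following is a minimal sketch using only the Java standard library. The class name EndpointCheck and the method isReachable are hypothetical names chosen for illustration and are not part of cogcomp-nlp; the default host and port are the ones from the trace above.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical helper, not part of cogcomp-nlp.
final class EndpointCheck {

    /** Returns true if a TCP connection to host:port succeeds within timeoutMillis. */
    static boolean isReachable(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            // Throws an IOException subclass on timeout, refusal, or unknown host.
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults are taken from the stack trace in this thread.
        String host = args.length > 0 ? args[0] : "macniece.seas.upenn.edu";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 4008;
        boolean ok = isReachable(host, port, 5000);
        System.out.println(host + ":" + port + (ok ? " is reachable" : " is NOT reachable"));
    }
}
```

A "NOT reachable" result suggests a server-side or network-side problem rather than the build. A successful connection only shows that something is listening on that port, not that the datastore itself is working.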
I don't think we currently have any service running on macniece port 4008.
What should it be?
On Wed, Dec 2, 2020 at 5:34 PM LucasPages wrote:
I'm getting a similar problem with the new address:
java.net.ConnectException: Failed to connect to macniece.seas.upenn.edu/158.130.57.77:4008
Is there something wrong with the server? Or my build?
What should the endpoint be? Or how can I find these:
readonly.org.cogcomp.mention
1.0\ACE_HEAD_TYPE.zip
\1.0\ACE_HEAD_TYPE
And
ner-model-enron-conll-all-data.zip
@danyaljj I have the same problem. The endpoint http://macniece.seas.upenn.edu:4008 is unusable.
@108598057 Did you resolve the problem? I am also encountering it.