[DataCap Application] <Beijing Zhongnong Leaf Eating Grass Natural Science Research Institute> - <Virus nucleic acid sequence dataset>

Open luhong123 opened this issue 2 years ago • 139 comments

Data Owner Name

Beijing Zhongnong Leaf Eating Grass Natural Science Research Institute

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

China

Data Owner Industry

Life Science / Healthcare

Website

N/A

Social Media

QQ 3415634156

Total amount of DataCap being requested

15PiB

Expected size of single dataset (one copy)

2P

Number of replicas to store

8

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1sapvfym7hgcnydd7j6jnynspdhxo5j3m2fqmrra

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • [ ] Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

Beijing Zhongnong Leaf Eating Grass Natural Science Research Institute (formerly known as Beijing Zhongke Whole Brain Natural Science Research Institute) was established in 2018 and is located in Beijing. It is an enterprise mainly engaged in research and experimental development, with a registered capital of 1 million RMB.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Virus nucleic acid sequence dataset, virus nucleic acid dataset, protein sequence database, bacterial dataset, archaea dataset, etc

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

RNA
https://www.aliyundrive.com/s/48WswAZkyMk

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • [X] I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How do you plan to choose storage providers

Slack

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes
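The figures in the form above can be cross-checked with a little arithmetic. The following is only a sketch: it assumes "2P" means 2 PiB per copy and that every replica is stored in full, neither of which the form states explicitly. The function names are illustrative, not part of any Fil+ tooling.

```python
import math

# Figures copied from the application form; "2P" is ASSUMED to mean 2 PiB.
SINGLE_COPY_PIB = 2
REPLICAS = 8
REQUESTED_TOTAL_PIB = 15
WEEKLY_ALLOCATION_PIB = 1


def full_replication_pib(single_copy_pib: float, replicas: int) -> float:
    """DataCap needed if every replica stores the whole dataset."""
    return single_copy_pib * replicas


def weeks_to_consume(total_pib: float, weekly_pib: float) -> int:
    """Whole weeks needed to use up the total at a steady weekly rate."""
    return math.ceil(total_pib / weekly_pib)


needed = full_replication_pib(SINGLE_COPY_PIB, REPLICAS)
shortfall = needed - REQUESTED_TOTAL_PIB
weeks = weeks_to_consume(REQUESTED_TOTAL_PIB, WEEKLY_ALLOCATION_PIB)

print(f"full replication needs {needed} PiB, requested {REQUESTED_TOTAL_PIB} PiB")
print(f"difference: {shortfall} PiB")
print(f"weeks at {WEEKLY_ALLOCATION_PIB} PiB/week: {weeks}")
```

Under these assumptions, 8 full replicas of a 2 PiB dataset would come to 16 PiB, 1 PiB more than the 15 PiB requested, and the requested total would last 15 weeks at the stated weekly rate.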

luhong123 avatar Jul 07 '23 12:07 luhong123

The initial request for 15 PiB, made from a newly created account that has not established trust within the community, is concerning. I highly recommend closing the application and reapplying with a smaller amount to build credibility and trust before making larger requests.

Advising notaries not to engage.

herrehesse avatar Jul 10 '23 12:07 herrehesse

Could you send an email with the business license to [email protected] from your official domain to confirm your identity? The email subject should include the issue ID #2090.

Best practice for storing large datasets is, ideally, to store them in 3 or more regions with 4 or more storage provider operators or owners. Please list the Miner ID, business entity, and location of the SPs you will cooperate with.

Sunnyiscoming avatar Jul 11 '23 15:07 Sunnyiscoming

> The initial request for 15 PiB, made from a newly created account that has not established trust within the community, is concerning. I highly recommend closing the application and reapplying with a smaller amount to build credibility and trust before making larger requests.
>
> Advising notaries not to engage.

My account has been created for many years. Where did you see it was newly created?

luhong123 avatar Jul 12 '23 11:07 luhong123

> Could you send an email with the business license to [email protected] from your official domain to confirm your identity? The email subject should include the issue ID #2090.
>
> Best practice for storing large datasets is, ideally, to store them in 3 or more regions with 4 or more storage provider operators or owners. Please list the Miner ID, business entity, and location of the SPs you will cooperate with.

The email has been sent, and it answers all the relevant questions.

luhong123 avatar Jul 12 '23 13:07 luhong123

Screenshot 2023-07-13 at 08 45 21

Sigh..

herrehesse avatar Jul 13 '23 06:07 herrehesse

  • Have you prepared enough tokens for the sector pledge?
  • Are you a data preparer? What is your previous experience as a data-preparer? List previous applications and client IDs
  • How will the data be prepared? Please include tooling used and technical details
  • If you are not preparing the data, who will prepare the data? (Name and Business)
  • Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again?

Sunnyiscoming avatar Jul 13 '23 15:07 Sunnyiscoming

  • I'm not an SP, so I don't need to prepare tokens, is that right?
  • The data will be prepared using singularity, and the relevant technical details are documented in its open-source repository.
  • This dataset has never been stored on Filecoin before.

luhong123 avatar Jul 14 '23 08:07 luhong123

@luhong123 If someone asks you multiple questions and you answer only a select few of them, my doubts about the validity of your request only rise to new heights.

Let me help you do the right thing, here are some steps you can take:

  1. Seek out storage providers in different regions who are willing to store your data and ensure retrievability. Make sure these SPs are not on a blacklist/abuse list.
  2. Start with a smaller data request, such as 100T, and ask for the community's signature. I'm willing to assist you with this.
  3. Store your data, showcase its retrievability, value, and distribution.
  4. If everything goes well and you can establish trust, you can request additional datacap. I'll be the first one to offer my assistance.

By following these steps, you can build trust within the community and demonstrate the worthiness of your data.

herrehesse avatar Jul 16 '23 11:07 herrehesse

Received the business license and the SPs list.

image

  • How will the data be prepared? Please include tooling used and technical details
  • If you are not preparing the data, who will prepare the data? (Name and Business)

Sunnyiscoming avatar Jul 19 '23 15:07 Sunnyiscoming

> @luhong123 If someone asks you multiple questions and you answer only a select few of them, my doubts about the validity of your request only rise to new heights.
>
> Let me help you do the right thing, here are some steps you can take:
>
> 1. Seek out storage providers in different regions who are willing to store your data and ensure retrievability. Make sure these SPs are not on a blacklist/abuse list.
> 2. Start with a smaller data request, such as 100T, and ask for the community's signature. I'm willing to assist you with this.
> 3. Store your data, showcase its retrievability, value, and distribution.
> 4. If everything goes well and you can establish trust, you can request additional datacap. I'll be the first one to offer my assistance.
>
> By following these steps, you can build trust within the community and demonstrate the worthiness of your data.

Why do your comments appear on most newly submitted LDN applications when you are not part of PL? I have seen many new LDN applicants question your identity, and PL has said nothing while you keep blocking these new applications.

luhong123 avatar Jul 20 '23 02:07 luhong123

> Received the business license and the SPs list.
>
> image
>
> How will the data be prepared? Please include tooling used and technical details
>
> If you are not preparing the data, who will prepare the data? (Name and Business)

I am responsible for preparing the data. The tool I am using is singularity, and the relevant details are shown in the screenshot below.

image

luhong123 avatar Jul 20 '23 02:07 luhong123

@Sunnyiscoming

"Received business license and the sps list."

These are not valid: VPN use and fake distribution.

Asking notaries not to engage.

herrehesse avatar Jul 20 '23 14:07 herrehesse

Hello @luhong123 per the new guidelines https://github.com/filecoin-project/notary-governance/issues/922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

ghost avatar Jul 20 '23 20:07 ghost

> Hello @luhong123 per the new guidelines filecoin-project/notary-governance#922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.
>
> This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

Completed

luhong123 avatar Jul 21 '23 10:07 luhong123

@luhong123 thank you for submitting the registration, it was reviewed. You did not provide any SP contact information. Please confirm the SP entities you are working with. We will need proof of location for the SPs you mention:

  • f02337006 eager England
  • f02337010 peter California
  • f01859603 Liu GuangZhou
  • f02215422 Sunny Hong Kong

you can send to: [email protected] cc @Sunnyiscoming
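The best-practice thresholds quoted earlier in the thread (3 or more regions, 4 or more SP operators or owners) can be checked mechanically against a list like the one above. This is a rough sketch only: the location-to-region mapping is an assumption, each name is assumed to be a distinct operator, and the check works on self-reported values, so it says nothing about whether the locations are genuine.

```python
from dataclasses import dataclass

# Thresholds taken from the best-practice comment earlier in this thread.
MIN_REGIONS = 3
MIN_OPERATORS = 4

# ASSUMED mapping from self-reported location to a coarse region.
REGION_OF = {
    "England": "Europe",
    "California": "North America",
    "GuangZhou": "Greater China",
    "Hong Kong": "Greater China",
}


@dataclass(frozen=True)
class Provider:
    miner_id: str
    operator: str  # ASSUMED: each listed name is a separate operator
    location: str  # self-reported, unverified


PROVIDERS = [
    Provider("f02337006", "eager", "England"),
    Provider("f02337010", "peter", "California"),
    Provider("f01859603", "Liu", "GuangZhou"),
    Provider("f02215422", "Sunny", "Hong Kong"),
]


def distribution_summary(providers):
    """Return (distinct regions, distinct operators) for a provider list."""
    regions = {REGION_OF[p.location] for p in providers}
    operators = {p.operator for p in providers}
    return len(regions), len(operators)


def meets_best_practice(providers):
    """True if the list reaches both the region and operator thresholds."""
    n_regions, n_operators = distribution_summary(providers)
    return n_regions >= MIN_REGIONS and n_operators >= MIN_OPERATORS


print(distribution_summary(PROVIDERS), meets_best_practice(PROVIDERS))
```

On paper the list reaches both thresholds, which is why the remaining question in this thread is proof of location rather than the count itself.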

ghost avatar Jul 24 '23 11:07 ghost

@Sunnyiscoming Relevant information has been sent, please check it carefully.

luhong123 avatar Jul 25 '23 11:07 luhong123

@Filplus-govteam @Sunnyiscoming Is there anyone here to review it?

luhong123 avatar Aug 02 '23 16:08 luhong123

I am pushing it.

Sunnyiscoming avatar Aug 05 '23 14:08 Sunnyiscoming

Hello @luhong123 we are waiting for SPs to confirm their locations.

ghost avatar Aug 07 '23 11:08 ghost

> Hello @luhong123 we are waiting for SPs to confirm their locations.

Most of them never got the verification email.

luhong123 avatar Aug 08 '23 02:08 luhong123

Hi @luhong123 actually in your registration you did not provide:

  • any SP business entity names
  • any way to contact these SPs
  • any proof they are located where you say they are

These SPs were also listed in several other applications that are currently being held back for lack of proof of location:

  • https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/2135 (application was removed by applicant)
  • https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/2136 (application was removed by applicant)
  • https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/2137 (applicant cannot clearly prove location or SP entity)
  • https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/2129 (applicant cannot clearly prove location or SP entity)
  • https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/2100 (applicant cannot clearly prove location or SP entity)

Good deal making should involve clearly working with various distributed SP entities. Until then, we'll wait.

ghost avatar Aug 11 '23 14:08 ghost

> Hi @luhong123 actually in your registration you did not provide:
>
> • any SP business entity names

They say they are all teams of individuals, with no business entity.

> • any way to contact these SPs

In the email I provided their contact email addresses.

image

> • any proof they are located where you say they are

Please let me know whether it is me (the applicant) who should provide the supporting documentation for their location. I thought you were asking me for their contact info so you could email them to confirm.

luhong123 avatar Aug 14 '23 09:08 luhong123

OK, thank you, let's see. FYI @Sunnyiscoming

ghost avatar Aug 15 '23 21:08 ghost

Deleting comment

@Sunnyiscoming hasn't the permissions to post this comment.

Please, contact the assignee of this issue.

Datacap Request Trigger

Total DataCap requested

15PiB

Expected weekly DataCap usage rate

1PiB

Client address

f1sapvfym7hgcnydd7j6jnynspdhxo5j3m2fqmrra

Sunnyiscoming avatar Aug 16 '23 15:08 Sunnyiscoming

Please further specify the location and whether data retrieval is supported.

Sunnyiscoming avatar Aug 16 '23 15:08 Sunnyiscoming

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1sapvfym7hgcnydd7j6jnynspdhxo5j3m2fqmrra

DataCap allocation requested

512TiB

Id

1ae24d37-a03e-4b51-b853-0302d230e6fe

> Please further specify the location and whether data retrieval is supported.

They both guarantee support for retrieval

luhong123 avatar Aug 21 '23 02:08 luhong123

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

github-actions[bot] avatar Sep 01 '23 01:09 github-actions[bot]

Keep!

luhong123 avatar Sep 04 '23 01:09 luhong123