Skip to content

download uniprot data  #2

@dlemas

Description

@dlemas

This issue is complete when uniprot data is downloaded to UFRC for PPDbench. I have included a brief protocol that can be expanded on with screenshots and better documentation to remove potential ambiguity in the instructions.

Used as input data for Step 4 (step1_pdb_process.py)

  • step 3 will generate "df_predict_det3" file with uniprot IDs
  • https://www.uniprot.org/
  • select "search" in the top right search bar
  • select "customize columns"
  • select "Protein names", "Sequence", "Protein families" & then close.
  • select "List" in the top search bar. this will bring you to an area to enter "Uniprot IDs" from Step3. Select search.
  • select "map IDs"
  • download data
  • select TSV format, don't compress, customize the columns & rearrange to meet: From Entry Protein names Sequence Protein families
  • once the file is downloaded, open file, rename files for script input, and run step 4.

Metadata

Metadata

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions