HIV-1 protease is encoded by the genomic RNA of this retrovirus. The sequence of 4619 complete HIV-1 isolates is available in the HIV sequence database at Los Alamos National Laboratory, but the current version of the reference sequence for the complete HIV-1 genome is NC_001802.1. The reference sequence represents a clinical isolate that was obtained from an AIDS patient in France in 1983, early in the global HIV outbreak.
HIV-1 protease cDNA sequence in FASTA format:Edit
>ENA|CAA09312|CAA09312.1 Human immunodeficiency virus partial HIV-1 protease : Location:1..297 CCTCAGGTCACTCTTTGGCAACGACCCATAGTCACAATAAAGATAGGGGGGCAACTAAAG GAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTGCCAGGA AAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTATGAT CAGGTATCCATAGAAATCTGCGGACATAAAGCTATAGGTACAGTATTAATAGGACCTACA CCTGTCAACATAATTGGAAGGAATCTGTTGACTCAGCTTGGCTGCACTTTAAATTTT