GENSCAN 1.0 Date run: 2-Jun-117 Time: 11:46:51 Sequence gi568815597r:27000310_27254334 : 254025 bp : 49.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 3423 3418 6 1.05 1.02 Term - 6648 5635 1014 0 0 160 41 1724 0.997 167.01 1.01 Init - 12358 12098 261 1 0 72 94 337 0.879 27.76 1.00 Prom - 29672 29633 40 -2.46 2.05 PlyA - 30316 30311 6 1.05 2.04 Term - 31953 31876 78 0 0 62 37 43 0.048 -5.64 2.03 Intr - 34117 33979 139 0 1 21 96 96 0.248 4.17 2.02 Intr - 39301 39153 149 1 2 51 75 63 0.213 0.33 2.01 Init - 40776 40741 36 0 0 82 92 62 0.621 6.04 2.00 Prom - 47983 47944 40 -5.26 3.04 PlyA - 55957 55952 6 1.05 3.03 Term - 60808 60680 129 1 0 91 49 87 0.632 3.28 3.02 Intr - 62755 62622 134 2 2 84 48 11 0.166 -2.94 3.01 Init - 72981 72855 127 0 1 70 64 101 0.392 6.22 3.00 Prom - 82101 82062 40 -3.66 4.18 PlyA - 82185 82180 6 1.05 4.17 Term - 100335 99998 338 1 2 85 41 455 0.958 35.23 4.16 Intr - 100966 100894 73 1 1 109 110 137 0.999 16.98 4.15 Intr - 101517 101416 102 0 0 113 63 204 0.999 20.77 4.14 Intr - 101821 101707 115 0 1 70 76 177 0.998 15.15 4.13 Intr - 102249 102076 174 2 0 95 73 322 0.999 30.45 4.12 Intr - 102434 102364 71 2 2 77 58 120 0.990 5.88 4.11 Intr - 103003 102914 90 1 0 86 84 172 0.959 16.79 4.10 Intr - 105778 105576 203 2 2 151 53 477 0.999 49.50 4.09 Intr - 107556 107339 218 2 2 121 68 378 0.990 37.25 4.08 Intr - 109468 109218 251 1 2 100 69 569 0.591 52.44 4.07 Intr - 113977 113517 461 2 2 119 65 963 0.178 90.00 4.06 Intr - 123864 123825 40 0 1 115 84 34 0.219 3.50 4.05 Intr - 127915 127757 159 1 0 46 101 73 0.321 4.58 4.04 Intr - 132695 132603 93 2 0 112 81 -6 0.291 1.26 4.03 Intr - 141649 141517 133 0 1 74 96 57 0.204 5.75 4.02 Intr - 149535 149352 184 1 1 59 40 100 0.356 1.15 4.01 Init - 154025 153674 352 2 1 93 97 313 0.893 28.02 4.00 Prom - 154855 154816 40 -3.66 5.00 Prom + 166650 166689 40 -4.46 5.01 Sngl + 176774 176968 195 1 0 87 43 314 0.974 20.04 5.02 PlyA + 176999 177004 6 1.05 6.03 PlyA - 178160 178155 6 1.05 6.02 Term - 200836 200521 316 0 1 -35 54 325 0.943 11.21 6.01 Init - 201164 200881 284 2 2 97 -29 412 0.962 26.91 6.00 Prom - 202569 202530 40 -6.76 7.00 Prom + 206307 206346 40 -7.06 7.01 Sngl + 206639 207193 555 1 0 82 50 586 0.145 50.33 7.02 PlyA + 207634 207639 6 1.05 8.03 PlyA - 207675 207670 6 1.05 8.02 Term - 234347 234223 125 0 2 121 42 86 0.743 5.85 8.01 Init - 237210 237123 88 0 1 56 100 23 0.329 1.11 8.00 Prom - 247068 247029 40 -0.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_1|424_aa MPSESGAERRDRAAAQVGTAAATAVATAAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLD ALLSEPIPIHGRGNFPTLSVQPRQIVQVVRSTLEEQGLHVHSVRLHGSAASHVLHPESGL GYKDLDLVFRVDLRSEASFQLTKAVVLACLLDFLPAGVSRAKITPLTLKEAYVQKLVKVC TDSDRWSLISLSNKSGKNVELKFVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAF HPTVTGESLYGDFTEALEHLRHRVIATRSPEEIRGGGLLKYCHLLVRGFRPRPSTDVRAL QRYMCSRFFIDFPDLVEQRRTLERYLEAHFGGADAARRYACLVTLHRVVNESTVCLMNHE RRQTLDLIAALALQALAEQGPAATAALAWRPPGTDGVVPATVNYYVTPVQPLLAHAYPTW LPCN >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_1|1275_bp atgccgtcggagagcggagctgagcgcagggaccgggcggctgctcaggtggggacggct gcggccacggcggtggccacggcagccccggcaggcggcggccccgacccggaggcctta tcggccttccccggacggcacctgagtgggctgagctggccacaggtgaagcgactggac gctcttctgagcgagccgattcccattcacgggcgcggcaacttccccacgctgagcgtg cagccccggcagatcgtgcaggtggtccgcagcaccctggaggagcagggactacatgtg cacagtgtgcggctgcatggttcagctgccagccacgtgctgcaccctgagagtggcctg ggctacaaggatctggacctggtgttccgggtggacctgcgcagtgaggcatccttccag ctgaccaaggcagtggtgctggcctgcctactagacttcctgccggccggtgtgagccgg gccaagatcacgccactgacactcaaggaggcatacgtgcagaagctggtgaaagtgtgc acagactcggaccgctggagcctcatctcactgtccaacaagagcggcaagaacgtggag ctcaagtttgtggactcggtgagacgccagtttgaattcagcatagactccttccagatc atcctggactccctgttgctctttggccagtgctcgtccactcccatgtctgaggccttc cacccaacggtcacaggcgaaagcctgtacggggacttcaccgaggccctggagcacctg cggcaccgtgtcatcgccacgcgcagtcccgaggagatccgaggtggtggcctcctcaag tactgccacctcctggtgcggggcttccggccccggcccagcaccgatgtgcgcgccctg cagcgctacatgtgctcccgcttcttcatcgactttccagacctggtggagcagcggcgc accctagagcgctacctggaggcccacttcggtggggcagatgcagcccgccgttacgcc tgcctggtgacactgcaccgggtggtcaacgagagcaccgtgtgcctcatgaaccacgag cgccgccagacgctggacctcattgccgcactggcgctgcaggcactggctgagcagggc ccagctgccactgccgccctggcctggcgccctccaggcactgacggggttgtgccagcc actgtcaattactacgtgacccccgtgcaacctctcctggctcacgcctatcccacctgg ctgccttgtaactga >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_2|133_aa MVPGFQYILVNEAKTGQTGERRPCNEFGTCEAAVASPPRSPTVKEEEEETERRGLREGEP GSHNHNHSHTNRHKGRTGSPQGYPNLASQQDPSPFLGPVEGNGTVLEQFCHLSNGKNTEF MRIKHDSGKRSVL >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_2|402_bp atggtgccgggcttccagtatatcctggtgaacgaggctaaaacaggccaaactggggag agaagaccatgcaatgagtttgggacatgtgaggctgcggtggccagccccccgagaagc cccaccgtgaaggaggaagaggaagagaccgagaggagaggactcagggagggggagcct ggaagccacaatcataatcacagtcacaccaacaggcacaagggcaggacaggaagtcct caagggtaccctaacttggcaagccagcaggaccccagcccctttctgggccccgtggaa ggcaacgggactgtcctggaacagttttgccatctgtcaaatggcaagaacaccgagttt atgaggataaaacatgacagtggcaagagaagtgtcctgtaa >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_3|129_aa METQVQLDNKDPSKEDKLLSSMKENVCVTRQSPAGWLSHKGSAPQHPLLHQQFCTAENGG HFHCGRCGKDRPFTHGGTGKHFQKSSLDTHPSCLTVELRLLTPLAASKGKRPRQSFSSDV LLEDVLLVL >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_3|390_bp atggaaacgcaagtccagctcgacaacaaggatccatcgaaggaggacaaattgctgtct tccatgaaggaaaatgtttgtgtaaccaggcagtcaccagctggctggctgagccacaag ggctcagctccccaacacccacttctccaccagcaattctgcacagcagagaatggaggc cacttccactgtggaaggtgtgggaaggacaggcctttcacccacgggggaactggaaaa catttccagaaaagcagcctggatacacaccccagctgcctgactgtggagctccgcctt ctaacccctctggcagcatccaaaggcaaaaggccacggcaaagcttctcctcagacgtc ctccttgaagacgtcctccttgttttataa >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_4|1018_aa MVLRSGICGLSPHRIFPSLLVVVALVGLLPVLRSHGLQLSPTASTIRSSEPPRERSIGDV TTAPPEVTPESRPVNHSVTDHGMKPRKAFPVLGIDYTHVRTPFEISLWILLACLMKIASP MASSPGTKERRSQKPWVLLLMLPLSTCVTMNLEVHAPHSLNLTCLILKTNVEAERGNAWS CSPSQANVELGCCAFYSIFAIKPGNKILSPRWPGPTGIWMLYQEAFGMIRFPWYGARWVP LSRVFSYHVTVQYQVTEKLRFRIGTETTSLIDGPELFTLPAAASCKLAMARMLFCTQMPQ LLAWTEMVWGKLVPGAPDTICFHVIPTISSIVPESCLLIVVGLLVGGLIKGVGETPPFLQ SDVFFLFLLPPIILDAGYFLPLRQFTENLGTILIFAVVGTLWNAFFLGGLMYAVCLVGGE QINNIGLLDNLLFGSIISAVDPVAVLAVFEEIHINELLHILVFGESLLNDAVTVVLYHLF EEFANYEHVGIVDIFLGFLSFFVVALGGVLVGVVYGVIAAFTSRFTSHIRVIEPLFVFLY SYMAYLSAELFHLSGIMALIASGVVMRPYVEANISHKSHTTIKYFLKMWSSVSETLIFIF LGVSTVAGSHHWNWTFVISTLLFCLIARVLGVLGLTWFINKFRIVKLTPKDQFIIAYGGL RGAIAFSLGYLLDKKHFPMCDLFLTAIITVIFFTVFVQGMTIRPLVDLLAVKKKQETKRS INEEIHTQFLDHLLTGIEDICGHYGHHHWKDKLNRFNKKYVKKCLIAGERSKEPQLIAFY HKMEMKQAIELVESGGMGKIPSAVSTVSMQNIHPKSLPSERILPALSKDKEEEIRKILRN NLQKTRQRLRSYNRHTLVADPYEEAWNQMLLRRQKARQLEQKINNYLTVPAHKLDSPTMS RARIGSDPLAYEPKEDLPVITIDPASPQSPESVDLVNEELKGKVLGLSRDPAKVAEEDED DDGGIMMRSKETSSPGTDDVFTPAPSDSPSSQRIQRCLSDPGPHPEPGEGEPFFPKGQ >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_4|3057_bp atggttctgcggtctggcatctgtggcctctctccacatcggatcttcccttccttactc gtggtggttgctttggtggggctgctgcctgttctcaggagccatggcctccagctcagc ccaactgccagcaccattcgaagctcagagccaccacgagaacgctcgattggggatgtc accaccgctccaccggaggtcaccccagagagccgccctgttaatcattccgtcactgat catggcatgaagccgcgcaaggcctttccagtcctgggcatcgactacacacacgtgcgc acccccttcgagatctccctctggatccttctggcctgcctcatgaagatagccagcccc atggcgtccagccctggtacaaaggaaagacgtagtcagaagccctgggttctgctcctg atgctgcctctcagtacctgtgtgaccatgaaccttgaggttcatgcacctcattctctg aacctgacttgcctcatccttaagaccaacgtagaggctgaacgtgggaatgcctggtcg tgtagtccatcacaagccaatgttgaacttgggtgctgtgctttttatagcatctttgcc atcaagccagggaataagatcttatctccccggtggccaggacccacaggaatctggatg ctgtaccaggaagcatttggcatgataagatttccatggtacggggccaggtgggtccca ctctcccgggtgtttagctaccacgtcacagtgcagtaccaggtcacagagaagttaaga tttagaattgggacagaaactacctccctcattgatggaccagagctctttacgctgccc gctgctgcctcgtgcaagcttgctatggcacgaatgctattctgtacccagatgccacag ttgctggcatggacagagatggtgtggggcaaactggtacctggagcacctgataccatc tgtttccatgtgatccccactatctcaagcatcgtcccggagagctgcctgctgatcgtg gtggggctgctggtggggggcctgatcaagggtgtaggcgagacaccccccttcctgcag tccgacgtcttcttcctcttcctgctgccgcccatcatcctggatgcgggctacttcctg ccactgcggcagttcacagaaaacctgggcaccatcctgatctttgccgtggtgggcacg ctgtggaacgccttcttcctgggcggcctcatgtacgccgtgtgcctggtgggcggtgag cagatcaacaacatcggcctcctggacaacctgctcttcggcagcatcatctcggccgtg gaccccgtggcggttctggctgtctttgaggaaattcacatcaatgagctgctgcacatc cttgtttttggggagtccttgctcaatgacgccgtcactgtggtcctgtatcacctcttt gaggagtttgccaactacgaacacgtgggcatcgtggacatcttcctcggcttcctgagc ttcttcgtggtggccctgggcggggtgcttgtgggcgtggtctacggggtcatcgcagcc ttcacctcccgatttacctcccacatccgggtcatcgagccgctcttcgtcttcctctac agctacatggcctacttgtcagccgagctcttccacctgtcaggcatcatggcgctcata gcctcaggagtggtgatgcgcccctatgtggaggccaacatctcccacaagtcccacacc accatcaaatacttcctgaagatgtggagcagcgtcagcgagaccctcatcttcatcttc ctcggcgtctccacggtggccggctcccaccactggaactggaccttcgtcatcagcacc ctgctcttctgcctcatcgcccgcgtgctgggggtgctgggcctgacctggttcatcaac aagttccgtatcgtgaagctgacccccaaggaccagttcatcatcgcctatgggggcctg cgaggggccatcgccttctctctgggctacctcctggacaagaagcacttccccatgtgt gacctgttcctcactgccatcatcactgtcatcttcttcaccgtctttgtgcagggcatg accattcggcccctggtagacctgttggctgtgaagaaaaagcaagagacgaagcgctcc atcaacgaagagatccacacacagttcctggaccaccttctgacaggcatcgaagacatc tgtggccactacggtcaccaccactggaaggacaagctcaaccggtttaataagaaatat gtgaagaagtgtctgatagctggcgagcgctccaaggagccccagctcattgccttctac cacaagatggagatgaagcaggccatcgagctggtggagagcgggggcatgggcaagatc ccctctgccgtctccaccgtctccatgcagaacatccaccccaagtccctgccttccgag cgcatcctgccagcactgtccaaggacaaggaggaggagatccgcaaaatcctgaggaac aacttgcagaagaccaggcagcggctgcggtcctacaacagacacacgctggtggcagac ccctacgaggaagcctggaaccagatgctgctccggaggcagaaggcccggcagctggag cagaagatcaacaactacctgacggtgccagcccacaagctggactcacccaccatgtct cgggcccgcatcggctcagacccactggcctatgagccgaaggaggacctgcctgtcatc accatcgacccggcttccccgcagtcacccgagtctgtggacctggtgaatgaagagctg aagggcaaagtcttagggttgagccgggatcctgcaaaggtggctgaggaggacgaggac gacgatgggggcatcatgatgcggagcaaggagacttcgtccccaggaaccgacgatgtc ttcacccccgcgcccagtgacagccccagctcccagaggatacagcgctgcctcagtgac ccaggcccacaccctgagcctggggagggagaaccgttcttccccaaggggcagtaa >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_5|64_aa MGARHRAWTHSIQITKVEAIAASKCPRPAVKQFHDSKIKFPLPHRVLRCQHKPHFTTRRP NTFF >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_5|195_bp atgggcgcccggcaccgcgcctggacccattccatccagatcacgaaggtggaggcgatc gcagccagcaagtgcccgcggccggccgtcaagcagttccacgactccaagatcaagttt ccgctaccccaccgggtcctgcgctgtcagcacaagccacacttcaccacccggagaccc aacaccttcttctag >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_6|199_aa MGGTTSTRRVTFEADENENITVVKGIRLSENVIDRMKESSPSGSKSQRYSGAYGASVSDE ELKRRVAEELALEQAKKESEDEKRLKQPKSWTERGISSEEERAKAKHLARQLEEKDRVLK KQDAFYNEQLARLEKRNSEFFRVTTEQYQKAAEEVEAKFKRYESHPVCADLQAKILQCYR ENTHQTFKCSTLATQSGHA >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_6|600_bp atgggtgggaccaccagcacccgccgggtcaccttcgaggcggacgagaatgagaacatt accgtggtgaagggcatccggctttcagaaaatgtgattgatcgaatgaaggaatcctct ccatctggttcgaagtctcagcggtattctggtgcttatggtgcctcagtttctgatgaa gaattgaaaagaagagtagctgaggagctggcattggagcaagccaagaaagaatctgaa gatgagaaacgactaaagcagccaaagagctggaccgagagagggatatctagcgaggag gaacgcgctaaggcaaagcacctggctaggcagctggaagagaaagaccgagtgctaaag aagcaggatgcattctacaacgaacagctggctagactggagaagaggaactcagagttc ttcagagtcaccactgaacaatatcagaaagctgctgaagaggtggaagcaaagttcaag cggtatgagtctcatccagtctgtgctgatctgcaggccaaaattcttcagtgttaccgt gagaacacccaccagaccttcaaatgctccactctggccacccagagtggccatgcttga >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_7|184_aa MDMSPLRPQNYLFGCELKADKDDHFKVDNDENEHQLSLRTVSLGAGTKDELHIVEAEAMN YKGSPIKVTLATLKMSAQPTVSLGGFEITPPVVLRFKCGSGPVHISGQHLVAVEEDAESE DEEEEDVKLLSISGKRSAPGGGSKVPQKKVKLAVDEDDDDDDDDDDDDFDDEEAEEKVPV KKSI >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_7|555_bp atggacatgagccccctgaggccccagaactatcttttcggttgtgaactaaaggctgac aaagatgatcactttaaggtggataatgatgaaaatgagcaccagttatctttaagaacg gtcagtttaggggctggtacaaaggatgaattgcacattgttgaagcagaggcaatgaat tacaaaggcagtccaattaaagtaacactggcaactttgaaaatgtctgcacagccaaca gtttcccttgggggctttgaaataacaccaccagtggtcttaagatttaagtgtggttca gggccagtgcatattagtggacagcacttagtagctgtggaggaagatgcagagtcagaa gatgaagaggaggaggatgtgaaactcttaagtatatctggaaagcggtctgcccctgga ggtggtagcaaggttccacagaaaaaagtaaaacttgctgttgatgaagatgatgatgat gatgatgatgatgatgatgatgattttgatgatgaggaagctgaagaaaaagtgccagtg aagaaatctatatga >gi568815597r:27000310_27254334|GENSCAN_predicted_peptide_8|70_aa MTAKKVQPKGFVGPAICNGSLKIQWSIWKAVWGPRRSPATRLRHAVSASALRLAAGGYAT KTAPGRFQNG >gi568815597r:27000310_27254334|GENSCAN_predicted_CDS_8|213_bp atgacagctaagaaagtacaacctaagggttttgtgggcccagcaatctgcaatggcagc ctaaagatacagtggagcatctggaaagctgtctggggacccaggagaagcccagcaacg cgcctgcgccacgcagtctccgcctccgccctgcgcctcgcagccggcgggtacgctacg aagactgcgcccggccgcttccagaatggctag