GENSCAN 1.0	Date run: 8-Nov-116	Time: 14:31:10

Sequence gi568815587f:68584924_68790984 : 206061 bp : 47.49% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3004   3101   98  0  2   32   89   163 0.504   9.61
 1.02 Intr +   5737   5791   55  1  1   93   84    55 0.990   4.58
 1.03 Intr +   6653   6783  131  1  2   73   67   102 0.996   6.09
 1.04 Intr +  11174  11295  122  2  2   84  111   146 0.877  16.64
 1.05 Intr +  15418  15571  154  2  1   74   83    54 0.847   2.63
 1.06 Intr +  16940  17046  107  2  2   66  100    63 0.772   5.16
 1.07 Intr +  18419  18569  151  0  1   99   86   191 0.598  19.22
 1.08 Term +  24330  24447  118  0  1   96   54    21 0.055  -2.49
 1.09 PlyA +  25876  25881    6                               1.05

 2.00 Prom +  34254  34293   40                              -5.36
 2.01 Init +  38367  38424   58  2  1   42  107    36 0.821   2.37
 2.02 Intr +  39963  40096  134  1  2  115   80   106 0.951  12.86
 2.03 Term +  51052  51258  207  0  0   58   48   132 0.598   3.54
 2.04 PlyA +  53669  53674    6                               1.05

 3.00 Prom +  54383  54422   40                              -5.56
 3.01 Init +  55243  55296   54  0  0   63   63    52 0.310   1.48
 3.02 Intr +  56289  56411  123  2  0   96   60    18 0.097   0.58
 3.03 Term +  64985  65512  528  1  0   87   44   295 0.080  19.25
 3.04 PlyA +  65917  65922    6                               1.05

 4.00 Prom +  75625  75664   40                              -4.76
 4.01 Init +  83083  83278  196  0  1   90   77   154 0.978  13.66
 4.02 Intr +  89247  89362  116  1  2   56   46    95 0.311   2.27
 4.03 Intr +  91033  91061   29  0  2  133   92     1 0.381   1.91
 4.04 Term +  92770  92875  106  1  1   69   55    74 0.395   0.08
 4.05 PlyA +  93273  93278    6                               1.05

 5.00 Prom +  93556  93595   40                              -1.16
 5.01 Init + 100001 100081   81  1  0   56  131   195 0.880  19.57
 5.02 Intr + 100671 100725   55  2  1  114  113    84 0.998  12.05
 5.03 Intr + 103091 103177   87  0  0   45   94   149 0.427  11.14
 5.04 Intr + 103926 104003   78  1  0   99   95    40 0.975   5.32
 5.05 Intr + 107386 107467   82  2  1  126   56    26 0.155   1.90
 5.06 Intr + 108740 108773   34  2  1  106  100    17 0.052   2.93
 5.07 Term + 124439 124582  144  1  0  126   43   119 0.868   9.11
 5.08 PlyA + 124710 124715    6                               1.05

 6.19 PlyA - 125383 125378    6                              -0.45
 6.18 Term - 128027 127966   62  0  2   90   43    46 0.408  -1.73
 6.17 Intr - 128484 128307  178  0  1   74   99    50 0.503   4.19
 6.16 Intr - 131016 130914  103  2  1  157   85    92 0.997  15.98
 6.15 Intr - 143126 142875  252  1  0   91   42   117 0.005   3.95
 6.14 Intr - 151092 151055   38  0  2   86   81     1 0.007  -3.74
 6.13 Intr - 152593 152558   36  1  0   39  111    63 0.019   2.16
 6.12 Intr - 162443 162285  159  2  0   86   46   102 0.511   5.98
 6.11 Intr - 165776 165267  510  2  0  102   63   815 0.984  73.57
 6.10 Intr - 166610 166497  114  2  0  126   96    57 0.999  10.94
 6.09 Intr - 174738 174646   93  0  0   89   92   125 0.999  13.16
 6.08 Intr - 175415 175302  114  2  0   42  117   111 0.996  10.04
 6.07 Intr - 176764 176612  153  1  0   81  100    51 0.980   5.87
 6.06 Intr - 177838 177704  135  1  0  109   47   161 0.652  14.86
 6.05 Intr - 188506 188342  165  1  0   93  116   172 0.999  20.66
 6.04 Intr - 190509 190393  117  0  0   67  100   152 0.985  14.96
 6.03 Intr - 195822 195717  106  2  1   73  105   156 0.999  16.02
 6.02 Intr - 197036 196848  189  1  0   89   86   159 0.947  14.50
 6.01 Init - 200073 199892  182  0  2   31   85   299 0.713  22.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_1|311_aa
AFSDYQMQQMTSNFIDQFGFNDEKFADQDDIGNVSFDRVSDINFTLNTNESGNIALFEAC
CKERIQQFDDGGSDEEDIWEEKHIAFTPESQRRSSSGSTDSEESTDSEEEDGAKQDLFEP
SSANTEDKMEVDLSEPPNWSANFDVPMETTHGAPLDSVGSDVWSTEEPMPTKETGWASFS
EFTSSLSTKDSLRSNSPVEMETSTEPMDPLTPSAAALAVQPEAAGSVAMEASSDGEEDAE
STDKVTETVMNGGMKETLSLTVDAKTETAVFKRISSWPPLLPLAFVLLTHCSRATRGAVK
SDAVNTVTSCD

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_1|936_bp
gccttttctgattatcagatgcaacaaatgacgtccaattttattgaccagtttggcttc
aacgatgagaagtttgcagatcaagatgacattggcaatgtttcttttgatcgagtatca
gacatcaactttactctcaatacaaatgaaagtggaaatattgccttgtttgaagcatgt
tgtaaggaaagaatacaacagtttgatgatggtggctctgatgaggaagatatatgggag
gaaaagcacatcgcattcacaccagaatcccaaagacgatccagctcggggagtacagac
agtgaggaaagtacagactctgaagaagaagatggagcaaagcaagacttgtttgaaccc
agcagtgccaacacggaggataaaatggaggtggacctgagtgaaccacccaactggtca
gctaactttgatgtcccaatggaaacaacccacggtgctccattggattctgtgggatct
gatgtctggagcacagaggagccgatgccaactaaagagacgggctgggcttctttttca
gagttcacgtcttccctgagcacaaaagattctttaaggagtaattctccagtggaaatg
gaaaccagcactgaacccatggaccctctgactcccagtgcggctgccctggcagtgcag
ccagaagcggcaggcagtgtggccatggaagccagctctgacggagaggaggatgcagaa
agtacagacaaggtaactgagacagtgatgaatggcggcatgaaggaaacgctcagcctc
actgtagatgccaagacagagactgcggtcttcaaaaggatcagctcttggccccctctt
ctccctctggcgtttgtgcttctcacacattgttcgagggcaactcgaggagccgtgaaa
tccgatgcagtcaacacggtgacctcatgtgactga

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_2|132_aa
MTSGPQTNQPKKHLTNFKSDERERDFLKSNLTLITAGFTSQASRKALEQFPERIPNGTTR
QIPQELATSARNLATRPRNAYSPEFLLSCIPSVRDPTGNRTVQLTWQPLPELLEVWPKAD
CFPDLLGLAAED

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_2|399_bp
atgacctcaggtcctcagaccaaccagcccaagaaacatctcaccaatttcaaatctgat
gaacgggaaagagattttctcaagtccaatctcacgctgataaccgctggcttcacgagc
caggcctccaggaaggcattagagcagtttcctgagaggatccccaatggaactaccagg
caaattccccaggagcttgctacaagtgccagaaatctggccaccaggccaaggaatgcc
tacagcccagaattcctcctaagctgcatcccatctgtgcgggaccccactggaaatcgg
actgttcaactcacctggcagccactcccagagctcctggaagtctggcccaaggctgac
tgcttcccagatcttcttggcttagcggctgaagactga

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_3|234_aa
MDSRDPENSGSGTLNDGLSWLCKVSGSGSKGGVCLPEDRTTSCSFHQGTAGSFWPSQQAR
GQFHSHRTLQASCPRLCPGRCFLPGPWSAPRTPPQIPATAALFGVSDGELSFLCPRSLQG
SAPATVAPLSRLLCWMLGFSPGPSGALEPSSAPRPGPVMGHFRALDGGPAPWRRISHPWQ
TPAIRILPFLPAPLFLRVAQKQGWSFRWLLLHTRPWGHRGPGDIMVPASMELTL

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_3|705_bp
atggactcaagagacccggagaacagcggaagtgggactcttaatgatggtctttcttgg
ctctgcaaggttagcggttctggttccaaaggaggtgtttgcttgccagaggacagaact
acaagctgtagcttccaccagggtactgcaggctccttttggcccagccagcaagcccga
ggacagtttcactcccaccggaccctccaagcctcttgcccacggctgtgtcctggccgc
tgcttcctgcctgggccctggagtgctccgaggacccctccccaaatcccagccaccgcc
gccctgtttggcgtctctgatggagagctgagcttcctgtgtcctcggagcctgcaggga
tcagcacccgccacggtagctcccctttcccgcctgctctgctggatgctgggtttcagt
ccaggtccttcgggcgccctggaaccctcttccgctccccgacctgggcctgtcatgggc
cactttagggccctggatggcggccctgctccctggcgccgcatcagccatccgtggcag
actcccgccatccggattctccctttcctgccggcacctctcttcttgcgagtggcccag
aagcaggggtggtcgttccgctggctcctgctgcacaccaggccctggggacatcgtggc
cctggggacatcatggtccctgcctccatggagctgacgctctag

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_4|148_aa
MIPKLPRRRNTCLGITLNGTTCYPPSPPCTPHPPAVPSRPLGHRGLGFRKKTADESPVMD
PIVPHVFSLPGDAAPLGGQLLGTGLPFGQLARQLCSRKWRALAMRGIGELGWVGLSYPEN
PQLDESPRAQVDSRTPGCDPASDQQGGP

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_4|447_bp
atgatccccaaactacctcgccggcgcaacacgtgcctgggcatcacgctgaacggcacg
acctgctaccctccttccccgccctgcaccccccaccccccagccgtgccatcaaggcct
ctgggccaccgaggacttggattcagaaagaaaacagcagatgaaagtcctgtaatggac
cccattgtgccccatgtgttcagtctccctggtgatgcagccccactcggtggccagctg
ctgggcacggggctgccgttcggacagcttgcaagacagctgtgctcccggaaatggagg
gccctggcgatgaggggcattggggagttgggttgggttgggctgagctacccagaaaac
ccccagctggatgagtcacccagggcacaggtggattcacggactcccggctgtgaccct
gcctctgaccagcaaggaggtccctga

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_5|186_aa
MARGSALLLASLLLAAALSASAGLWSPAKEKRGWTLNSAGYLLGPHAVGNHRSFSDKNGL
TSKRELRPEDDMKPGSFDRSIPENNIMRTIIEFLSFLHLKGSTLTLGGLGSPALQTTPSP
EVVTSSTWYKRSVAGGSQKAHVQPEKAKVLRTRLSNLGVLLSGQPSPSMPLGAVKEVLED
KLNRIT

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_5|561_bp
atggcccgaggcagcgccctcctgctcgcctccctcctcctcgccgcggccctttctgcc
tctgcggggctctggtcgccggccaaggaaaaacgaggctggaccctgaacagcgcgggc
tacctgctgggcccacatgccgttggcaaccacaggtcattcagcgacaagaatggcctc
accagcaagcgggagctgcggcccgaagatgacatgaaaccaggaagctttgacaggtcc
atacctgaaaacaatatcatgcgcacaatcattgagtttctgtctttcttgcatctcaaa
ggctccacgctcacgcttggtggccttgggtcgccggcgcttcagaccactcccagcccg
gaggtagtgacgtcatccacctggtataaacggagtgtggcaggtggcagccagaaggcc
cacgtgcagccagagaaggccaaggttctgaggactcgcctgtcaaaccttggggtcctg
ctgtcaggacagccttccccatcaatgccgcttggagctgtgaaagaggttctggaagac
aagttgaaccgcatcacatag

>gi568815587f:68584924_68790984|GENSCAN_predicted_peptide_6|901_aa
MRDSKHIVVYHRGRYFKVWLYHDGRLLKPREMEQQMQRILDNTSEPQPGEARLAALTAGD
RVPWARCRQAYFGRGKNKQSLDAVEKAAFFVTLDETEEGYRSEDPDTSMDSYAKSLLHGR
CYDRWFDKSFTFVVFKNGKMGLNAEHSWADAPIVAHLWEYVMSIDSLQLGYAEDGHCKGD
INPNIPYPTRLQWDIPGECQEVIETSLNTANLLANDVDFHSFPFVAFGKGIIKKCRTSPD
AFVQLALQLAHYKDMGKFCLTYEASMTRLFREGRTETVRSCTTESCDFVRAMVDPAQTVE
QRLKLFKLASEKHQHMYRLAMTGSGIDRHLFCLYVVSKYLAVESPFLKEVLSEPWRLSTS
QTPQQQVELFDLENNPEYVSSGGGFGPVADDGYGVSYILVGENLINFHISSKFSCPETVP
VPHPSPFSNRQRPPAPEARPTPHPARSTLRQPRRPQVRGPRHPAPPPCAMEEGPLPGGLP
SPEDAMVTELLSPEGPFASENIGLKAPVKYEEDEFHVFKEAYLGPADPKEPVLHAFNPAL
GADCKGQVKAKLAGGDSDGGELLGEYPGIPELSALEDVALLQAPQPPACNVHFLSSLLPA
HRSPAVLPLGAWVLEGASHPGVRMIPVEIKEAGGTTTSNNPEEATLQNLLAQESCCKFPS
SQELEDASCCSLKKDSNPMSSAIGHGFVMALMGTERAVHLEPWKEYGGLRTGRETGNRQL
VEKSEHTQHSSVKFMFLHGHGSWYSKTIALKFVLLYGCGLWYSKTITLSDIKERGLQITI
TDITVMKRYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQ
LGNVKPQHNKGCNCRRSGCLKNYCECYEVSAQLPNSLTKNTGWKLPEGSRASSSLSKAQV
L

>gi568815587f:68584924_68790984|GENSCAN_predicted_CDS_6|2706_bp
atgagagacagcaagcacatcgtcgtgtaccatcgaggacgctacttcaaggtctggctc
taccatgatgggcggctgctgaagccccgggagatggagcagcagatgcagaggatcctg
gacaatacctcggagcctcagcccggggaggccaggctggcagccctcaccgcaggagac
agagttccctgggccaggtgtcgtcaggcctattttggacgtgggaaaaataagcagtct
cttgatgctgtggagaaagcagcgttctttgtgacgttagatgaaactgaagaaggatac
agaagtgaagacccggatacgtcaatggacagctacgccaaatctctactacacggccga
tgttacgacaggtggtttgacaagtcgttcacgtttgttgtcttcaaaaacgggaagatg
ggcctcaacgctgaacactcctgggcagatgcgccgatcgtggcccacctttgggagtac
gtcatgtccattgacagcctccagctgggctatgcggaggatgggcactgcaaaggcgac
atcaatccgaacattccgtaccccaccaggctgcagtgggacatcccgggggaatgtcaa
gaggttatagagacctccctgaacaccgcaaatcttctggcaaacgacgtggatttccat
tccttcccattcgtagcctttggtaaaggaatcatcaagaaatgtcgcacgagcccagac
gcctttgtgcagctggccctccagctggcgcactacaaggacatgggcaagttttgcctc
acatacgaggcctccatgacccggctcttccgagaggggaggacggagaccgtgcgctcc
tgcaccactgagtcatgcgacttcgtgcgggccatggtggacccggcccagacggtggaa
cagaggctgaagttgttcaagttggcgtctgagaagcatcagcatatgtatcgcctcgcc
atgaccggctctgggatcgatcgtcacctcttctgcctttacgtggtgtctaaatatctc
gctgtggagtcccctttccttaaggaagttttatctgagccttggagattatcaacaagc
cagacccctcagcagcaagtggagctgtttgacttggagaataacccagagtacgtgtcc
agcggagggggctttggaccggttgctgatgacggctatggtgtgtcgtacatccttgtg
ggagagaacctcatcaatttccacatttcttccaagttctcttgccctgagacggttcct
gtccctcacccttcccccttctccaaccgccaacggccgcctgcgcctgaggcccggccc
acgccccatcccgctcgctccacgctgcggcagccccggcggccccaggtgcgcggcccc
cgccatcccgccccgccgccctgcgccatggaggagggccctctgccgggcgggctgccc
agccccgaggatgcgatggtgacggagctcttaagccccgagggtccgttcgcttcggag
aacatcggcctgaaggcccccgtgaagtacgaggaggacgagttccacgtcttcaaagaa
gcgtacctgggcccggcggaccccaaggaacccgtcctgcacgcgttcaaccccgcgctg
ggcgccgactgcaagggccaggtcaaggcgaagctcgcggggggcgacagcgacggcggg
gagctcctcggggagtaccccgggatcccagagctcagcgcgctggaggacgtcgcgctc
ctgcaggccccgcagccgcccgcctgcaacgtgcacttcctgtcctcgctgctacccgcg
caccgcagcccggcggtgttgcccctgggcgcctgggtcctggaaggagcctcccacccg
ggcgtccgcatgatcccagttgaaatcaaggaagcaggtggtactactacaagtaataat
ccggaagaagcaactttgcagaatcttcttgctcaggaatcctgttgcaagttcccatcg
tcccaggaactagaggatgcctcctgctgttctcttaagaaagattccaacccaatgagt
tctgccatcgggcatggcttcgtgatggccttgatgggcactgagagagcagtccatctg
gagccatggaaggaatatggaggcctgaggacagggagagagacagggaaccgccagttg
gtggagaagtcagaacacactcagcattcatcagttaagttcatgttcttacatggacat
ggctcgtggtactccaaaacaattgcacttaagttcgtgctcttatatggatgtggcttg
tggtactccaaaacaattacacttagtgacatcaaagaacgaggattgcagatcactata
acagacataacagtaatgaaaaggtactgtgactgctttgccagtggggacttttgcaac
aactgcaattgtaataattgttgcaacaacttgcatcatgatattgaacggtttaaagcc
attaaggcatgtcttggtagaaatccagaagctttccagccaaaaattgggaagggccaa
ttgggcaatgtcaagccccagcacaacaaagggtgcaactgcaggaggtcaggctgcctg
aagaattactgcgagtgctatgaggttagtgcccagctccctaactcactaacaaaaaac
acagggtggaagctcccggagggcagccgcgcctcttcctccctgagcaaagcccaggtg
ctctag