GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:42:24 Sequence gi568815590f:26478255_26755703 : 277449 bp : 44.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5260 5412 153 0 0 64 -9 188 0.267 6.97 1.02 Intr + 6767 6908 142 1 1 100 80 34 0.212 3.73 1.03 Term + 23790 23884 95 1 2 56 48 78 0.018 -1.41 1.04 PlyA + 25732 25737 6 1.05 2.02 PlyA - 26453 26448 6 1.05 2.01 Sngl - 30501 29407 1095 0 0 88 48 838 0.981 76.80 2.00 Prom - 31464 31425 40 -6.86 3.00 Prom + 33530 33569 40 -8.76 3.01 Init + 36072 36429 358 2 1 99 50 544 0.342 48.97 3.02 Term + 69661 69728 68 2 2 70 54 100 0.262 2.90 3.03 PlyA + 70170 70175 6 1.05 4.00 Prom + 75333 75372 40 -3.26 4.01 Init + 79567 79735 169 0 1 73 81 138 0.398 11.30 4.02 Intr + 83524 83660 137 2 2 38 51 94 0.094 0.99 4.03 Intr + 98532 98693 162 2 0 132 64 9 0.537 3.07 4.04 Intr + 99086 99310 225 1 0 87 34 120 0.465 4.68 4.05 Intr + 103715 103803 89 1 2 105 108 96 0.838 12.07 4.06 Intr + 105545 105729 185 2 2 50 52 221 0.114 14.13 4.07 Intr + 115899 115962 64 1 1 87 87 52 0.005 2.78 4.08 Term + 117360 117444 85 0 1 96 49 52 0.008 -0.77 4.09 PlyA + 120412 120417 6 1.05 5.00 Prom + 123584 123623 40 -2.36 5.01 Init + 138285 138375 91 2 1 51 93 80 0.459 5.45 5.02 Intr + 145889 146053 165 0 0 115 90 150 0.990 17.83 5.03 Intr + 148363 148424 62 2 2 56 110 53 0.971 2.75 5.04 Intr + 148961 149083 123 1 0 116 76 148 0.689 17.18 5.05 Intr + 149618 149686 69 1 0 80 53 80 0.872 3.08 5.06 Intr + 156526 156646 121 0 1 123 96 247 0.999 29.07 5.07 Intr + 157610 157678 69 0 0 116 86 43 0.957 6.05 5.08 Intr + 165185 165341 157 0 1 91 105 137 0.999 14.77 5.09 Intr + 165696 165837 142 0 1 80 79 223 0.999 20.96 5.10 Intr + 169376 169546 171 1 0 119 57 317 0.983 31.84 5.11 Intr + 174003 174182 180 2 0 47 96 317 0.989 28.46 5.12 Intr + 174978 175143 166 2 1 89 58 115 0.997 8.13 5.13 Term + 177361 177452 92 2 2 160 43 134 0.983 13.98 5.14 PlyA + 179623 179628 6 1.05 6.11 PlyA - 179856 179851 6 1.05 6.10 Term - 180112 180049 64 0 1 84 48 34 0.118 -3.64 6.09 Intr - 181288 181236 53 1 2 81 110 34 0.159 2.61 6.08 Intr - 184450 184275 176 2 2 71 72 63 0.050 2.66 6.07 Intr - 205446 205326 121 0 1 36 4 156 0.060 2.07 6.06 Intr - 216487 216387 101 2 2 92 101 7 0.411 2.23 6.05 Intr - 216988 216869 120 2 0 26 87 87 0.438 2.87 6.04 Intr - 218116 218036 81 2 0 94 49 47 0.047 1.01 6.03 Intr - 223490 223457 34 2 1 86 82 16 0.029 -1.40 6.02 Intr - 238697 238603 95 0 2 67 93 42 0.043 2.28 6.01 Init - 255593 255539 55 2 1 91 20 92 0.282 4.15 6.00 Prom - 259977 259938 40 -0.56 7.00 Prom + 260349 260388 40 -4.86 7.01 Init + 267160 267180 21 0 0 82 76 25 0.362 0.42 7.02 Intr + 270593 270688 96 1 0 57 109 130 0.494 12.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 105545 105764 220 2 1 50 49 218 0.885 10.61 S.002 Intr - 110280 110099 182 0 2 44 110 107 0.939 7.17 S.003 Intr - 118703 118667 37 1 1 98 98 2 0.832 0.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_1|129_aa MEMDVELALEANTATLVALAQTEQPMRSGGMAGVAIVSLSTEAAAFVSPVLPCEDVPASS LPFTMIVSFLRSPQPCFLYNLQNCEPIKSLFFINYPVSVGGGHFGKKPQLLLLLLKAVQT INGQLQLLS >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_1|390_bp atggaaatggatgtggagctggcactggaagcaaacacagcaactctagtggctctggcc cagactgaacagccgatgagaagtggggggatggctggagtggccatcgtgtccctgtcc acggaagcagcagcatttgtgtctcctgtcctgccatgtgaagatgtgcctgcttcctct ttgcctttcaccatgattgtaagtttcctgaggtctccccagccatgcttcctgtacaac ctgcagaactgcgagccaattaaatctcttttctttataaattacccagtctcagtggga ggagggcactttgggaaaaagcctcagctgctgcttctgctgcttaaagcagtccagacg attaatgggcagctccagctgctatcctga >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_2|364_aa MALALLEDWCRIMSVDEQKSLMVTGIPADFEEAEIQEVLQETLKSLGRYRLLGKIFRKQE NANAVLLELLEDTDVSAIPSEVQGKGGVWKVIFKTPNQDTEFLERLNLFLEKEGQTVSGM FRALGQEGVSPATVPCISPELLAHLLGQAMAHAPQPLLPMRYRKLRVFSGSAVPAPEEES FEVWLEQATEIVKEWPVTEAEKKRWLAESLRGPALDLMHIVQADNPSISVEECLEAFKQV FGSLESRRTAQVRYLKTYQEEGEKVSAYVLRLETLLRRAVEKRAIPRRIADQVRLEQVMA GATLNQMLWCRLRELKDQGPPPSFLELMKVIREEEEEEASFENESIEEPEERDGYGRWNH EGDD >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_2|1095_bp atggcgctggcactgttagaggactggtgcaggataatgagtgtggatgagcagaagtca ctgatggttacggggataccggcggactttgaggaggctgagattcaggaggtccttcag gagactttaaagtctctgggcaggtatagactgcttggcaagatattccggaagcaggag aatgccaatgctgtcttactagagcttctggaagatactgatgtctcggccattcccagt gaggtccagggaaaggggggtgtctggaaggtgatctttaagacccctaatcaggacact gagtttcttgaaagattgaacctgtttctagaaaaagaggggcagacggtctcgggtatg tttcgagccctggggcaggagggcgtgtctccagccacagtgccctgcatctcaccagaa ttactggcccatttgttgggacaggcaatggcacatgcgcctcagcccctgctacccatg agataccggaaactgcgagtattctcagggagtgctgtcccagccccagaggaagagtcc tttgaggtctggttggaacaggccacggagatagtcaaagagtggccagtaacagaggca gaaaagaaaaggtggctggcggaaagcctgcggggccctgccctggacctcatgcacata gtgcaggcagacaacccgtccatcagtgtagaagagtgtttggaggcctttaagcaagtg tttgggagcctagagagccgcaggacagcccaggtgaggtatctgaagacctatcaggag gaaggagagaaggtctcagcctatgtgttacggctagaaaccctgctccggagagcggtg gagaaacgcgccatccctcggcgtattgcggaccaggtccgcctggagcaggtcatggct ggggccactcttaaccagatgctgtggtgccggcttagggagctgaaggatcagggcccg ccccccagcttccttgagctaatgaaggtaatacgggaagaagaggaggaagaggcctcc tttgagaatgagagtatcgaagagccagaggaacgagatggctatggccgctggaatcat gagggagacgactga >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_3|141_aa MAERKQSGKAAEDEEVPAFFKNLGSGSPKPRQKFCGMFCPVEGSSENKTIDFDSLSVGRG SGQVVAQQRDVAHLGPDPQPPYSRQGRRAGGEPSVESGRKVEIRRASGKEALQNINDQVD PLSKDDELETDKQEEKEVSLA >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_3|426_bp atggccgagagaaagcaatccgggaaggcggcagaggacgaagaggtccctgcttttttt aaaaacctgggctccggcagccccaagccccggcagaaattctgtggcatgttctgcccg gtggaagggtcctcggagaacaagaccatcgacttcgactcgctgtcggtgggccggggc tcggggcaggtggtggctcagcagcgggacgtcgcccacttgggcccggacccgcagccg ccgtactcgcggcagggccggcgcgccggcggagagccatctgttgaatcgggccggaag gtggagatccggagggcctcgggcaaagaagccctgcagaacatcaacgaccaggtcgac cctctatccaaggatgatgagttggaaacagataagcaggaggagaaagaagtctccctg gcatga >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_4|371_aa MRYHYTPVRMANIHPGFSCVVASHFILTSNGEFLSLYILTPNACEDVEQQELSFIAVHSR VRAPMKIYWLADLTGGGAQAVMLAHLLLTSYCEAQFLIGHGRLISSSPPHGAVWVPSTGT VNPQDPGALGSQQLPPTALQAPAKCAAPPRTPQGAESSAAPGRGPGEFSASNGGGGGPGS PPGLRGAGRRCSRALAASEAIGSAQSPHAGLPRRPPRIGAQAGRDRGARRRSDRLLIKGG KIVNDDQSFYADIYMEDGLIKQIGENLIVPGGVKTIEAHSRMVIPGGIDVHTRFQMPDQG MTSADDFFQGTKAALAGGTTMINGYFHMCKVETINSYVLRLWKNLLSFQAGLQGRAAGSI GDKFFPSGCFG >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_4|1116_bp atgagataccactacacacctgttaggatggccaacattcatcctggattttcatgtgta gtggcatctcatttcatcctcaccagcaatggagagttcctgtcgctctacatcctaaca ccaaatgcttgtgaggatgtggagcaacaggaactctcattcattgctgttcacagtagg gttcgagctcctatgaagatctactggctggctgatctgacaggaggtggagctcaggcg gtaatgctggctcacttgctgctcacctcctactgtgaggcccagttcctaataggccat ggacggctcatctcctcctctccgccccacggtgcggtgtgggtcccatccactggcact gtaaaccctcaggaccccggggcgctgggatcgcagcagctgcccccgacagcgctgcag gcaccagcgaaatgcgctgcgcccccacgcacccctcagggagccgagtcctcagccgcg ccagggcgggggccaggcgagttcagcgccagcaacggcgggggaggagggcccgggagc cctcccgggctgcgcggcgccggccggcggtgcagtcgcgcgctcgccgccagcgaagcc attggctcggcgcagtcaccccacgcggggctgccgcggcggcctccgcggattggcgcg caggcgggcagggaccggggcgcgcgcaggcgaagcgatcgtcttctgatcaaaggaggt aaaattgttaatgatgaccagtcgttctatgcagacatatacatggaagatgggttgatc aagcaaataggagaaaatctgattgtgccaggaggagtgaagaccatcgaggcccactcc cggatggtgatccccggaggaattgacgtccacactcgtttccagatgcctgatcaggga atgacgtctgctgatgatttcttccaaggaaccaaggcggccctggctgggggaaccact atgatcaatggctatttccacatgtgtaaagtggagacaatcaattcttatgtcttacga ctatggaaaaatttgctgagcttccaggcagggctgcagggtcgtgctgctggaagtatt ggtgacaaattcttccccagtgggtgtttcggctaa >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_5|535_aa MVLENEAVTIWNKILEMGSADRVNIQGAGEVDHVVPEPGTSLLAAFDQWREWADSKSCCD YSLHVDISEWHKGIQEEMEALVKDHGVNSFLVYMAFKDRFQLTDCQIYEVLSVIRDIGAI AQVHAENGDIIAEVQGFLFRHFFITWREQQRILDLGITGPEGHVLSRPEEVEAEAVNRAI TIANQTNCPLYITKVMSKSSAEVIAQARKKGREPFDLYGLVSTSLHHAIHMDQGTVVYGE PITASLGTDGSHYWSKNWAKAAAFVTSPPLSPDPTTPDFLNSLLSCGDLQVTGSAHCTFN TAQKAVGKDNFTLIPEGTNGTEERMSVIWDKAVVTGKMDENQFVAVTSTNAAKVFNLYPR KGRIAVGSDADLVIWDPDSVKTISAKTHNSSLEYNIFEGMECRGSPLVVISQGKIVLEDG TLHVTEGSGRYIPRKPFPDFVYKRIKARSRLAELRGVPRGLYDGPVCEVSVTPKTVTPAS SAKTSPAKQQAPPVRNLHQSGFSLSGAQIDDNIPRRTTQRIVAPPGGRANITSLG >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_5|1608_bp atggtgctggagaatgaggccgtgactatttggaataagatcctggagatgggcagtgct gatagagtaaatatccagggagcaggggaagttgaccacgttgttcctgagcctgggaca agcctgctcgctgcctttgaccagtggagggaatgggccgacagcaagtcctgctgtgac tactctctgcatgtggacatcagtgagtggcataagggcatccaggaggagatggaagcg cttgtgaaggatcacggggtaaattccttcctcgtgtacatggctttcaaagatcgcttc cagctaacggattgccagatttatgaagtactgagtgtgatccgggatattggcgccata gcccaagtccacgcagaaaatggcgacatcattgcagaggtacagggctttctttttcgt catttcttcatcacctggagggagcagcagaggatcctggatctgggcatcacgggcccc gagggacatgtgctgagccgacctgaggaggtcgaggccgaagccgtgaatcgtgccatc accatcgccaaccagaccaactgcccgctgtatatcaccaaggtgatgagcaaaagctct gctgaggtcatcgcccaggcacggaagaagggaagggagccatttgacctctatggcttg gtttctacttccctgcatcatgcaatccacatggatcaaggaactgtggtgtatggcgag cccatcactgccagcttgggaacggacggctcccattactggagcaagaactgggccaag gctgctgcctttgtcacctccccacccttgagccctgatccaaccactccagactttctc aactccttgctgtcctgtggagacctccaggtcacgggcagtgcccattgcacgtttaac actgcccagaaggctgtaggaaaggacaacttcaccctgattccggagggcaccaatggc actgaggagcggatgtccgtcatctgggacaaggctgtggtcactgggaagatggatgag aaccagtttgtggctgtgaccagcaccaatgcagccaaagtcttcaacctttacccccgg aaaggccgcattgctgtgggatccgatgccgacctggtcatctgggaccccgacagcgtt aaaaccatctctgccaagacacacaacagctctctcgagtacaacatctttgaaggcatg gagtgccgcggctccccactggtggtcatcagccaggggaagattgtcctggaggacggc accctgcatgtcaccgaaggctctggacgctacattccccggaagcccttccctgatttt gtttacaagcgtatcaaggcaaggagcaggctggctgagctgagaggggttcctcgtggc ctgtatgacggacctgtgtgtgaagtgtctgtgacgcccaagacagtcactccagcctcc tcggccaagacgtctcctgccaagcagcaggccccacctgtccggaacctgcaccagtct ggattcagtttgtctggtgctcagattgatgacaacattccccgccgcaccacccagcgt atcgtggcgccccccggtggccgtgccaacatcaccagcctgggctag >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_6|299_aa MGVGYIKKEEAAKYDNVEVGFHKPHDEIFAIDLFNKEKISNWTQCLLRFKLAGHVLSHQD PAAAPDANTFLEQISTTHCIWCVAPNVPALRNMGTGLLLIVAQSGVPQKTEGGTSISTSV VLLEEVAKVFSKLHEKNKVKKKVTLYYKVGFVLDGFAQLQANSSKCEVTAFLQVYPTARK RQRPSRTGHDDDGGFVKKKRGKLGRHGFKETEGTRERVEVPAMGSWWGAEDRLGGGLRPE NRTDKRLVVQDVKSGRSGREWTRFDWFTVLCHQHAVKGRLHLCLELESRGLDVAACDME >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_6|900_bp atgggggtgggctacattaagaaggaagaggcggcaaaatacgacaatgtggaggtagga tttcataagccccatgatgaaatatttgctattgacttattcaataaagagaagatatcc aactggactcaatgtttactaagatttaagttagcaggccacgtcctctcccaccaagat ccagctgctgctccagatgccaacactttcctggagcaaatcagcacaactcattgcatc tggtgtgtagcacccaatgtgccagctctgcgcaatatgggaactggactgctgcttatt gtagcccaatcaggagtgccccaaaagacagaaggaggaacttcaatcagtacatctgtt gtcctcttagaagaggtagccaaagtattcagtaagttacatgaaaaaaataaagttaaa aagaaagttacactttattataaagtaggctttgtgttagatggttttgcccagctgcag gctaattcttccaagtgtgaagtgacagcctttctgcaggtgtacccaacagctcggaag agacagcgaccatcgagaacgggccatgatgatgatggcggttttgtcaaaaagaaaagg gggaaattgggaagacatgggttcaaggaaactgagggaactcgcgagagagttgaagta cctgccatggggtcctggtggggtgcggaggacaggctaggtggaggtttacgaccagaa aaccgaacggacaagagactggtggtgcaggatgtcaaaagtgggcgcagtgggagggag tggactcgatttgactggttcacagtgctctgccatcagcatgcagtgaaaggcaggctg catctttgcttagaattagaatcccgtggcctggacgtagcagcctgtgacatggagtga >gi568815590f:26478255_26755703|GENSCAN_predicted_peptide_7|39_aa MRFLNVKLAFGQQDDLANVLAPVTYGLNQRGPLKLSPQQ >gi568815590f:26478255_26755703|GENSCAN_predicted_CDS_7|117_bp atgaggttcctgaatgtcaagcttgcatttggccaacaggatgacctggccaatgtcttg gcaccagtcacctacggactcaaccagagaggacctctgaagctgtcccctcagcag