GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:17:42 Sequence gi568815581r:38081977_38291029 : 209053 bp : 44.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 19427 19504 78 0 0 88 110 55 0.971 7.22 1.02 Intr + 29425 29546 122 2 2 80 64 53 0.762 2.31 1.03 Intr + 35564 35764 201 1 0 92 84 171 0.980 16.58 1.04 Intr + 40824 40988 165 2 0 110 55 43 0.878 3.36 1.05 Intr + 42063 42167 105 2 0 73 93 122 0.984 11.61 1.06 Intr + 47214 47488 275 2 2 88 58 43 0.065 -2.36 1.07 Intr + 47519 47591 73 2 1 90 82 60 0.932 5.01 1.08 Intr + 47754 47839 86 2 2 81 79 118 0.944 8.82 1.09 Intr + 49123 49162 40 1 1 80 116 -10 0.946 -0.77 1.10 Intr + 49645 49725 81 0 0 52 92 42 0.606 0.83 1.11 Intr + 50169 50276 108 2 0 102 101 154 0.993 18.58 1.12 Intr + 50606 50715 110 1 2 102 82 163 0.997 16.18 1.13 Intr + 51971 52019 49 2 1 97 117 85 0.987 10.98 1.14 Intr + 52563 52683 121 2 1 92 44 146 0.646 10.67 1.15 Intr + 52999 53051 53 2 2 100 38 77 0.409 2.53 1.16 Intr + 54405 54504 100 2 1 91 89 93 0.541 9.38 1.17 Intr + 54875 55027 153 0 0 85 80 88 0.496 7.74 1.18 Intr + 56009 56221 213 0 0 101 49 81 0.416 4.19 1.19 Term + 57937 58094 158 2 2 95 39 134 0.498 7.30 1.20 PlyA + 61386 61391 6 1.05 2.00 Prom + 79360 79399 40 -4.06 2.01 Init + 85215 85217 3 2 0 113 81 0 0.651 1.80 2.02 Term + 95617 95748 132 0 0 105 43 126 0.594 7.89 2.03 PlyA + 96687 96692 6 1.05 3.24 PlyA - 96963 96958 6 1.05 3.23 Term - 98638 98481 158 2 2 95 39 134 0.161 7.30 3.22 Intr - 100566 100354 213 1 0 101 49 81 0.134 4.19 3.21 Intr - 101700 101548 153 1 0 85 80 88 0.181 7.74 3.20 Intr - 102170 102071 100 2 1 91 89 78 0.197 7.88 3.19 Intr - 103576 103524 53 2 2 100 38 82 0.158 3.03 3.18 Intr - 104012 103892 121 2 1 92 110 146 0.994 17.27 3.17 Intr - 104604 104556 49 2 1 97 117 85 0.987 10.98 3.16 Intr - 105967 105858 110 1 2 102 82 146 0.997 14.48 3.15 Intr - 106404 106297 108 0 0 102 101 153 0.993 18.48 3.14 Intr - 106928 106848 81 2 0 52 92 42 0.607 0.83 3.13 Intr - 107450 107411 40 1 1 80 116 -10 0.947 -0.77 3.12 Intr - 108819 108734 86 0 2 81 79 118 0.945 8.82 3.11 Intr - 109054 108982 73 0 1 90 82 60 0.935 5.01 3.10 Intr - 109359 109085 275 0 2 88 58 49 0.097 -1.76 3.09 Intr - 114495 114391 105 0 0 73 93 122 0.984 11.61 3.08 Intr - 115734 115570 165 0 0 110 55 43 0.878 3.36 3.07 Intr - 121005 120805 201 0 0 92 84 171 0.980 16.58 3.06 Intr - 127140 127019 122 1 2 80 64 46 0.750 1.61 3.05 Intr - 137147 137070 78 0 0 88 110 55 0.971 7.22 3.04 Intr - 160309 160225 85 1 1 131 116 54 0.978 11.89 3.03 Intr - 175060 174922 139 0 1 105 92 270 0.134 29.57 3.02 Intr - 204858 204731 128 0 2 89 29 81 0.022 1.78 3.01 Init - 205030 204992 39 1 0 85 105 25 0.782 2.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18369 18417 49 2 1 83 58 48 0.913 0.41 S.002 Term - 103576 103419 158 2 2 100 48 133 0.806 8.60 S.003 Term - 173684 173595 90 2 0 111 42 47 0.824 0.12 S.004 Term - 204858 204727 132 0 0 89 47 86 0.924 2.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:38081977_38291029|GENSCAN_predicted_peptide_1|763_aa XIHATGFNYQNEDEKVTLSFPSTLQTGTGTLKIDFVGELNDKMKGFYRSKYTTPSGEVRY AAVTQFENVIDRKPYPDDENLVEVKFARTPVTSTYLVAFVVGEYDFVETRSKDGVCVCVY TPVGKAEQGKFALEEWWTHLWLNEGFASWIEYLCVDHCFPEYDIWTQFVSADYTRAQELD ALDNSHPIEVSVGHPSEVDEIFDAISYSKGASVIRMLHDYIGDKVETIQPAGAGVSPFHG IGGQQDSLSRSHCTQVLGKMPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIW EPGESGSGVLRSRRVRMDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNN NVDHLGIVHETELPPLTAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDQAYKGMPM NIRGPMWSVLLNTEEMKLKNPGRYQIMKEKGKKSSEHIQRIDRDVSGTLRKHIFFRDRYG TKQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAA RAPAAIGAHEWADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGP WARFCNRFVDTWARDEDTVLKHLRASMKKLTRKKGDLPPPAKPEQGSSASRPVPASRGGK TLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNA IVNARRRKLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:38081977_38291029|GENSCAN_predicted_CDS_1|2292_bp naaatacatgctacaggatttaactatcagaatgaagatgaaaaagtcaccttgtctttc cctagtactctgcaaacaggtacaggaaccttaaagatagattttgttggagagctgaat gacaaaatgaaaggtttctatagaagtaagtatactaccccttctggagaggtgcgctat gctgctgtaacacagtttgagaatgtaattgaccggaaaccataccctgatgatgaaaat ttagtggaagtgaagtttgcccgcacacctgttacatctacatatctggtggcatttgtt gtgggtgaatatgactttgtagaaacaaggtcaaaagatggtgtgtgtgtctgtgtttac actcctgttggcaaagcagaacaaggaaaatttgcattagaggaatggtggactcatctt tggttaaatgaaggttttgcatcctggattgaatatctgtgtgtagaccactgcttccca gagtatgatatttggactcagtttgtttctgctgattacacccgtgcccaggagcttgac gccttagataacagccatcctattgaagtcagtgtgggccatccatctgaggttgatgag atatttgatgctatatcatatagcaaaggtgcatctgtcatccgaatgctgcatgactac attggggataaggtggaaaccattcaacctgctggggccggtgtgtccccatttcatggc attgggggacaacaggattctctgtctaggtcccactgtactcaagtccttgggaagatg cccacccctgcttgggacttgagactccagagactggagcagctgtgggccactgggtct ggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatctgg gagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtggta gaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaaaag ggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaacaac aacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgggag gcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctgggagac tgggagaaatacaaaagcagcagaaagctcatagatcaagcgtacaagggaatgcccatg aacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaaaac cccggaagataccagatcatgaaggagaagggcaagaagtcatctgagcacatccagcgc atcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatacgga accaagcagcgggaactactccacatcctcctggcatatgaggagtacaacccggaggtg ggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggag gatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcc cgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctcggg ctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgctgatgccgata acaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccg tgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctc aagcatcttagggcctctatgaagaaactaacaagaaagaagggggacctgccaccccca gccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcgggaag accctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcggccc atttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggct gtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgcc attgttaatgcacggaggaggaagctgactgttagacctgggttttccagggttgcacgg cttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatgg aatgagctgtga >gi568815581r:38081977_38291029|GENSCAN_predicted_peptide_2|44_aa MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581r:38081977_38291029|GENSCAN_predicted_CDS_2|135_bp atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct ttggacctcagctaa >gi568815581r:38081977_38291029|GENSCAN_predicted_peptide_3|893_aa MSFCPFPILISLESLLGWNVPSVSETEQKFGKGGGTAVSRACVSADTLQLLQGKPGLGLA AMPEKRPFERLPADVSPINCSLCLKPDLLDFTFEGKLEAAAQVRQATNQIVMNCADIDII TASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTLKIDFVGELNDKMKGFYRSK YTTPSGEVPYAAVTQFENVIDRKPYPDDENLVEVKFARTPVTSTYLVAFVVGEYDFVETR SKDGVCVCVYTPVGKAEQGKFALEEWWTHLWLNEGFASWIEYLCVDHCFPEYDIWTQFVS ADYTRAQELDALDNSHPIEVSVGHPSEVDEIFDAISYSKGASVIRMLHDYIGDKVETIQP AGASVSPFHGIGGQQDSLSRSHCTQVLGKMPTPAWDLRLQRLEQLWATGSGPFFPGGGGG MGVTQPASIWEPGESGSGVLRSRRVRMDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDK GPKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRREISRKSKWVDMLGDWEKYKSSRKL IDRAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLR KHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLL ASERHSLQAARAPAAIGAHERADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQK RLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKKGDLPPPAKPEQGSSAS RPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGT QACRKAGVNAIVNARRRKLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:38081977_38291029|GENSCAN_predicted_CDS_3|2682_bp atgtcgttctgcccttttcccatcctgatttctctggagtcacttctggggtggaatgtg ccatctgtttctgaaactgagcagaagtttgggaaagggggagggacagccgtgtccagg gcttgcgtgtcagcagacacgttacagctgctgcagggcaaaccaggcctgggcctcgcc gcgatgccggagaagaggcccttcgagcggctgcctgccgatgtctcccccatcaactgc agcctttgcctcaagcccgacttgctggacttcaccttcgagggcaagctggaggccgcc gcccaggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattatt acagcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcag aatgaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacaggaacc ttaaagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaag tatactaccccttctggagaggtgccctatgctgctgtaacacagtttgagaatgtaatt gaccggaaaccataccctgatgatgaaaatttagtggaagtgaagtttgcccgcacacct gttacatctacatatctggtggcatttgttgtgggtgaatatgactttgtagaaacaagg tcaaaagatggtgtgtgtgtctgtgtttacactcctgttggcaaagcagaacaaggaaaa tttgcattagaggaatggtggactcatctttggttaaatgaaggttttgcatcctggatt gaatatctgtgtgtagaccactgcttcccagagtatgatatttggactcagtttgtttct gctgattacacccgtgcccaggagcttgacgccttagataacagccatcctattgaagtc agtgtgggccatccatctgaggttgatgagatatttgatgctatatcatatagcaaaggt gcatctgtcatccgaatgctgcatgactacattggggataaggtggaaaccattcaacct gctggggccagtgtgtccccatttcatggcattgggggacaacaggattctctgtctagg tcccactgtactcaagtccttgggaagatgcccacccctgcttgggacttgagactccag agactggagcagctgtgggccactgggtctggcccctttttccctgggggcggcggtgga atgggggttacgcagccagccagcatctgggagcccggcgagagcggttcaggtgttctc cgaagccgccgcgtacggatggacgtggtagaggtcgcgggcagttggtgggcacaagag cgagaggacatcattatgaaatacgaaaagggacaccgagctgggctgccagaggacaag gggcctaagccttttcgaagctacaacaacaacgtcgatcatttggggattgtacatgag acggagctgcctcctctgactgcgcgggaggcgaagcaaattcggcgggagatcagccga aagagcaagtgggtggatatgctgggagactgggagaaatacaaaagcagcagaaagctc atagatcgagcgtacaagggaatgcccatgaacatccggggcccgatgtggtcagtcctc ctgaacactgaggaaatgaagttgaaaaaccccggaagataccagatcatgaaggagaag ggcaagaggtcatctgagcacatccagcgcatcgaccgggacgtaagcgggacattaagg aagcatatattcttcagggatcgatacggaaccaagcagcgggaactactccacatcctc ctggcatatgaggagtacaacccggaggtgggctactgcagggacctgagccacatcgcc gccttgttcctcctctatcttcctgaggaggatgcattctgggcactggtgcagctgctg gccagtgagaggcactccctgcaggctgcccgggctcctgctgccatcggtgcccacgaa cgggccgaccaagcccagatctctctcgggctcaccctgcgcctgtgggacgtgtatctg gtagaaggcgaacaggcgttgatgccgataacaagaatcgcctttaaggttcagcagaag cgcctcacgaagacgtccaggtgtggcccgtgggcacgtttttgcaaccggttcgttgat acctgggccagggatgaggacactgtgctcaagcatcttagggcctctatgaagaaacta acaagaaagaagggggacctgccacccccagccaaacccgagcaagggtcgtcggcatcc aggcctgtgccggcttcacgtggcgggaagaccctctgcaagggggacaggcaggcccct ccaggcccaccagcccggttcccgcggcccatttggtcagcttccccgccacgggcacct cgttcttccacaccctgtcctggtggggctgtccgggaagacacctaccctgtgggcact caggcgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaagctgact gttagacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggac agggcacaggccagtgtaatgccaggatggaatgagctgtga