GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:14:44 Sequence gi568815575f:7743204_7943852 : 200649 bp : 38.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2406 2551 146 0 2 -13 43 252 0.352 9.61 1.02 Intr + 3239 3429 191 0 2 65 61 87 0.683 2.08 1.03 Intr + 10001 10100 100 1 1 97 38 91 0.683 3.76 1.04 Term + 15954 16000 47 1 2 69 38 65 0.165 -4.01 1.05 PlyA + 16375 16380 6 1.05 2.00 Prom + 17452 17491 40 -3.05 2.01 Init + 19841 20058 218 1 2 68 83 111 0.318 6.81 2.02 Intr + 40474 40591 118 1 1 81 58 36 0.007 -0.55 2.03 Intr + 45925 46083 159 0 0 46 91 74 0.152 2.76 2.04 Intr + 55955 56018 64 1 1 78 95 75 0.956 4.57 2.05 Term + 58713 58936 224 1 2 8 50 190 0.959 3.30 2.06 PlyA + 59154 59159 6 1.05 3.00 Prom + 62209 62248 40 -3.15 3.01 Sngl + 75263 75622 360 1 0 53 50 178 0.672 6.32 3.02 PlyA + 75881 75886 6 1.05 4.00 Prom + 83149 83188 40 -3.25 4.01 Init + 91416 91472 57 2 0 58 114 63 0.534 7.26 4.02 Term + 92927 93040 114 1 0 7 33 208 0.975 4.79 4.03 PlyA + 93687 93692 6 1.05 5.00 Prom + 94891 94930 40 -6.05 5.01 Sngl + 100001 100813 813 1 0 66 42 749 0.984 63.62 5.02 PlyA + 100915 100920 6 1.05 6.10 PlyA - 101749 101744 6 1.05 6.09 Term - 107303 107152 152 0 2 76 48 104 0.921 2.29 6.08 Intr - 110329 110104 226 1 1 82 116 82 0.880 7.04 6.07 Intr - 116464 116408 57 1 0 107 103 39 0.223 5.46 6.06 Intr - 134854 134814 41 2 2 86 115 3 0.135 -0.18 6.05 Intr - 136106 135916 191 1 2 63 77 46 0.017 -0.69 6.04 Intr - 147667 147604 64 2 1 110 89 33 0.188 2.46 6.03 Intr - 148939 148815 125 0 2 -9 69 187 0.289 6.41 6.02 Intr - 152570 152512 59 2 2 99 79 83 0.455 5.26 6.01 Init - 153872 153849 24 2 0 77 94 74 0.347 4.65 6.00 Prom - 156812 156773 40 -4.45 7.10 PlyA - 157231 157226 6 -0.45 7.09 Term - 157614 157483 132 0 0 80 38 93 0.897 0.61 7.08 Intr - 158938 158786 153 1 0 109 101 179 0.971 20.65 7.07 Intr - 168890 168825 66 2 0 102 115 63 0.900 8.58 7.06 Intr - 178645 178510 136 0 1 84 87 63 0.973 5.45 7.05 Intr - 178895 178801 95 2 2 96 69 130 0.997 9.74 7.04 Intr - 182888 182737 152 0 2 39 76 92 0.268 2.06 7.03 Intr - 183917 183780 138 0 0 101 0 104 0.226 2.41 7.02 Intr - 196099 195882 218 0 2 96 -10 121 0.098 0.32 7.01 Init - 198923 198877 47 2 2 73 99 48 0.348 4.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 97466 97591 126 1 0 92 44 104 0.855 3.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_1|161_aa XKKKEEEEDKGMENDKKEDKEKEGKEKEEEEEREERKGDGEENELLKLCGRAFQKLYGPD IEKWLSRERMLQLAFEHDHIFLQRCRATLKAIDISTLNHSEKRFLLNHELRLRPKKLFPS VTSNCLMTKFNDLASILNMYNGITATVVTCKAPYTQVAVNE >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_1|486_bp nngaagaagaaggaggaggaggaagataaggggatggagaacgacaagaaggaggacaag gagaaagaagggaaggagaaggaggaggaggaggagagagaagagaggaagggagatggg gaggagaatgaactactaaagctatgtggcagagcgtttcaaaagctttatggtccagat attgagaaatggctatcaagagagagaatgttgcaattggcttttgagcatgaccacata tttttacaaagatgtagagctaccctgaaggcaatagatatatccactctcaatcattct gagaagagattcctcctgaaccatgaattaagactcaggcccaagaaactcttcccaagt gtcaccagcaactgcctgatgactaaattcaatgatctcgcctcaattctcaacatgtat aatggaataacagcgacagttgtgacttgcaaagccccttacacgcaggtggcagtcaat gaataa >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_2|260_aa MREKKTDAEWRPFRYNNCRQADGRCSCPLREDGHRHSDLREGNFKNMQAQISLIHIGLGN ISVAWWLSISAVRHCQIYKNILEMNDSRKTWDIEFAGFIFLRIVPSFGNYIMPQILHSNT KWTKTDGLMRWKRSSPLGFEITGQPLRRKTDVSVVPKTMPTFNDLIPELCRVYRSHEYVA IAMNNTAKVEKHWLRLSTKLGYRSAVQVCLGLVGGLSGGKNWEEEINRDEVMGEGDPESE EASMREKLKPCMDFREQKGF >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_2|783_bp atgagggaaaagaagacagacgcagaatggagaccattcagatacaacaattgcagacag gcagacggacgatgctcatgtccactaagagaagatggtcatagacacagcgacctccga gaaggaaactttaagaatatgcaggcacaaattagcttaattcacatagggttagggaat atttctgtagcctggtggttaagcatttcagctgtgaggcactgtcaaatctacaaaaac atcctggaaatgaatgattctaggaaaacatgggatatcgaatttgcaggtttcattttc ttaagaattgtgccttcattcgggaattacataatgcctcagatacttcatagcaacaca aaatggactaagacagatggcttgatgaggtggaaaagatcttccccactaggctttgag ataacagggcagcctcttagaaggaagacggatgtgtcagtggtccccaagaccatgccc acattcaatgatttgatacctgaattatgtcgtgtctacagatcccatgagtatgtagcc atagcaatgaacaacacagccaaggttgagaagcactggcttaggctgagcacaaaattg ggttatcggagtgctgtgcaggtgtgcttgggacttgtgggtggactctccggaggcaag aattgggaggaagaaattaatagagatgaggtaatgggtgaaggagatccagagtctgag gaagcatctatgagggaaaagttaaaaccctgcatggactttagagaacaaaaagggttt tga >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_3|119_aa MIQAECEQRGTRDVCNWRVLAHCKKEEKEGGEGEEEEKEEKKKKEEEKRKGGREEEGEGG KRRRGGKRREGVGGREEEEGEGGKRKSEEEREKGKEKEKEKEKKKEKEKGKEGKEKGKE >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_3|360_bp atgattcaggctgagtgtgaacagcggggaactagggatgtctgcaattggagagtgctt gcccactgcaagaaggaggagaaggagggcggggagggggaagaggaggaaaaggaggag aagaagaagaaggaggaggagaaaaggaagggagggagggaagaggagggggagggaggg aagaggaggaggggagggaagaggagagaaggagtaggagggagggaagaggaggaaggg gagggaggaaagaggaagagtgaggaggagagggagaagggaaaagagaaggaaaaggag aaggagaaaaagaaggaaaaggagaaggggaaagaaggaaaggagaaggggaaggagtag >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_4|56_aa MTLDEMSLYEEKERVTGQRKLLSDDGDEDDDDDDDDDNVLIMFCVQRQGKLLGDCH >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_4|171_bp atgacattggatgagatgagcctctatgaagaaaaggagagagtaacaggccaacggaaa ctcttatctgatgatggtgatgaagatgatgatgatgatgatgatgatgataatgtacta attatgttctgcgttcagagacaggggaagcttttgggtgactgtcattaa >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_5|270_aa MSPKPRASGPPAKATEAGKRKSSSQPSPSDPKKKVSDPPKLLLVFPSPPSSQEASPVLTW HNPPTRPPPLLRTRPCSQPPPSSSLNRSPSVISLLSFQTTKVAKKGKAVRRGRRGKKGAA TKMAAVTAPEAESAPAAPGPSDQPSQELPQHELPPEEPVSEGTQHDPLSQEAELEEPLSQ ESEVEEPLSQESQVEEPLSQESEVEEPLSQESQVEEPLSQESEVEEPLSQESQVEEPLSQ ESEMEEPLSQESQVEEPPSQESEMEELPSV >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_5|813_bp atgagtccaaagccgagagcctcgggacctccggccaaggccacggaggcaggaaagagg aagtcctcctctcagccgagccccagtgacccgaagaagaaggtgagtgaccctcccaag ctcctcctcgtcttcccctcgcctccttcctcacaagaagcctctcctgtcctcacttgg cacaaccccccaacccggcccccaccgcttctgaggacacgtccctgttcccagcctcct ccatcctcgtccctaaaccggagcccttctgtgatctccctgttgtccttccagactacc aaggtggccaagaagggaaaagcagttcgtagagggagacgcgggaagaaaggggctgcg acaaagatggcggccgtgacggcacctgaggcggagagcgcgccagcggcacccggcccc agcgaccagcccagccaggagctccctcagcacgagctgccgccggaggagccagtgagc gaggggacccagcacgaccccctgagtcaggaggccgagctggaggaaccactgagtcag gagagcgaggtggaagaaccactgagtcaggagagccaggtggaggaaccactgagtcag gagagcgaggtggaagaaccactgagtcaggagagccaggtggaggaaccactgagtcag gagagcgaggtggaggaaccactgagtcaggagagccaggtggaggaaccactgagtcag gagagcgagatggaagaaccactgagtcaggagagccaggtggaggaaccaccgagtcag gagagcgagatggaagaactaccgagtgtgtag >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_6|312_aa MAKLMALQPSQCEDEDKNPYDPLSLSKQSKEHHDDIPPEQGPEPPYRGNILSTSRQAASH TAQTQVPPALIPSVYCPHLYVNEYPVFSSHFSNLSTVEWAALTLFAEDTRKKMINNNKSL RQCGLEKMDLLLCILNVGIVDISLKIIQLGALEKGPSHDMRLWELQFKVNSSIFTREVSE GKQWTLHVTISNECQWHLNQLSDHTEPSEQHLPGTHAFAEVQSRSSKFPTNTLPWSKNAL FSKVVFIRVHLCKSTVLRVGAKATLAADWMVPTQTEGWSASPSALTQMLISFVNTLTDTP GNNTLHPSTQPS >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_6|939_bp atggccaagctcatggccctgcagcccagtcaatgtgaagacgaggacaaaaacccttat gatccactttcgctaagtaaacaatccaaggaacaccatgatgacatcccgccagaacaa ggaccagaaccaccttatcgtgggaacatcttatcaacatcccgccaggcagcaagccat actgcccaaacccaagtacccccagccttaatccccagtgtctactgtccccatctttat gtcaatgagtacccagtatttagctcccacttctccaacctcagcactgtggagtgggct gctttgaccctatttgcagaagatacgaggaagaaaatgattaacaacaacaagtccttg agacagtgtggcctagagaagatggatttgcttttatgtatattaaatgtgggaattgta gacatttcattgaaaataatccaattgggagctttggagaaaggtccctcccatgacatg cggttgtgggagctacaattcaaggtaaatagctcaatattcaccagggaagtcagtgaa ggaaagcagtggaccctgcatgtgaccatctcaaatgaatgccaatggcacctcaaccag ctgtcagaccacacagagccgtcagaacagcatctgcctggcactcatgcctttgctgaa gtacaatcaagatcatctaagtttcctactaacacactaccttggagtaagaatgctctg ttttcaaaagtcgtattcattcgtgttcatttgtgtaaaagcacagttcttcgggtgggt gccaaagccacgctggcagctgactggatggtgcccacccagactgagggttggtctgcc tctcccagtgcactgactcaaatgttaatctcctttgtcaacaccctcacagacacacca gggaacaatactttacatccttcaacccaaccaagttga >gi568815575f:7743204_7943852|GENSCAN_predicted_peptide_7|378_aa MTQISNIKQPFAMQHRFFISLLHWTLQIVYPALFLGLCERGRNGRCDQVPEGSVLSLLVW VVPGLAFPEVAPGTCGTSYVPDAGLHGPEVESKLGLDPGSMTGTQPTRTPAQPAAAEPPA LTGPGRVHQEQAVLACGFLGIYHLGAASALCRHGKKLVKDVKAFAGASAGSLVASVLLTA PEKIEECNQFTYKFAEEIRRQSFGAVTPGYDFMARLRSGMESILPPSAHELAQNRLHVSI TNAKTRENHLVSTFSSREDLIKVLLASSFVPIYAGLKLVEYKGQKWVDGGLTNALPILPV GRTVTISPFSGRLDISPQDKGQLDLYVNIAKQDIMLSLANLVRLNQALFPPSKRKMESLY QCGFDDTVKFLLKENWFE >gi568815575f:7743204_7943852|GENSCAN_predicted_CDS_7|1137_bp atgactcagataagtaacataaagcagccttttgccatgcagcacaggttcttcatttcc cttttgcactggaccctgcaaattgtgtatccagcactgttcctaggactttgtgaacgg ggaagaaatgggaggtgtgatcaagtaccagaaggctctgtgctgtctctgctggtctgg gtggttcctggactggcgttccctgaggttgcacctggcacgtgtggcaccagctatgtt cctgatgctggactccatggtcctgaagtggaatcaaaactggggcttgaccccggatcg atgactgggactcagcctacgcggacgccagcgcagcctgcggcggccgagccacctgcc ctcacagggcctggccgggtgcatcaggaacaggcggtcttagcgtgtggatttctgggc atttaccacttgggggcagcatctgcactttgcagacatggcaaaaaacttgtgaaggat gtcaaagccttcgctggggcgtctgcgggatcgttggttgcttctgttctgctaacagca ccagaaaaaatagaggaatgtaaccaatttacctacaagtttgccgaagaaatcagaagg cagtctttcggggcagtaacgcccggttatgacttcatggcccgactaagaagtgggatg gagtcgattcttcctcccagcgctcacgagctggcccagaaccgactgcacgtatccatc accaacgccaaaaccagagaaaatcacttagtctccactttttcctccagggaggacctc attaaggtcctcctagccagcagttttgtgcccatttatgcaggactgaagctagtggaa tacaaagggcagaagtgggtggacggaggcctcaccaacgctcttcccatcctgcccgtc ggccggacagtaaccatctcccccttcagtggacgactggacatctccccgcaggacaaa gggcagctagatctgtatgttaatatcgccaagcaggatatcatgttgtccctggcaaac ctggtgagactcaaccaagccctttttcccccaagcaagaggaaaatggaatctttgtat cagtgtggttttgatgacactgttaagtttttacttaaagaaaattggtttgaataa