GENSCAN 1.0    Date run: 7-Nov-116    Time: 22:54:36

Sequence gi568815580f:11751512_11952108 : 200597 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    923   1067  145  1  1   96   82   207 0.985  21.18
 1.02 Intr +   1342   1414   73  2  1   98   54    75 0.999   3.56
 1.03 Intr +   2117   2171   55  2  1   81   79   103 0.999   7.68
 1.04 Intr +   2315   2434  120  1  0   80  110   105 0.966  12.59
 1.05 Term +  21391  21438   48  0  0   69   43   104 0.058   1.30
 1.06 PlyA +  22114  22119    6                               1.05

 2.07 PlyA -  24370  24365    6                               1.05
 2.06 Term -  32879  32629  251  0  2   57   37   170 0.653   4.67
 2.05 Intr -  53985  52423 1563  1  0  -37  -36   637 0.089  27.94
 2.04 Intr -  55159  55029  131  0  2   72   43    92 0.628   3.44
 2.03 Intr -  59678  59564  115  0  1   -7  115   116 0.554   4.31
 2.02 Intr -  64605  64583   23  2  2   71  115   -10 0.036  -2.81
 2.01 Init -  70443  70334  110  0  2  113   85    47 0.107   4.70
 2.00 Prom -  80439  80400   40                              -4.26

 3.00 Prom +  94555  94594   40                              -5.56
 3.01 Init +  96391  96492  102  0  0   46   71    84 0.755   2.74
 3.02 Term +  99833 100600  768  1  0  110   49   836 0.879  75.31
 3.03 PlyA + 101165 101170    6                               1.05

 4.00 Prom + 104549 104588   40                              -6.86
 4.01 Init + 104807 104947  141  1  0   46  103    42 0.341   1.67
 4.02 Intr + 110834 110938  105  1  0   60   91    62 0.367   4.11
 4.03 Intr + 111130 111316  187  0  1   62   64    55 0.233  -0.24
 4.04 Intr + 117032 117152  121  0  1   71   61    71 0.010   2.25
 4.05 Intr + 120757 120887  131  1  2  116  107    88 0.999  13.84
 4.06 Intr + 125110 125177   68  2  2   77   99    82 0.913   6.82
 4.07 Term + 129478 129624  147  0  0   63   55   408 0.676  32.90
 4.08 PlyA + 130496 130501    6                              -0.45

 5.07 PlyA - 131993 131988    6                               1.05
 5.06 Term - 133116 132934  183  0  0  127   49    59 0.810   3.44
 5.05 Intr - 134287 134165  123  1  0   44   52    97 0.638   2.48
 5.04 Intr - 135110 134988  123  2  0   69   47    79 0.771   2.68
 5.03 Intr - 135514 135406  109  0  1  109   75   195 0.868  20.59
 5.02 Intr - 137979 137876  104  0  2  113   80    73 0.996   7.97
 5.01 Init - 142055 141957   99  2  0   50   75   148 0.698   8.33
 5.00 Prom - 147916 147877   40                              -3.96

 6.00 Prom + 148058 148097   40                              -3.56
 6.01 Init + 159358 159564  207  0  0   19   55   118 0.198   0.42
 6.02 Intr + 166302 166397   96  2  0   94   80    66 0.856   6.61
 6.03 Intr + 175850 175931   82  1  1    5  115    68 0.015   0.41
 6.04 Intr + 192888 192981   94  1  1   50   77    89 0.345   3.02
 6.05 Term + 196442 196607  166  2  1   90   49    61 0.277  -0.21
 6.06 PlyA + 197141 197146    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   3706   3744   39  0  0  106   38    20 0.819  -4.01
S.002 Init -  32930  32915   16  2  1   92  116    -1 0.927   3.64
S.003 Init + 117076 117152   77  0  2   83   61    91 0.984   6.46

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_1|146_aa
MGCLGGNSKTTEDQGVDEKERREANKKIEKQLQKERLAYKATHRLLLLGAGESGKSTIVK
QMRILHVNGFNPEEKKQKILDIRKNVKDAIVTIVSAMSTIIPPVPLANPENQFRSDYIKS
IAPITDFEYSQSLTSLSCEDSIAIEW

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_1|441_bp
atggggtgtttgggcggcaacagcaagacgacggaagaccagggcgtcgatgaaaaagaa
cgacgcgaggccaacaaaaagatcgagaagcagttgcagaaagagcgcctggcttacaag
gctacccaccgcctgctgctcctgggggctggtgagtctgggaaaagcactatcgtcaaa
cagatgaggatcctgcacgtcaatgggtttaatcccgaggaaaagaaacagaaaattctg
gacatccggaaaaatgttaaagatgctatcgtgacaattgtttcagcaatgagtactata
atacctccagttccgctggccaaccctgaaaaccaatttcgatcagactacatcaagagc
atagcccctatcactgactttgaatattcccagtcccttacatccctgtcctgcgaagac
agcattgccatcgagtggtag

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_2|730_aa
MAWARSGHRATLDAGLASPEGGCALQGGGRPLCFSPWMGIAGFYGGSFGGFGCKEGTQEC
GVVAFGKCGKARISMVGQGSNTCLPQQEDVYVNIKVEGQLMGRNAGHTVSFFLRILFPLI
LTVPERDDEKLLNGYNVHYLGDGYTKNSDFTTTQYLHVPPKLHLYFTSTEYLHVPSKLHV
YLTTTQYLHVPFKLHLYFTTTQYLHVPLKLHLYFTTTQYLHVPLKLHLYFTTTQYLHVPL
KLHLYFTTTQYLHVPLKLHLYFTTQYLHVPLKLHLYFTTTQYLHVPFKLHLYFTTTQYLH
VPLKLHLYFTTTQYLRVPPKLHLYCTTTQYLRVPPKLHLYFTTTQYLRVPLKLHLYFTTT
QYLRVPLKLHLYFTTTQYLRVPLKPLLYFTTTQYLRVPSKPHLYFTTTQYLRVPSKPHLY
FTTTQYLRVPSKPHLYFTTTQYLRVPSKPHLYFTTTQYLRVPSKPHLYFTTTQYLRVPSK
PHLYFTTTQYLRVPSKPHLYFTTTQYLRVPSKPHLYFTTTQYLRVPSKPHLYFTTTQYLR
VPSKPHLYFTTTQYLRVPSKPHLYFTTTQYLRVPSKLHLYFTTTQYLRVPLKLLLYFTTT
QYLRVPNFTCTSPLHSISVFHSNCSCTSPLHSISVFHPNCTCTSPLHKNTVGCRNTCGHV
VGFIPHAEAMVDDKGDLEAVRLGVRPIRADTSPADLKTETQCSEVSRRATGSCQHWETKD
KYAGPHHEEA

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_2|2193_bp
atggcctgggcgcgttctggccaccgggccacactggatgccggcctggcctcgccagag
ggcggatgtgcgctgcagggaggcgggaggcccttgtgcttcagcccttggatgggaatt
gcagggttttatggaggctcctttggcggctttgggtgcaaggaaggcacccaggaatgt
ggcgttgtggcttttgggaaatgtggcaaggcccggatttcaatggtggggcaggggagc
aacacatgcctccctcaacaggaggacgtctatgtgaatatcaaagtggaaggacagtta
atgggccgcaatgctgggcacacagtttctttcttccttcgaattctttttcctctaata
ctcactgttcctgagagggatgatgaaaaattacttaatgggtacaacgtacattatttg
ggtgatggatatactaaaaactcagacttcaccactacacaatatctccatgttccaccc
aagctgcacctgtactttaccagtacagaatacctccatgttccatccaaattgcacgtg
tacctcaccactacacagtatctccatgttccattcaaattgcacctgtacttcaccact
acacaatatctccatgttccactcaaactgcacctgtacttcaccactacacagtatctc
catgttccactcaaattgcacctgtacttcaccactacacagtatctccatgttccactc
aaattgcacctgtacttcaccactacacaatatctccatgttccactcaaattgcacctg
tacttcactacacagtatctccatgttccactcaaattgcacctgtacttcaccactaca
cagtatctccatgttccattcaaactgcatctgtacttcaccactacacagtatctccat
gttccactcaaactgcacctgtacttcaccactacacagtatctccgtgttccacccaaa
ctgcacctgtactgcaccactacacagtatctccgtgttccacccaaactgcacctgtac
ttcaccactacacagtatctccgtgttccactcaaactgcacctgtacttcaccactaca
cagtatctccgtgttccactcaaactgcacctgtacttcaccactacacagtatctccgt
gtcccactcaaaccgctcctgtacttcaccactacacagtatctccgtgttccatccaaa
ccgcacctgtacttcaccactacacagtatctccgtgttccatccaaaccgcacctgtac
ttcaccactacacagtatctccgtgttccatccaaaccgcacctgtacttcaccactaca
cagtatctccgtgttccatccaaaccgcacctgtacttcaccactacacagtatctccgt
gttccatccaaaccgcacctgtacttcaccactacacagtatctccgtgttccatccaaa
ccgcacctgtacttcaccactacacagtatctccgtgttccatccaaaccgcacctgtac
ttcaccactacacagtatctccgtgttccatccaaaccgcacctgtacttcaccactaca
cagtatctccgtgttccatccaaaccgcacctgtacttcaccactacacagtatctccgt
gttccatccaaaccgcacctgtacttcaccactacacagtatctccgtgttccatccaaa
ccgcacctgtacttcaccactacacagtatctccgtgttccatccaaactgcacctgtac
ttcaccactacacagtatctccgtgttccactcaaactgctcctgtacttcaccactaca
cagtatctccgtgttccaaacttcacctgtacttcaccactacacagtatctccgtgttc
cactcaaactgctcctgtacttcaccactacacagtatctccgtgttccatccaaactgc
acctgtacttcaccactacataagaacacagttggctgccgcaacacctgtggccacgtg
gttggtttcattccccatgctgaggccatggtggatgacaagggagacttagaagcagtc
agattaggggtaaggcccattagggcggacacctcgcctgctgacctgaagacagaaaca
caatgcagtgaagtctccagaagagcaacaggctcttgtcagcactgggagaccaaagac
aagtacgctggtccccaccatgaggaagcatag

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_3|289_aa
MEYKEWKKLRARVDADSDTTVVRDNSSKSNSNDKAPPLALGSDVTTCAAHSRNRKWDQNK
GAAAGSGLTLPSLPSARFSAGPPTQRSRPTMSNMEKHLFNLKFAAKELSRSAKKCDKEEK
AEKAKIKKAIQKGNMEVARIHAENAIRQKNQAVNFLRMSARVDAVAARVQTAVTMGKVTK
SMAGVVKSMDATLKTMNLEKISALMDKFEHQFETLDVQTQQMEDTMSSTTTLTTPQNQVD
MLLQEMADEAGLDLNMELPQGQTGSVGTSVASAEQDELSQRLARLRDQV

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_3|870_bp
atggaatataaagaatggaaaaaacttagagccagggtagatgcagattcagataccacg
gtggtcagagataattcttccaaatccaactcgaatgacaaggctccgcctttggcgctg
ggctctgacgtcaccacctgcgccgctcacagtagaaacaggaagtgggaccaaaacaaa
ggagcggcggccgggagcggacttaccttaccttctctgccttcggcgcgcttctcagcc
gggccgccgacccaaaggagccgtccgactatgtctaacatggagaaacacctgttcaac
ctgaagttcgcggccaaagaactgagtaggagtgccaaaaaatgcgataaggaggaaaag
gccgaaaaggccaaaattaaaaaggccattcagaagggcaacatggaagttgcgaggata
cacgccgaaaatgccatccgccagaagaaccaggcggtgaatttcttgagaatgagtgcg
cgagtcgatgcagtggctgccagggtccagacggcggtgacgatgggcaaggtgaccaag
tcgatggctggtgtggttaagtcgatggatgcgacattgaagaccatgaatctggagaag
atttctgctttgatggacaaattcgagcaccagtttgagactctggacgtccagacgcag
caaatggaagacacgatgagcagcacgacgacgctcaccactccccagaaccaagtggat
atgctgctccaggaaatggcagatgaggcgggcctcgacctcaacatggagctgccgcag
ggccagaccggctccgtgggcacgagcgtggcttcggcggagcaggatgaactgtctcag
agactggcccgccttcgggatcaagtgtga

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_4|299_aa
MSVIVSVSGPGVVPPTVGSLHCIANVCLAWVVTPDSASSFLMTADRQGKVGREQASTLAG
FLCSFLERIDSVSLVDYTPTDQHLVIIKIASMQMSLSADKEYAVPFPPGAVLRKAKSLLP
GFINYAHSVCGPGPGGSHITVNFPDVTAIIYVAACSSYNMVIREDNNTNRLRESLDLFES
IWNNRWLRTISIILFLNKQDMLAEKVLAGKSKIEDYFPEYANYTVPEDATPDAGEDPKVT
RAKFFIRDLFLRISTATGDGKHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLKQYELL

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_4|900_bp
atgtctgtcatagtgtcagtctctgggcctggtgtggtgcctcccacagtgggctccttg
cactgcatagcaaacgtgtgcctggcctgggtggtcactcctgatagtgcctcgagcttt
ctgatgacagcagacaggcaggggaaagtgggcagagaacaggcctcaacccttgctggc
tttctttgcagcttcctggaaagaatcgacagcgtcagcttggttgactacacacccaca
gaccagcacttagtgattatcaaaattgccagcatgcaaatgagtttatctgcagataag
gaatatgccgttccttttcctcctggagctgtcctcaggaaagcgaagtctttacttcct
ggcttcataaactatgcccattctgtttgtggccctgggcctggggggtcccatatcaca
gtgaactttccagatgtcacagctatcatttacgtcgcagcctgcagtagctacaacatg
gtgattcgagaagataacaacaccaacaggctgagagagtccctggatctttttgaaagc
atctggaacaacaggtggttacggaccatttctatcatcttgttcttgaacaaacaagat
atgctggcagaaaaagtcttggcagggaaatcaaaaattgaagactatttcccagaatat
gcaaattatactgttcctgaagacgcaacaccagatgcaggagaagatcccaaagttaca
agagccaagttctttatccgggacctgtttttgaggatcagcacggccaccggtgacggc
aaacattactgctacccgcacttcacctgcgccgtggacacagagaacatccgcagggtg
ttcaacgactgccgcgacatcatccagcggatgcacctcaagcagtatgagctcttgtga

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_5|246_aa
MERAFQTALWLLQPEVVFILGDIFDEGKWSTPEAWADDVERFQKMFRHPSHVQLKVVAGN
HDIGFHYDFVMVNSVALNGDGCGICSETEAELIEVSHRLNCSREHYPLYRRSDANCSGED
AAPAEERDIPFKENYDVLSREASQKPRLVLSGHTHSACEVHHGGRVPELSVPSFSWRNRN
NPSFIMGSITPTDYTLSKCYLPREDVVLIIYCGVVGFLVVLTLTHFGLLASPFLSGLNLL
GKRKTR

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_5|741_bp
atggagagagcgttccagacagctctgtggttgctgcagccggaagtcgtcttcatcctg
ggggatatctttgatgaagggaagtggagcacccctgaggcctgggcggatgatgtggag
cggtttcagaaaatgttcagacacccaagtcatgtacagctgaaggtagttgctggaaac
catgacattggcttccattatgactttgtgatggtcaacagcgtggcgctgaacggggat
ggctgtggcatctgctctgaaacagaagcagagctcattgaagtttctcacagactgaac
tgctcccgagagcattatcctctgtatcggagaagtgatgctaactgttctggggaagac
gctgctcctgcagaggaaagggacatcccatttaaggagaactatgacgtgctttcacgg
gaggcatcacaaaagccgcgcctggttctcagtggccacacgcacagcgcctgcgaggtg
caccacgggggccgagtccccgagctcagcgtcccatctttcagttggaggaacagaaac
aaccccagtttcatcatgggtagcatcacgcccacagactacaccctctccaagtgctac
ctcccacgtgaggatgtggttttgatcatctactgtggagtggtgggcttccttgtggtc
ctcacactcactcactttgggcttctagcctcaccttttctttctggtttgaacttgctc
ggaaagcgtaagacaagatga

>gi568815580f:11751512_11952108|GENSCAN_predicted_peptide_6|214_aa
MRIGKRRNTGRKFREIQPDKDVAAGADSGDGGRGHKPRNATGGLSRWKRQGMNDPADAQV
SCHHLSFSPVLRRSNLPKEEHLRRNDIKDVQSYHQKNNEGQNSGKAVLTLTVYYRVQMNN
QMKKYPGQVFPSAVHINADGNLMKDSEQNHPAKSLPKRLRPPLMPAAGLDLARGPAPPPL
SLDSPGRRPRDRLGSAGKTLQPELEPGPTCPAPS

>gi568815580f:11751512_11952108|GENSCAN_predicted_CDS_6|645_bp
atgagaattggaaagagacgtaacactggaaggaagttcagagagattcaacctgacaag
gatgtggccgctggtgctgactccggagatggaggaaggggccacaagcccaggaatgcc
acaggtggcctctcaaggtggaaaaggcaaggaatgaatgatcctgcagatgctcaggtg
tcctgccatcacctgagttttagcccagtgttaaggagaagtaacctgcctaaggaggag
cacttgagaagaaatgacatcaaagatgtacaaagttatcatcagaaaaacaatgaaggc
cagaactcaggaaaagctgtgcttactcttaccgtttattatagagtacaaatgaacaac
cagatgaagaagtacccaggacaagtatttccatctgctgtccatatcaatgctgacggc
aacctcatgaaagactctgagcagaaccaccctgctaagtcactcccaaagagactgagg
ccccccctgatgccagctgcgggcctggacctagcccggggccctgcccctccgcccctc
tcgctggacagcccaggccggcgcccgagggataggctcgggagcgcggggaagacgctg
cagccggagctggagccagggcccacctgtccagcgccgtcctga