GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:21:46

Sequence gi568815584f:103235861_103441049 : 205189 bp : 45.60% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1926   2156  231  1  0   57    0   171 0.126   2.94
 1.02 Intr +   5860   5997  138  2  0   23   77   152 0.187   8.04
 1.03 Intr +   6775   6801   27  2  0  132   86    21 0.791   4.39
 1.04 Intr +  33849  34027  179  1  2  101   51    52 0.523   2.44
 1.05 Intr +  36677  36864  188  1  2   18   54   191 0.171   7.19
 1.06 Term +  43351  43942  592  1  1   68   48   724 0.553  60.40
 1.07 PlyA +  44992  44997    6                               1.05

 2.00 Prom +  46484  46523   40                              -3.26
 2.01 Sngl +  47347  47625  279  0  0    9   36   205 0.429   2.83
 2.02 PlyA +  48894  48899    6                               1.05

 3.04 PlyA -  49365  49360    6                               1.05
 3.03 Term -  51387  51125  263  1  2   96   48   116 0.453   4.09
 3.02 Intr -  55110  54982  129  1  0  111   41   103 0.808   8.57
 3.01 Init -  71565  71487   79  0  1   78   73    54 0.377   4.12
 3.00 Prom -  76002  75963   40                              -5.66

 4.00 Prom +  86384  86423   40                              -4.26
 4.01 Init +  87492  87530   39  2  0   88   93    19 0.094   2.58
 4.02 Intr +  97735  98074  340  0  1   26   37   209 0.055   4.65
 4.03 Intr +  98529  98737  209  1  2   79  113   145 0.187  14.90
 4.04 Intr + 100176 100257   82  2  1   68   86    95 0.999   6.51
 4.05 Intr + 100817 100989  173  0  2   57  111   149 0.997  13.76
 4.06 Intr + 101256 101367  112  2  1   57  105    -9 0.868  -2.35
 4.07 Intr + 102467 102612  146  0  2   90  101   106 0.915  12.10
 4.08 Intr + 102875 103033  159  1  0   78   93   183 0.993  17.98
 4.09 Intr + 103312 103473  162  0  0  133   65   104 0.993  12.77
 4.10 Intr + 103779 103943  165  2  0   34   81   152 0.968   9.26
 4.11 Intr + 104567 104701  135  1  0   45   82   197 0.997  15.56
 4.12 Term + 105103 105192   90  0  0   76   43   166 0.999   8.62
 4.13 PlyA + 105472 105477    6                               1.05

 5.00 Prom + 106303 106342   40                              -4.76
 5.01 Init + 107821 107861   41  0  2   68   78    32 0.157  -0.13
 5.02 Intr + 114192 114528  337  0  1   59   89   143 0.066   7.02
 5.03 Intr + 120167 120230   64  1  1   84  100    60 0.208   5.09
 5.04 Term + 123573 123583   11  1  2   92   43    -7 0.022  -6.44
 5.05 PlyA + 125870 125875    6                               1.05

 6.06 PlyA - 125928 125923    6                               1.05
 6.05 Term - 128638 128379  260  2  2   22   42   334 0.113  17.91
 6.04 Intr - 138804 138372  433  0  1  -29   30   382 0.161  13.92
 6.03 Intr - 139069 138838  232  0  1   33   19   221 0.367   7.48
 6.02 Intr - 142301 142188  114  1  0   89   94    57 0.370   5.96
 6.01 Init - 143068 142965  104  1  2   90   69    66 0.318   4.73
 6.00 Prom - 146552 146513   40                              -6.36

 7.05 PlyA - 147257 147252    6                               1.05
 7.04 Term - 150023 149505  519  2  0   61   49   265 0.373  14.20
 7.03 Intr - 150442 150230  213  1  0   20   32   196 0.377   6.11
 7.02 Intr - 169298 169251   48  2  0   40   78    75 0.029   0.58
 7.01 Init - 173880 173872    9  0  0   88   83    21 0.055   0.91
 7.00 Prom - 174311 174272   40                              -7.06

 8.02 PlyA - 174321 174316    6                               1.05
 8.01 Sngl - 176811 176248  564  0  0   74   48   411 0.999  31.85
 8.00 Prom - 183682 183643   40                              -3.76

 9.02 PlyA - 183894 183889    6                               1.05
 9.01 Sngl - 190807 190505  303  1  0   62   43   147 0.705   3.43
 9.00 Prom - 196424 196385   40                              -2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100072   72  1  0   78   85   130 0.919  12.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_1|451_aa
XPNQQPIWIPSRHLKPYHEPDDEEEILGGSRGPPGCSHVEADAEEDPNCHKQHLLNTATH
LGTDQEAVTDGRRKPEENKQVKGKIPAKRINERIEHIKPSMSRDSFLKRVKENDQKETEA
TEKDPGDNQLEPAALRLSPLRTAVHRPHPDINQTLTLPLPFAFSIQPYQPASPPDKRPPT
SEWFWPVYGGCTECRLQSLRGLREDPETRSPRIKAPSGEKDHPDIVNQCEAATYTSNPCL
SVSVRIFGYILYGTMAAGVCCWGAWSGQDHAGHAVRVRPLPEQCAAWAEERLREEIEANA
APALLEVVDAAGAEHLVTLQDLDIRHGDGSAVLPGVCSEASFRAVRPPSKRLRRPWEPGA
VPRVLVGTQADPDDGRQVLTARGREGRSPFPEVMAKSQRMVDRVFMQAVRETEALVPPEE
EVECWEVFCCPLDSIQKKMALVAASKRGEGG

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_1|1356_bp
ngaccgaatcaacagccgatttggataccatcaagacacctgaaaccttatcatgagcca
gatgatgaggaagagattctgggaggatcccgaggaccccctggttgcagccatgttgag
gctgacgctgaggaggaccccaactgtcacaagcaacacctgctgaacacagccacccac
ctggggacagaccaagaagctgtcacagatggcagaagaaaacctgaggaaaacaaacaa
gttaagggcaagattcctgccaagagaattaatgagcgtattgagcacattaagccctct
atgagccgagatagcttcctgaaacgcgtgaaggaaaatgatcagaaagagacagaagcc
acagagaaagatccaggagataatcaactagagccagctgcccttaggctttctcccctg
cggactgcggtccacagaccccaccccgacatcaaccagactctgacgctgccgcttcct
tttgcattttcaatacaaccttaccaaccagcatcccctcctgataagagaccaccgacc
tcagagtggttctggccagtctacggaggatgcacagagtgccggctccagagtcttcgg
ggtctcagagaagacccagaaacaaggtctccacgaattaaagcaccatcaggagagaaa
gaccatccggatatagtcaatcagtgcgaggcagccacatacacaagtaacccttgttta
agcgtcagtgtacgcattttcggttacatcttatacggcactatggctgccggggtgtgc
tgctggggagcgtggagtgggcaggaccacgctggacacgcagttcgcgtgcggccgctc
cccgagcagtgcgcggcgtgggcggaggagcggctccgcgaggagatcgaggcgaacgcg
gcgcccgcgttgctggaggtcgtggacgcggcgggcgccgagcacctggtcacgctccag
gacctggacatccgccacggcgacggctccgcggtgctcccgggcgtgtgcagcgaggcc
tcgttccgggccgtgaggccgccgagcaagcgcctgcgccggccgtgggaacccggggcc
gtcccgcgggtgctggtgggcacccaggccgacccggacgacgggcgccaggtgctgacg
gcgcggggccgcgaggggcgctccccgttcccggaggtcatggccaagagccagcggatg
gtggaccgggtgttcatgcaggcggtgcgcgagacggaggccttggtcccgcccgaggag
gaggtggagtgttgggaagttttctgttgtcccctggactccatccagaaaaagatggcc
ctggtggcagcctcgaagcgcggggagggaggctag

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_2|92_aa
MLNNAESNAELKGLGVDSLVIEHLQVNKAPKMCHQTSRAHGGINPYMSSPCRIEMILTAK
EQIVPKPEDCPQEKDIPEETKTYGTGINSALK

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_2|279_bp
atgcttaacaatgcagagagcaatgctgaactgaagggtttaggtgtagattctctggtc
attgagcacctccaagtgaacaaagcacctaagatgtgccaccagacctccagagctcat
ggtgggattaacccatacatgagctctccctgtcgcatcgaaatgatccttactgcgaag
gaacaaattgttcctaaaccagaagattgcccacaagaaaaagatatccctgaggaaaca
aaaacttatggcacaggaataaattcagcactaaaataa

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_3|156_aa
MALTLFNSVKAERGEGDAEEKLKATRDFCETLRSPGEALECGYLEPCVCGEPLPRSKHGL
ARASHWTVAGKRYHYHPRFPEEGETEAERGQSSRWRPHNGSRIGAGQPPQRRVLRRNTVC
SCPPPLPGYGLSPSAPCKLETSGKTTFAKIMTVRET

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_3|471_bp
atggccctcactctcttcaattctgtgaaggctgagagaggtgagggagatgcagaagaa
aagttgaaagccaccagagacttttgcgaaactcttcgaagtcctggagaggctctggag
tgtggctacttggagccctgtgtctgtggggagccgctgcctagatctaagcacggtctg
gctagagcttcccactggacagtggctggaaagcgctatcattatcaccctcgttttcct
gaggagggagaaactgaggctgagagaggccagtcatctcggtggaggccacataatgga
tccagaattggagctggacagccgccacaacgccgagtgttaaggaggaacacagtttgc
agctgccccccacccctgccaggctatggcttgtcaccatcagctccatgtaaactagag
actagtggtaaaaccacctttgcaaaaattatgacggtgagagaaacctga

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_4|603_aa
MALNSDCDFIAAKRPIHPESETEDVPGMKDPGLGLKVEKQLIPRGPESGVPGEREREMSA
RCGENAVGFLSPKVAAAKTRPRKQFRGRWLLRPGSRHDAAACPRPGEGGDQPPSPPQGPN
RQRTAPGRPAPALTGVDRGRRWDSPSPLLGGGTWPGRSSLRFASASSDSDSGLYRASLHP
SPGRAALGLCLYLTKTVEGKGNGIKTVIVNMVDVAKALNRPPTYPTKYFGCELGAQTQFD
VKNDRYIVNGSHEANKLQDMLDGFIKKFVLCPECENPETDLHVNPKKQTIGNSCKACGYR
GMLDTHHKLCTFILKNPPENSDSGTGKKEKEKKNRKGKDKENGSVSSSETPPPPPPPNEI
NPPPHTMEEEEDDDWGEDTTEEAQRRRMDEISDHAKVLTLSDDLERTIEERVNILFDFVK
KKKEEGVIDSSDKEIVAEAERLDVKAMGPLVLTEVLFNEKIREQIKKYRRHFLRFCHNNK
KAQRYLLHGLECVVAMHQAQLISKIPHILKEMYDADLLEEEVIISWSEKASKKYVSKELA
KEIRVKAEPFIKWLKEAEEESSGGEEEDEDENIEVVYSKAASVPKVETVKSDNKDDDIDI
DAI

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_4|1812_bp
atggctttaaattctgactgtgattttatcgcagcaaagcggccaatccaccccgagtcc
gaaacagaagacgtccctgggatgaaagacccaggcctggggcttaaggtggaaaaacaa
ctcattcctcgaggcccggagtcgggcgtcccgggagaacgggaaagggagatgagcgcc
cgctgcggggagaacgcggtcgggttcctctccccgaaggtggcagccgcaaaaacacgg
cctcgaaagcagtttcgaggacgatggctcctgcgtcccgggagtcgccatgacgccgcg
gcctgtcccagacccggcgagggaggggaccaaccaccatctccccctcaaggcccaaac
cgacaaagaacggcccccggtcgcccagctccggcgctgacgggtgtggaccgcggacgt
cgctgggacagcccctccccgctgctcggcggcggcacctggcccggccgctcctcgctg
cgcttcgcctccgcctcctcggactcggactcgggtttatatcgcgcctcacttcatccc
agtcccgggcgagcagcgttgggtttatgtctttatttgacgaaaacggttgagggcaaa
ggcaatggaatcaagacagttatagtcaacatggttgacgttgcaaaggcgcttaatcgg
cctccaacgtatcccaccaaatattttggttgtgagctgggagcacagacccagtttgat
gttaagaatgaccgttacattgtcaatggatctcatgaggcgaataagctgcaagacatg
ttggatggattcattaaaaaatttgttctctgtcctgaatgtgagaatcctgaaacagat
ttgcatgtcaatccaaagaagcaaacaataggtaattcttgtaaagcctgtggctatcga
ggcatgcttgacacacatcataaactctgcacattcattctcaaaaacccacctgagaat
agtgacagtggtacaggaaagaaagaaaaagaaaagaaaaacagaaagggcaaagacaag
gaaaatggctccgtatccagcagtgagacaccaccaccaccaccaccaccaaatgaaatt
aatcctcctccacatacaatggaagaagaggaggatgatgactggggagaagatacaact
gaggaagctcaaaggcgtcgaatggatgaaatcagtgaccatgcaaaagttctgacactc
agtgatgatttggaaagaacaattgaggagagggtcaatatcctctttgattttgttaag
aaaaagaaagaagagggtgttattgattcatctgacaaagaaatcgttgctgaagcagaa
agactggatgtaaaagccatgggccctcttgttctaactgaagttctttttaatgagaag
attagagaacagattaagaaatacaggcgccatttcctacgattttgtcacaacaacaaa
aaagcccaacggtaccttcttcatggtttggagtgtgtggtagcaatgcatcaagctcag
cttatctccaagattccacatatcttgaaggagatgtacgatgcagaccttttagaagaa
gaggtcatcatcagctggtcggaaaaggcctctaagaaatatgtctccaaagaacttgcc
aaagagattcgtgtcaaagcagaaccatttataaaatggttgaaggaggcagaggaagaa
tcttctggtggcgaagaagaagatgaagatgagaacattgaggtggtgtattcgaaggct
gccagtgtaccgaaagttgagactgtaaagtcagacaacaaggatgacgacatcgatatt
gatgccatttaa

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_5|150_aa
MAEVKAGHLQKADNLRQSASVYVSPCQSMSVYVSPRQSMSVHVNLHQSMSVYVSAFQSTS
VHVSLQQSTSVHVSLRQSTSVHVSLRQSMSVYVSLRQSVSLCQSMLVYVSPCQSMPVYVS
LCQSMSKSVTNQSHGEVVPGRDHLAVRSRF

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_5|453_bp
atggcagaagtgaaagctggccatctgcaaaaagcagataatctacgtcagtctgcatca
gtctatgtcagtccatgtcagtctatgtcagtctacgtcagtccacgtcagtctatgtca
gtccacgtcaatctacatcagtctatgtcagtctatgtcagtgcatttcagtctacatca
gtccatgtcagtctacaacagtctacgtcagtccacgtcagtctacgtcagtctacatca
gtccatgtcagtctacgtcagtccatgtcagtctacgtcagtctacgccagtctgtcagt
ctatgtcagtctatgttagtctatgtcagtccatgtcagtctatgccagtctatgttagt
ctatgtcagtctatgtcaaaaagtgtgaccaaccagagtcacggagaagtcgtgcctggt
cgggaccacctggcagtgagaagtcgattctaa

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_6|380_aa
MALGDAGEKEPFAPAVILQLSDLLKMIDPHLVLDRQGLGDSEGQEAFSSSRTESSALRLQ
SAAGSDTHWLQGRETFTRSGAFDVLQMKEEHVLKFLVAGTHLGGTKLNFQMEQYICKRKS
DGIYIINLRRTWEKLLLAARAIVAIENPADTAVLKFAAATGATPVAGRFTPGTFTNQIQA
AFREPRLLVVTDPRADHRPLMEASYVNLPTIALCNTDSPLHYGDIAIPCDNKGAHSLGLM
WWTLAREVLRMRGTVSREHPWEVMPDLYFYRDPQEIKKEEQAAAEKAETKEEFQDQIHKP
WHLYWRCGKKLGEMWNNLNDSEKQPYITKAAKLKEKYEKDVVDFKSKGNFDGTKGPAKVA
WKNVEEEYEEDEEEEERGGS

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_6|1143_bp
atggccctgggggatgcaggagagaaagagccatttgcccctgcagtaattctccagctc
tcggatctgctgaagatgattgatccccatctagttctagatcggcaaggacttggagat
tcagaagggcaagaagccttctcaagctcacggacagaaagcagtgcccttagactccag
tctgctgctggctctgacacccactggctgcaaggcagggaaactttcacaaggtccgga
gcttttgatgtcctgcaaatgaaggaggagcatgtccttaagttccttgtggcaggaacc
cacttaggtggcaccaagcttaacttccagatggaacagtacatctgtaaaaggaaaagt
gatggcatctacatcataaatctgaggaggacctgggagaagcttctgctggcagctcgt
gccattgttgccattgaaaaccctgctgatacggctgtgctgaagtttgctgctgccact
ggagccactccagttgctggccgcttcactcctggaaccttcactaaccagatccaggca
gccttccgggagccacggcttcttgtggttactgaccccagggctgaccaccggccgctc
atggaggcatcttatgttaacctacctaccattgctctgtgtaacacagattctcctctg
cactatggggacattgccatcccatgtgacaacaagggagctcactcattgggtttgatg
tggtggacgctggctcgggaagttctgcgcatgcgtggcaccgtttcccgtgaacacccg
tgggaggtcatgcctgatctctacttctacagagatcctcaagagattaaaaaagaagag
caggctgctgctgaaaaggctgagacgaaagaggaatttcaggaccaaatccacaaaccc
tggcatctctactggagatgtggcaaaaagctgggtgagatgtggaataacttaaacgac
agtgaaaagcagccttacatcacaaaggcggcaaagctgaaggagaagtatgagaaggat
gttgttgactttaagtctaaaggaaactttgatggcacaaagggtcctgctaaagttgcc
tggaaaaatgtggaagaagaatatgaagaagatgaggaagaagaagaaagaggaggatca
tga

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_7|262_aa
MVKEAIEFLHRAPERLVRETAGIIDVLRRPAREQRDTADFWRKRSDLAASWVTTACPLFP
ERSAGGNTRLQGTARGARMPPRRSANYNAQIPGLARAVPRRVPGLILSLPRRPRRGDREG
GSGGAEPAGGGLRFGPPASPCLRAPRDPGPPHLLLHSLPSHREFQLAGLHTAPPAAPENE
EAEEDVKEVERKTGRRGLWAPTMERAAEEAPAPDCFSLSSELAASASALLAAAAAAAPRH
LRASRSQDGRPSASSAASGCRK

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_7|789_bp
atggtcaaggaggctatagagtttctacaccgagctcctgagcggctggtacgagagact
gccggcataatcgatgtacttcgcagacccgcccgcgaacagcgggacacggcagacttc
tggaggaaacggtccgacctggctgcctcctgggtgactacggcctgccctctatttcca
gagcgttcggctggggggaacacccgactgcaaggcactgcccgaggagcacgaatgcca
ccccgaaggtccgccaactacaacgcccagatcccgggcctggcccgggccgtcccccgg
cgggtcccggggctcattctctccctgcctcggcggccgaggcgaggagaccgggagggc
ggcagtggcggagccgagcctgcgggcggcgggctgcgcttcggcccgccggcgtcccct
tgcctcagggcgccccgcgatcctgggccccctcacctccttctccactccctcccgtcg
caccgcgaatttcagctcgcgggcctccacacagcgccgccggcggcgcctgaaaacgag
gaggcggaggaggatgtgaaggaagtggagcgcaagacaggccgccgcgggctgtgggcg
ccgaccatggagcgggctgcggaggaggcgcctgcgcccgactgcttctccctctctagc
gagcttgctgcctcagcctccgccctgctggccgccgctgccgccgccgcaccccggcac
ctcagagcctcacggagccaagatggccgcccctcggcttcctctgctgcctccggctgc
cggaagtga

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_8|187_aa
MVELQISLKNYDPQKDKRFSGTVRLKSTPCPKCGSWGNSSTVTRPRPRISPQVDTEVPKK
LKNKNLVKKLAKKYDAFLASESLIKQIPRILGIGLTKAVKFPSLLTHDENTVAKVDEAKS
TIKFHMNMLCLAVAVGHVKMTDDELVHNIHLAVNFLVSLLEKKLAECPGLIYQEHHGQAP
APILRHI

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_8|564_bp
atggtggagttgcagatcagcttgaagaactatgacccccagaaggacaagcgcttctcg
ggcaccgtcaggcttaagtccactccctgtcccaagtgtgggtcctgggggaacagcagc
actgtgacgagaccaaggccacggatatccccccaagtggacactgaggtgccaaaaaaa
ctcaagaataagaatctggtcaagaagctggccaagaagtatgatgcatttttggcttca
gagtctctgataaagcaaatcccacgaatcctcggcataggcctaacaaaggcagtaaag
ttcccttccctgctcacacacgatgaaaacacggtggccaaagtggatgaggcgaagtcc
acaatcaagtttcatatgaacatgttatgtttggctgtggctgttggccacgtgaagatg
acagacgatgagcttgtgcataacattcacctggctgttaacttcttggtgtcactgctc
gagaaaaaactggcagaatgtccgggccttatatatcaagagcaccacgggcaagcccca
gcgcctatattaaggcatatttga

>gi568815584f:103235861_103441049|GENSCAN_predicted_peptide_9|100_aa
MISMDYGKRMEVTIEGKKSGERLCHDQGISSNSALHLSSKTANEPTPIRSKRLCFLLGIL
QDTGPEDPPLKKASIAKVQGMSLEFLYIKHSHTFWSLLHF

>gi568815584f:103235861_103441049|GENSCAN_predicted_CDS_9|303_bp
atgatttctatggactatggaaaaagaatggaagtgacaatagaagggaagaaaagtgga
gagaggctttgccatgaccagggaatatcaagtaactcggctttgcacctgagctccaaa
acagcaaatgagcccacccccataagatcaaagaggctgtgcttcctactgggcatcctg
caggacacaggcccagaagacccacccctgaagaaagcatctatagccaaagtccaggga
atgagcttggagtttctctacatcaaacattctcacactttttggtcccttttacacttt
taa