GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:21:43 Sequence gi568815583f:41131383_41379386 : 248004 bp : 44.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 379 584 206 2 2 63 115 42 0.424 3.32 1.02 Term + 2161 2604 444 0 0 20 44 189 0.666 2.94 1.03 PlyA + 5087 5092 6 1.05 2.00 Prom + 9716 9755 40 -3.96 2.01 Init + 31780 32061 282 0 0 89 2 297 0.350 18.28 2.02 Term + 32540 32992 453 1 0 80 36 315 0.906 20.66 2.03 PlyA + 34004 34009 6 1.05 3.10 PlyA - 34031 34026 6 1.05 3.09 Term - 52674 52652 23 1 2 97 47 16 0.539 -3.13 3.08 Intr - 53211 52884 328 0 1 86 53 179 0.833 9.47 3.07 Intr - 58746 58555 192 0 0 56 92 101 0.898 6.99 3.06 Intr - 60203 60060 144 2 0 74 94 9 0.564 0.58 3.05 Intr - 64473 64393 81 0 0 116 88 27 0.963 5.33 3.04 Intr - 64655 64551 105 2 0 64 115 21 0.854 2.81 3.03 Intr - 78205 78119 87 1 0 105 67 62 0.486 5.97 3.02 Intr - 84451 84393 59 2 2 35 115 63 0.887 2.30 3.01 Init - 85382 85286 97 2 1 56 68 67 0.465 1.97 3.00 Prom - 93069 93030 40 -4.46 4.00 Prom + 96877 96916 40 -7.26 4.01 Init + 100001 100067 67 1 1 80 68 169 0.871 15.33 4.02 Intr + 112285 112357 73 2 1 57 96 102 0.314 6.36 4.03 Intr + 125528 125608 81 2 0 47 84 135 0.655 7.75 4.04 Intr + 131374 131501 128 1 2 36 65 129 0.945 5.72 4.05 Intr + 139175 139236 62 0 2 116 91 100 0.978 11.55 4.06 Intr + 147385 147507 123 0 0 107 81 132 0.998 15.18 4.07 Term + 147954 148007 54 2 0 119 43 91 0.997 5.26 4.08 PlyA + 150480 150485 6 1.05 5.00 Prom + 156078 156117 40 -3.66 5.01 Init + 160451 160623 173 1 2 62 63 133 0.852 7.12 5.02 Intr + 200774 201148 375 2 0 131 66 194 0.022 15.63 5.03 Intr + 210309 210360 52 0 1 41 74 26 0.023 -4.59 5.04 Intr + 211004 211072 69 1 0 65 109 53 0.744 4.48 5.05 Intr + 217716 217859 144 2 0 62 68 103 0.902 6.18 5.06 Intr + 219651 219747 97 2 1 19 97 77 0.319 1.28 5.07 Intr + 226767 226876 110 1 2 34 95 75 0.572 2.80 5.08 Intr + 234020 234207 188 1 2 60 103 21 0.505 -0.71 5.09 Intr + 240145 240302 158 1 2 86 58 106 0.762 7.05 5.10 Intr + 244330 244446 117 2 0 61 95 25 0.139 0.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:41131383_41379386|GENSCAN_predicted_peptide_1|216_aa XLFLTQSLFGRLFTRTHVTFGAEDPGQGESFRRLVPCPRPHSMRSSTYDLGSSDQPAQGT FNISPISNQKLQKLDSGPQTPQQDLINFAFKVYDNREEAAKRQRISELRLVASTVRETPA TSPAHKNFKTPKLQQPGIPPGPPPPGSCFKCQKSGQWAKECPQPGIPPKLCPICAGPHWK LDCPTCLAATPRAPRALAQGSLTDSFLDLLGLAAED >gi568815583f:41131383_41379386|GENSCAN_predicted_CDS_1|651_bp nctttgttcctcacacaaagcctgtttggtcgtctcttcacacggactcatgtgacattt ggtgccgaagacccgggacagggggagtccttcaggagactggtcccctgtcctcgccct cactccatgaggagctccacctatgatctcgggtcctcagaccaaccagcccaaggaaca tttaacatctcaccaatttcaaatcagaaactccaaaagctagactccggccctcaaacc ccacaacaagacttaattaactttgccttcaaggtgtacgataatagagaagaggcagcc aagcggcaacgtatttctgagttgcgattagttgcctccactgtgagagaaaccccagcc acatctccagcacacaagaacttcaaaacgcctaaactgcagcagccaggcattcctcca gggcctcctcccccaggatcttgtttcaagtgccagaaatctggccaatgggccaaggaa tgcccacagcccgggattcctcctaagctgtgtcccatctgtgcaggaccccactggaaa ttggactgtccaacttgcctggcagccactcccagagcccccagagcactggcccaaggc tctctgactgactccttcctagatctgctcggcttagcggctgaagactga >gi568815583f:41131383_41379386|GENSCAN_predicted_peptide_2|244_aa MMRCTLENRNAQTKQLQTAVSNVEKHFGELCQIFAAYVRKTARLRDKADHLVNEINAYAA TETPHLKLGLMNFDDEFAKLQDYRQAEVERLEAKAETELQRTAMDASRTSRHLEETINNF ERQKMKDIKTIFSEFITIEMLFHGKALEVYTAAYRNIQNIDEDEDLEVFRNSLYAPDYSS CLDTVRANSKSPLQRSLSAKCVSGTRQVSTCRLRKDQQAEDDEDEELDVTEEENFLNYTF PFSS >gi568815583f:41131383_41379386|GENSCAN_predicted_CDS_2|735_bp atgatgaggtgcaccctggaaaatcggaacgctcaaacgaaacaactgcaaacagctgtc tcaaatgtggagaagcattttggagaactgtgccaaatcttcgctgcctatgtgcggaaa actgccaggctgagagacaaagcagaccacctggtgaatgaaatcaatgcgtatgctgct acagagaccccacatttaaagctgggcctgatgaactttgatgatgagtttgccaaactt caggattatcgacaagcagaggttgaaagacttgaagccaaagcagaaacggaattacag agaactgcaatggatgctagccgaacaagtcgtcatctggaggaaactattaacaacttt gaaaggcagaaaatgaaggatataaagactatattttctgaatttatcacaatcgaaatg ttatttcacggcaaagctttagaggtctacactgctgcctaccggaatatacaaaacatt gatgaagatgaagatttagaggttttccgaaattctctgtatgcaccagattattcatct tgtttagatactgtaagagcgaattcaaagtcacctcttcagagatcactgtcagctaag tgtgtatctggaacaagacaggtatccacttgtcgactaagaaaggatcaacaagcagaa gatgatgaggatgaagagttagatgttacagaagaagaaaattttcttaactacacattt ccattttcatcataa >gi568815583f:41131383_41379386|GENSCAN_predicted_peptide_3|371_aa MKVEDLNVCEPASPAPEAPATSLLNDLKYSPSEEEEVTYTVINQFQQKFGAAILHIKKQN VLSVAAEGANVCRHGKLCWLQVATNCRVYLFDIFLLGSRAFHNGLQMILEDKRILKVIHD CRWLSDCLSHQYGILLNNVFDTQVADVLQFSMETGGYLPNCITTLQESLIKHLQVAPKYL SFLEKRQKLIQENPEVWFIRPVSPSLLKILALEATYLLPLRLALLDEMMSDLTTLVDGYL NTYREGSADRLGGTEPTCMELPEELLQLKDFQKQRREKAAREYRVNAQGLLIRTVLQPKK LVTETAGKEEKVKGFLFGKNFRIDKAPSFTSQDFHGDVNLLKEESLNKQATNPQHLPPTE EGETRNQSVSK >gi568815583f:41131383_41379386|GENSCAN_predicted_CDS_3|1116_bp atgaaagttgaagacctaaatgtatgtgagcctgcttctcctgcccctgaagcaccagct acctctctgctgaatgacctcaagtacagcccatcagaggaagaggaggtgacatacaca gtcattaatcaattccagcagaagtttggtgctgcgatactccatatcaagaagcagaat gtcctgagtgtggcagcagaaggagcgaatgtatgtcgccatggcaaactgtgctggctg caggtggccacaaattgccgagtttacttatttgacattttccttctgggaagtcgagct ttccacaatggacttcagatgatactagaagacaagagaattttgaaggttatccatgat tgtcgttggctttctgattgcctctctcatcagtatggaattttgctgaataatgtcttt gacacacaggtagcagatgtacttcagttttccatggaaacgggtggctatcttccaaac tgcatcactactttgcaggagagtttaatcaaacaccttcaagtagcccctaaatatctc tcctttctagaaaagagacaaaaactaattcaggaaaatccagaagtatggttcatccga cctgtttcaccctctttactgaaaattttggccctggaagctacctacctgttacccctt cgcttggcactcctagatgagatgatgtctgacctaaccaccctggtggatggttaccta aacacgtatcgcgaagggtctgcagaccggcttggaggcactgagcctacatgtatggag ctgccagaggaactgcttcaactcaaggacttccagaagcagcgcagggagaaagctgca agagaatatagggtgaatgcacagggactcctgataaggacagtgctacagccaaagaaa ttagtgacagagacagcagggaaagaggagaaagtcaaaggcttcttatttggtaaaaat tttaggatagataaagctccaagttttacatctcaagactttcacggggatgtgaattta ctgaaagaagaatctttgaataaacaagctacaaatcctcaacatctacctcccacagag gaaggggaaaccagaaaccagagtgtctccaagtga >gi568815583f:41131383_41379386|GENSCAN_predicted_peptide_4|195_aa MGSRASTLLRDEELEEIKKETGFSHSQITRLYSRFTSLDKGENGTLSREDFQRIPELAIN PLGDRIINAFFPEGEDQVNFRGFMRTLAHFRPIEDNEKSKDVNGPEPLNSRSNKLHFAFR LYDLDKDEKISRDELLQVLRMMVGVNISDEQLGSIADRTIQEADQDGDSAISFTEFVKVL EKVDVEQKMSIRFLH >gi568815583f:41131383_41379386|GENSCAN_predicted_CDS_4|588_bp atgggttctcgggcctccacgttactgcgggacgaagagctcgaggagatcaagaaggag accggcttttcccacagtcaaatcactcgcctctacagccggttcaccagcctggacaaa ggagagaatgggactctcagccgggaagatttccagaggattccagaacttgccatcaac ccactgggggaccggatcatcaatgccttctttccagagggagaggaccaggtaaacttc cgtggattcatgcgaactttggctcatttccgccccattgaggataatgaaaagagcaaa gatgtgaatggacccgaaccactcaacagccgaagcaacaaactgcactttgcttttcga ctatatgatttggataaagatgaaaagatctcccgtgatgagctgttacaggtgctacgc atgatggtcggagtaaatatctcagatgagcagctgggcagcatcgcagacaggaccatt caggaggctgatcaggatggggacagtgccatatctttcacagaatttgttaaggttttg gagaaggtggatgtagaacagaaaatgagcatccgatttcttcactaa >gi568815583f:41131383_41379386|GENSCAN_predicted_peptide_5|495_aa MVLNKTVLTSDASCTSEVPPATCIFDQVATDLESPMTLLEPAFDFVGFLIVFSSFKVRWF SATRWRKREHPLGLASAFPERFTALTGEDHGPEGPRQVPGEVHRVGEHCVTLCALEHSAP LRLQPRRQLRAGGWLLSPQPCGAERRGPLHHLRIPLHGGRKRSLVNRPLSATTKVPPGRR CTTPQHFWLIPIKCILHIVQATKLLKALKGYIKHEARKGNENQDESQTSASSCDETEIQI SNQEEAERQPLGHVTKTRRRCKTVRVDPDSQNHEKQESQDLRATAKVPSPPDEHQEAENA VSSDFKKLHEAHFKEMESIDQYIERKKKHFEEHNSMNELKQQPINKGGVRTPVPPRGRLS VASTPISQRRSQGRSCGPASQSTLGLKGSLKRSAISAAKTGVRFSAATKDNEHKRSLTKT PARKSAHVTVSGGTPKGEAVLGTHKLKTITGNSAAVITPFKLTTEATQTPVSNKKPVFDL KASLSRPLNYEPHKX >gi568815583f:41131383_41379386|GENSCAN_predicted_CDS_5|1485_bp atggtcctcaacaagactgtcctcacttcagatgccagctgtacttcggaggtaccccca gccacctgcattttcgaccaagtggctacagatttggagagtcccatgaccctcttagaa ccagcttttgatttcgttggttttcttattgtttttagttcctttaaagttaggtggttc tccgccacccggtggagaaagcgggaacaccctctcgggctagcctctgcctttcccgaa cgcttcactgcactcactggagaagaccacggccccgagggaccgcgacaggtcccaggc gaggtgcaccgagtcggcgagcactgcgtgacactgtgcgcactggaacacagcgcacct ctcaggctgcagccaagacggcagctgcgggccggcggctggctcctcagcccccagccc tgcggggccgagcggcgaggaccccttcaccacctgcgtatcccactccatggaggtcgt aaaagaagcttggtcaatcgccctctcagtgccaccacaaaagtccccccggggcggcgt tgcacaacgccacagcatttctggctcatccccatcaaatgcatcttgcacattgtgcag gcaaccaagttgttaaaagccttgaaaggctacattaaacatgaggcaagaaaaggaaat gagaatcaggatgaaagtcaaacttctgcatcctcttgtgatgagactgagatacagatc agcaaccaggaagaagctgagagacagccacttggccatgtcaccaaaacaaggagaagg tgcaagactgtccgtgtggaccctgactcacagaatcatgaaaagcaggaaagccaggat ctcagagctactgcaaaagttccttctccaccagacgagcaccaagaagctgagaatgct gtttcctcagactttaagaagcttcatgaagctcattttaaggaaatggagtccattgat caatatattgagagaaaaaagaaacattttgaagaacacaattccatgaatgaactgaag cagcagcccatcaataagggaggggtcaggactccagtacctccaagaggaagactctct gtggcttctactcccatcagccaacgacgctcgcaaggccggtcttgtggccctgcaagt cagagtaccttgggtctgaaggggtcactcaagcgctctgctatctctgcagctaaaacg ggtgtcaggttttcagctgctactaaagataatgagcataagcgttcactgaccaagact ccagccagaaagtctgcacatgtgaccgtgtctgggggcaccccaaaaggcgaggctgtg cttgggacacacaaattaaagaccatcacggggaattctgctgctgttattaccccattc aagttgacaactgaggcaacgcagactccagtctccaataagaaaccagtgtttgatctt aaagcaagtttgtctcgtcccctcaactatgaaccacacaaagnn