GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:48:42 Sequence gi568815592f:52971024_53197827 : 226804 bp : 41.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 684 679 6 1.05 1.01 Sngl - 4259 3972 288 2 0 88 50 151 0.346 6.64 1.00 Prom - 6853 6814 40 -2.85 2.04 PlyA - 6951 6946 6 1.05 2.03 Term - 7569 7447 123 0 0 105 41 118 0.991 6.20 2.02 Intr - 11682 11551 132 0 0 84 99 43 0.958 4.92 2.01 Init - 13575 13441 135 0 0 47 100 118 0.524 9.13 2.00 Prom - 21336 21297 40 -5.05 3.14 PlyA - 22686 22681 6 1.05 3.13 Term - 23238 22967 272 1 2 73 42 243 0.963 12.86 3.12 Intr - 35414 35292 123 0 0 56 105 80 0.943 6.14 3.11 Intr - 38544 38416 129 1 0 82 109 46 0.968 5.85 3.10 Intr - 40894 40746 149 0 2 83 50 197 0.566 14.36 3.09 Intr - 41204 41146 59 2 2 77 81 15 0.452 -3.54 3.08 Intr - 42959 42639 321 2 0 76 97 287 0.756 23.53 3.07 Intr - 45227 45060 168 2 0 84 5 160 0.873 6.52 3.06 Intr - 47478 47307 172 2 1 59 90 51 0.472 1.42 3.05 Intr - 48336 48204 133 1 1 73 23 175 0.562 8.28 3.04 Intr - 60121 60042 80 0 2 90 90 44 0.010 3.08 3.03 Intr - 61631 61510 122 2 2 83 77 73 0.007 4.07 3.02 Intr - 62424 62349 76 2 1 58 83 37 0.003 -1.20 3.01 Init - 70213 70113 101 1 2 82 106 88 0.042 9.79 3.00 Prom - 72318 72279 40 -7.05 4.04 PlyA - 73516 73511 6 1.05 4.03 Term - 82968 82819 150 0 0 63 48 117 0.500 2.13 4.02 Intr - 87856 87774 83 2 2 84 73 32 0.255 -0.26 4.01 Init - 91051 90859 193 1 1 42 55 159 0.488 7.19 4.00 Prom - 91797 91758 40 -8.55 5.03 PlyA - 92392 92387 6 1.05 5.02 Term - 94656 94321 336 0 0 92 40 338 0.656 22.99 5.01 Init - 97187 97176 12 2 0 77 121 23 0.422 3.64 5.00 Prom - 97460 97421 40 -6.15 6.00 Prom + 97526 97565 40 -6.45 6.01 Init + 102470 102615 146 1 2 27 -24 186 0.296 0.94 6.02 Intr + 103218 103347 130 0 1 108 80 -33 0.478 -2.32 6.03 Intr + 105463 105520 58 0 1 71 93 87 0.880 5.04 6.04 Intr + 107776 107875 100 2 1 87 37 131 0.993 6.15 6.05 Intr + 109945 110075 131 1 2 102 64 132 0.994 11.62 6.06 Intr + 111481 111595 115 2 1 67 79 107 0.892 6.29 6.07 Intr + 121406 121524 119 2 2 90 115 36 0.826 5.69 6.08 Intr + 121711 121801 91 2 1 70 87 47 0.989 0.93 6.09 Intr + 122443 122538 96 1 0 89 69 65 0.880 2.91 6.10 Intr + 124490 124641 152 2 2 109 58 94 0.028 7.39 6.11 Intr + 139149 139201 53 1 2 88 73 59 0.132 2.11 6.12 Term + 148803 149021 219 2 0 80 48 130 0.568 4.26 6.13 PlyA + 150683 150688 6 1.05 7.07 PlyA - 150743 150738 6 1.05 7.06 Term - 157773 157183 591 0 0 80 46 383 0.618 26.84 7.05 Intr - 159908 159780 129 2 0 61 51 88 0.752 2.37 7.04 Intr - 161096 160984 113 0 2 101 47 117 0.959 8.08 7.03 Intr - 163301 163049 253 2 1 61 75 215 0.969 13.58 7.02 Intr - 174739 174535 205 0 1 56 91 134 0.490 8.78 7.01 Init - 179177 179092 86 2 2 77 84 81 0.924 5.16 7.00 Prom - 180323 180284 40 -7.45 8.00 Prom + 189811 189850 40 -5.25 8.01 Init + 205339 205485 147 0 0 73 9 216 0.126 12.34 8.02 Intr + 212175 212356 182 2 2 61 67 106 0.008 3.54 8.03 Term + 221822 221891 70 2 1 83 48 65 0.134 -1.57 8.04 PlyA + 222530 222535 6 1.05 9.02 PlyA - 224823 224818 6 1.05 9.01 Term - 225923 225726 198 2 0 18 33 219 0.840 5.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 60545 60600 56 1 2 42 99 94 0.893 6.71 S.002 Term + 61888 61987 100 1 1 70 33 130 0.881 2.32 S.003 Term + 126699 126807 109 0 1 109 43 96 0.881 4.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_1|95_aa MPKWGSCRGFSVRNKRGKMDSLGPWCLGQKCQGTQNKATLSERFWQIECLKKHLTLTLLL SSPGRSSSAGANLPRGWNRAESESLGTGAATDNNQ >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_1|288_bp atgcctaaatggggcagttgcagaggcttcagtgtcaggaacaaaagaggaaaaatggac agcttgggaccctggtgtctaggacagaagtgccaagggacccagaataaagccacactt tcagagaggttttggcagattgagtgcttgaagaaacacttgactctgactctcctcctg tcctcacctggccgaagcagttctgcaggggccaacctccctaggggatggaacagagca gaaagtgaatccctcgggacaggggctgcaacagacaataaccagtag >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_2|129_aa MYVEGTLDLLELLIMHPFLKPDDQQKEVVNMAQKAIIRYFPVFEKILRGHGQSFLVGNQL SLADVILLQTILALEEKIPNILSAFPFLQEYTVKLSNIPTIKRFLEPGSKKKPPPDEIYV RTVYNIFRP >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_2|390_bp atgtacgtggaggggacactggatctgctggaactgcttatcatgcatcctttcttaaaa ccagatgatcagcaaaaggaagtggttaacatggcccagaaggctataattagatacttt cctgtgtttgaaaagattttaaggggtcacggacaaagctttcttgttggtaatcagctg agccttgcagatgtgattttactccaaaccattttagctctagaagagaaaattcctaat atcctgtctgcatttcctttcctccaggaatacacagtgaaactaagtaatatccctaca attaagagattccttgaacctggcagcaagaagaagcctccccctgatgaaatttatgtg agaaccgtctacaacatctttaggccataa >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_3|634_aa MNRYTTIRQLGDGTYGSVLLGRSIESGELIAIKKTSNTNRKQCAHGYTHMYAHVPGRGKS LKKLNHANVVKLKEVIRENDHLYFIFEYMKENLYQLIKERNKLFPESAIRNIMYQILQGL AFIHKHGFFHRDLKPENLLCMGPELVKIADFGLAREIRSKPPYTDYVSTRWYRAPEVLLR STNYSSPIDVWAVGCIMAEVYTLRPLFPGASEIDTIFKICQVLGTPKKTDWPEGYQLSSA MNFRWPQCVPNNLKTLIPNASSEAVQLLRDMLQWDPKKRPTASQALRYPYFQVGHPLGST TQNLQDSEKPQKGILEKAGPPPYIKPVPPAQPPAKPHTRISSRQHQASQPPLHLTYPYKA EVSRTDHPSHLQEDKPSPLLFPSLHNKHPQSKITAGLEHKNGEIKPKSRRRFESVLDLKP SEPVGTGNSAPTQTSYQRRDTPTLRSAAKQHYLKHSRYLPGISIRNGILSNPGKEFIPPN PWSSSGLSGKSSGTMSVISKVNSVGSSSTSSSGLTGNYVPSFLKKEIGSAMQRVHLAPIP DPSPESLKSYHGSKAQAPLSQRKRPDGVREMGFSCRRSRGTLYAVSSRICHTAFETFERK IRFEAPFSSSSRPPKSQARFTESDTIPEGIFLYN >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_3|1905_bp atgaatagatacacaacaatcaggcagctcggggatggaacctacggttccgtcctgctg ggaagaagcattgagtctggggagctgatcgctattaaaaaaactagcaatacaaacaga aaacagtgtgcacatggctatactcacatgtacgcacatgttccaggaagaggaaagtct ttaaagaagctcaaccatgccaatgtagtcaaattaaaagaagttatcagggaaaatgat catctttattttatcttcgagtacatgaaggaaaatctttaccagctcattaaagagaga aataagttgtttcctgagtctgctataaggaatatcatgtatcagatattacaaggactc gcatttattcacaaacacggcttctttcatcgagacttaaagcctgagaacctcctctgc atgggaccagaacttgtgaaaattgcagactttggtttggcccgagaaatacgatcaaaa cctccatatacagattatgtatctaccagatggtacagggctccagaagtactcctgagg tctaccaactacagctcccccattgacgtctgggcggtgggctgcatcatggcagaagtt tacaccctcaggccactcttccctggagccagtgaaattgacacaatattcaaaatttgc caagtgctggggacaccaaaaaagactgactggcctgaaggctatcaactttcaagtgca atgaacttccgttggccacagtgtgtacccaataacttaaagaccttgattcccaatgct agcagtgaagcagtccagctcctgagagacatgcttcagtgggatcccaagaaacgacca acagctagtcaggcacttcgatatccttacttccaagttggacacccactaggcagcacc acacaaaaccttcaggattcagaaaaaccacagaaaggcatcctggaaaaggcaggccca cctccttatattaagccagtcccacctgcccagccaccagccaagccacacacacgaatt tcttcacgacagcatcaagccagccagccccctctgcatctcacgtacccctacaaagca gaggtctccaggacagatcacccaagccatctccaggaggacaagccaagcccgttgctt ttcccatccctccacaacaagcatccacagtcgaaaatcacagctggcctggagcacaaa aatggtgagataaagccaaagagtaggagaaggtttgagagtgttttggacctgaagccc tctgagcctgtgggcacaggaaacagtgcccccacccagacgtcatatcagcggcgagac acgcccaccctgagatctgcagccaagcagcactatttgaagcactctcgatacttgcct gggatcagtataagaaatggcatactctcgaatccaggcaaggaatttattccacctaat ccatggtctagttctggcttgtctggaaaatcttcagggacaatgtcagtaatcagcaaa gtaaattcagttggttccagctctacaagttctagtggactgactggaaactatgtccct tcctttctgaaaaaagaaatcggttctgctatgcagagggtacacctagcacctattcca gacccttcccctgaaagcctgaaaagctatcatggcagcaaggcccaagctccactatcc caacggaagaggccggatggagtccgtgagatgggttttagctgccgccggagtcgaggt accttatatgctgtttcttcacggatttgtcatacagcttttgagacctttgaaagaaaa atacgttttgaagcaccgttcagttctagttctcgtccaccaaaatcacaggcacgcttc actgagtcagataccatccctgaagggatttttctttataattag >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_4|141_aa MVDRADLQRGVRSESEIALRSAGVRGPWALCCPRPRAAPGGSAARARANRTQCRDAQGQR GELQGSCRVGVVPSRSIDVCPYKMHFSACGIYTNASSIPDTESQNHTALEEETTLEQVPC PNSTKSKLKDQEMPRGTADQC >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_4|426_bp atggttgatcgagccgatctgcaacgaggcgttcgttcggagtcggagatcgccttgcgg tcagcgggggttaggggtccctgggcgctgtgctgccctcggcccagagctgcgcctggc ggctcggccgcgcgagccagggcgaacaggacgcagtgcagggacgcgcagggccagcga ggggagctgcagggatcttgcagagttggagtggtacctagccgttctattgatgtctgc ccttataaaatgcatttttctgcttgtgggatctatacaaatgctagctcaattcctgat acagagtcacagaatcacacagctttagaggaagaaaccaccttagagcaagtcccatgc cctaattccaccaaaagtaaactcaaggaccaagagatgccccgaggcacagcagatcag tgctga >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_5|115_aa MAFKRSLAEFSVGEAHQAAAITAGPAYGSHLRGAGGQIGARWGGGGCRGLGEWRDARCQP RRGIVRAPERQPYWRPRTPPRSCPLSCGNLRRRAEHAQCRCCPCGRPGHPSPRAA >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_5|348_bp atggccttcaagcggagcttagcagagttctcggtcggagaagcgcaccaggcagcggca ataactgcgggcccggcgtacggcagccatcttcgcggggcagggggccagatcggggcg cgctggggtggcggcggctgccggggactcggggagtggagggacgcccgttgccagccg aggcgggggattgtgcgagcaccggagcgtcagccctactggagaccccggacgccgccg cggagctgccccctcagctgcggaaacctgcgccgacgcgccgagcatgcgcagtgccgg tgctgcccgtgtgggcggcccggccacccatccccgcgggctgcctag >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_6|469_aa MFRAQWMFELAPGVSSSNLENRPCRAARGSLQKTSADTKGKQEQAKEEKGCHLFTIRSKF VFIISHQIIGITMMTYLLSPLNFYFEIITGPQARELFLKAVEEEQNGALYEAIKFYRRAM QLVPDIEFKITYTRSPDGDGVGNSYIEDNDDDSKMADLLSYFQQQLTFQESVLKLCQPEL ESSQIHISVLPMEVLMYIFRWVVSSDLDLRSLEQLSLVCRGFYICARDPEIWRLACLKVW GRSCIKLVPYTSWREMFLERPRVRFDGVYISKTTYIRQGEQSLDGFYRAWHQVEYYRYIR FFPDGHVMMLTTPEEPQSIVPRLRTRNTRNHLTINTDIFVVSLYKKQIRVFMWGYSYVPV VTRGSTNSSGYIILVTLLTRIWTQVMSSEFLEDGVTREVGRTHLATKTAFPPTNPAQMSK SSLAVTPPCGLHGRDRWTKAMTVKVPSGVKATYSLAAPQPIAFTPILLH >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_6|1410_bp atgttccgagctcagtggatgtttgaacttgctccaggtgtaagctctagcaatttagaa aatcgaccttgcagagcagcaagaggctctctccagaaaacatcggcagataccaaagga aaacaagaacaggcaaaagaagaaaaaggctgccatctttttaccatcagatccaagttt gtgtttataattagtcatcaaataattggcattaccatgatgacttaccttctatctcct cttaatttttattttgaaataattacaggtccacaggctcgagaactcttcctaaaagca gtagaagaagaacaaaatggagctctctatgaagccatcaagttttatcgtagggctatg caacttgtacctgatatagagttcaagattacttatacccggtctccagatggtgatggc gttggaaacagctacattgaagataatgatgatgacagcaaaatggcagatctcttgtcc tacttccagcagcaactcacatttcaggagtctgtgcttaaactgtgtcagcctgagctt gagagcagtcagattcacatatcagtgctgccaatggaggtcctgatgtacatcttccga tgggtggtgtctagtgacttggacctcagatcattggagcagttgtcgctggtgtgcaga ggattctacatctgtgccagagaccctgaaatatggcgtctggcctgcttgaaagtttgg ggcagaagctgtattaaacttgttccgtacacgtcctggagagagatgtttttagaacgg cctcgtgttcggtttgatggcgtgtatatcagtaaaaccacatatattcgtcaaggggaa cagtctcttgatggtttctatagagcctggcaccaagtggaatattacaggtacataaga ttctttcctgatggccatgtgatgatgttgacaacccctgaagagcctcagtccattgtt ccacgtttaagaactaggaataccagaaaccacttgactataaatacagatattttcgtc gtgtccctgtacaagaagcagatcagagttttcatgtggggctacagctatgttccagtg gtcaccagaggttcaacaaactcatctggatacatcattcttgtcacattacttacaagg atctggacacaggtcatgagctcagagtttttagaagatggtgtcactagggaggttgga aggacccatttggccacaaagacagcatttccacctaccaacccagcccagatgtctaag tccagcttagctgtgacccctccttgtggactccatggcagagacaggtggacaaaagca atgactgttaaggtgccatctggggtcaaggccacatacagtctggcagcacctcagcca attgccttcacccctatcctactccattga >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_7|458_aa MRLSKPFAFSPSHLCLLSLILESLDLPDRKQSPGPRCLSGPIQLYQEPLRTKFSSISEGS RIGENLGWPDLIMEPDDFDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHIYSSE DKNAQRHLSSWAMRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICDKARQKQ QRKRCPNCDGPLKLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETKLEAEARRA MKKVNTAPSSVSLSLKGSTETRSYGLGGITDLTDQTSTVDPMKLYEKRKLSSSRTYSSGD LLPPSASGVYSDHGDLQAWSKNAALGRNHLADNCYSNYPFPLTSWPCSFSPSQNSSEPFY QQLPLEPPAAKTGCPPLWPNPAGNLYEEKVHVDFNSYVQSPAYHSPQEDPFLFTYASHPH QQYSLPSKSSKWDFEEEMTYLGLDHCNNDMLLNLCPLR >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_7|1377_bp atgaggctttccaagccttttgccttctcgccttctcatctctgcctcctaagcctcatc cttgaaagtctggaccttcctgacagaaaacaatctcctggtccaaggtgcttgagtggg ccgatccagctatatcaagaacctttgagaacaaaattctcaagcatttctgaggggagt cgaataggtgaaaaccttggctggcctgaccttatcatggaacctgacgactttgattct gaagacaaagagatattaagctgggatattaatgatgtgaaactgccacagaacgtgaaa aaaaccgactggttccaggagtggccagattcctatgccaaacacatctacagctcggag gacaagaatgcgcagcggcacctgagcagctgggccatgcgcaataccaacaaccacaac tcccgcatcctcaagaagtcctgcctgggtgtggtggtgtgcggccgcgactgtctcgca gaggaggggcgcaagatctacctgagacctgccatctgtgacaaggcccggcagaagcag cagcggaaacgctgtcccaactgtgacgggcctctgaagctcatcccttgccgaggtcat gggggcttcccggtcaccaacttctggaggcacgacggacgctttatatttttccagtca aagggagagcatgatcatccaaaaccagaaaccaagttagaagctgaggcaagaagagcc atgaagaaagtgaacacagcaccttcctccgtctcattgagcctgaaggggagcacagag accaggagttatggtctgggaggaatcacagatctgactgaccagacttccactgtggac cccatgaagctctatgaaaagcgcaaattgtccagtagcagaacctacagtagtggagac ctgcttcctccttctgcctccggagtctactctgatcatggcgatctacaagcgtggagt aaaaatgctgctttggggagaaatcatcttgctgacaactgttattccaattatcctttt cctctgaccagctggccttgcagcttctctccttcccaaaactcttcagaacccttttac cagcagcttccattggagccacctgcagccaaaactggctgtcccccattatggccaaat ccagcgggtaatctttatgaagagaaagtacatgtggattttaacagctacgtccagtct cctgcataccattcacctcaagaagacccctttctcttcacctacgcctctcatcctcat cagcaatattcactgccaagcaagagcagcaaatgggattttgaggaagaaatgacatac ttgggtttggatcactgcaacaatgatatgcttctgaacctgtgtcctttgagatga >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_8|132_aa MSGLEGGKKKPLKQPKKQAKEMDEEQEASKREKKGEAEETQELKVKAKAALRGVCRAQGS CVREDVLDSEPGEASVGVLTNSSSKRGGSGSFQPRIEASMASGNWVSRPRVFWSNQEEIK CLGLHFQQLELS >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_8|399_bp atgtccggcctcgaaggcgggaagaagaagcccctgaaacaaccaaagaagcaagccaag gagatggacgaggaacaagaggcctccaaacgggaaaaaaaaggagaagcagaagaaacc caggagctaaaagtgaaggccaaggctgccctgaggggcgtgtgcagagctcagggctcc tgtgtgagggaggacgttcttgactctgagccaggtgaagccagtgtgggggtgcttaca aattcgagcagtaagagaggaggttctgggagcttccagcccaggatagaagcctccatg gcttctggcaactgggtgtccagacccagggtgttctggtcaaatcaagaggagatcaaa tgtctaggccttcactttcagcagctggaactgagttga >gi568815592f:52971024_53197827|GENSCAN_predicted_peptide_9|65_aa DQERHVGDLGNVMAGKDGVANVSTEDSLSEDHSISGCTRVIQKKQMSWAKVAMKKAQRRE TLEVI >gi568815592f:52971024_53197827|GENSCAN_predicted_CDS_9|198_bp gatcaagagaggcatgtcggagacctgggcaatgtgatggctggcaaagatggtgtggcc aatgtgtctactgaagattcactttcagaagatcattccatcagtggctgtacaagggtg atccagaagaaacagatgtcttgggcaaaggttgcaatgaagaaagcacaaagacgagaa acactggaagtcatttag