GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:58:57 Sequence gi568815585f:44889655_45128000 : 238346 bp : 41.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1155 1150 6 -0.45 1.04 Term - 2357 2076 282 2 0 76 39 179 0.245 6.24 1.03 Intr - 4279 4133 147 1 0 114 61 13 0.190 0.71 1.02 Intr - 14284 14065 220 0 1 113 82 170 0.668 16.28 1.01 Init - 21148 21099 50 1 2 79 97 47 0.577 5.17 1.00 Prom - 21218 21179 40 -7.55 2.00 Prom + 27795 27834 40 -8.15 2.01 Init + 28337 28479 143 1 2 59 57 122 0.474 5.86 2.02 Intr + 35003 35292 290 2 2 42 10 154 0.022 -1.03 2.03 Term + 40685 40923 239 0 2 54 47 197 0.677 7.55 2.04 PlyA + 42817 42822 6 1.05 3.07 PlyA - 44706 44701 6 1.05 3.06 Term - 51668 51552 117 2 0 67 48 142 0.984 5.66 3.05 Intr - 54020 53788 233 0 2 53 54 206 0.996 10.27 3.04 Intr - 60184 60068 117 2 0 102 115 6 0.941 4.22 3.03 Intr - 69920 69727 194 1 2 56 107 167 0.321 13.61 3.02 Intr - 76282 76190 93 0 0 87 94 87 0.306 7.36 3.01 Init - 80026 80007 20 1 2 54 83 31 0.320 -1.33 3.00 Prom - 81342 81303 40 -7.55 4.00 Prom + 81572 81611 40 -4.15 4.01 Init + 84848 84853 6 1 0 65 106 0 0.586 0.48 4.02 Term + 86631 87083 453 2 0 -46 43 381 0.958 14.37 4.03 PlyA + 90099 90104 6 1.05 5.00 Prom + 96348 96387 40 -6.45 5.01 Init + 99577 99630 54 0 0 86 105 68 0.050 9.64 5.02 Intr + 99968 100088 121 1 1 -10 107 165 0.030 7.75 5.03 Intr + 114657 114783 127 1 1 47 86 208 0.008 15.32 5.04 Intr + 116548 116649 102 1 0 64 54 159 0.465 8.47 5.05 Intr + 119141 119225 85 2 1 67 69 43 0.290 -0.80 5.06 Intr + 125298 125429 132 2 0 44 95 152 0.993 11.42 5.07 Intr + 125778 125942 165 2 0 83 100 123 0.999 12.24 5.08 Intr + 130676 130774 99 1 0 97 101 47 0.901 6.29 5.09 Term + 138131 138349 219 1 0 104 42 147 0.993 7.76 5.10 PlyA + 138586 138591 6 1.05 6.00 Prom + 141199 141238 40 -5.75 6.01 Init + 148666 148758 93 0 0 114 60 26 0.692 2.83 6.02 Intr + 156928 157070 143 0 2 20 64 136 0.016 2.63 6.03 Intr + 163334 163505 172 2 1 38 69 109 0.081 3.02 6.04 Term + 164587 164838 252 0 0 120 38 187 0.566 11.55 6.05 PlyA + 165166 165171 6 1.05 7.00 Prom + 171645 171684 40 -6.75 7.01 Init + 172083 172101 19 2 1 97 48 14 0.602 -1.39 7.02 Intr + 175066 175258 193 2 1 85 78 126 0.549 8.93 7.03 Intr + 188528 188784 257 2 2 130 70 66 0.323 5.16 7.04 Intr + 194090 194168 79 0 1 68 94 76 0.199 3.89 7.05 Intr + 216605 216775 171 2 0 82 54 118 0.488 5.94 7.06 Intr + 230868 231067 200 0 2 92 90 190 0.821 17.67 7.07 Intr + 233566 233697 132 2 0 63 28 148 0.393 6.00 7.08 Term + 234353 234483 131 0 2 20 45 110 0.206 -2.74 7.09 PlyA + 236084 236089 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 17698 17452 247 2 1 73 75 149 0.928 7.70 S.002 Init + 99434 99630 197 1 2 77 105 117 0.912 10.75 S.003 Init + 114626 114783 158 1 2 52 86 223 0.989 17.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_1|232_aa MKEEGLSIATNCLPKDRTSPQSPPMMSPGNCWSFPSRGLGHSCFDSKSFNAKASVAVRNR EGSNAAGQLPPGSVVASLDEERKLVLELPLGLKACFTSCLLSRSLSILQRRHPRAITNAP GVLLLPGHAFLINVYIKTQWGMGQQFLLAGDVCQCLETFLVSQQQLKVGVVLASSGQTEI RDATKHPTTHTATPTTKNQPAPNVHSAEVETPCSTTANCVFILSDKCLGLYC >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_1|699_bp atgaaggaagaaggattgtctattgccacaaactgtttgccaaaagataggacttctccc cagtcccctcctatgatgtcacctggcaactgctggagcttcccaagcagaggattagga cattcttgttttgactccaagagcttcaacgcaaaggcctctgtggctgttaggaacaga gagggcagtaatgcggctggccagctgccaccagggtcagtggtggccagcctggatgaa gaaaggaagctcgtgcttgagctgcctctgggcctcaaggcatgcttcacctcctgcctc ctgtccagatccctgtcaatcctgcaacgtagacatccgagggccatcactaatgcaccg ggtgtcctccttctgcctgggcatgcatttttaattaatgtttatatcaagactcagtgg gggatggggcagcaatttctcctcgcaggggatgtttgccaatgtctggagacatttttg gtttcacagcagcagttgaaggtgggagtggtattggcatccagtgggcagacagagatc agggatgctactaaacatcctacaacgcacacagcaacccccacaacaaagaatcagcca gccccaaatgtccacagtgccgaggttgagacaccctgttctactacagcaaactgtgtt tttatcctttcagacaaatgcttaggtctatattgttag >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_2|223_aa MWETAIATPSLDKLSLLGRLPPTPSRSSVGVKPAPAPARSLQRFKPFSCNLLGIYTWAIN EDLRIHIAKAELLRVHPQPHSPASSLCIFSIPVNSSFILPIVPSQNPGITLDSILSQPTS NPSENPTGSIMKIDPELDHFSHSPGGVLEEGSCWLPVPKSDHDHTTSSRQQQLLPVFPPF PEAASPPPPHLSLLRDTSTYWLASPPQHEASGPAKRDLPSPEG >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_2|672_bp atgtgggagacggcaatagcgactccaagcctagacaaattgagtcttctcggtcggctt ccgcccactccatcgcgttcatccgtaggcgtcaaacctgctcctgcgcctgcgcggagt ctgcagcggtttaaaccgttcagctgcaaccttcttggcatctacacatgggcaattaat gaggacctcaggattcacatagctaaagctgagctcctgagagtccatccccaaccgcac tccccagccagctccctctgcatcttctccattccagttaatagcagctttattctcccc attgttccaagccagaaccctggaatcaccctggattccattctctctcagcctacctcc aacccctcagaaaatcctactggctccatcatgaaaatagacccagaattggatcacttc tcccattcccctggaggggtactagaggaagggagctgttggcttcctgttcctaagagt gaccatgatcacaccacctcatcccggcagcagcagttgcttccagtgtttccaccattt ccagaagcagcctcaccacctcctccccatctctctctgctcagagacaccagcacctac tggctggcatcccctcctcagcatgaggcatcggggccagccaagcgggaccttcctagc ccagagggctga >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_3|257_aa MRKVKKRNYPTLANIERKKKLKLEKEKRGAVLTTTQYGKMKGMSRHSQMAKIRSPGKNHK WKNDNSRQRAVTGSGSHLCDLKLEGPPEANADPLGVLINSDSESDKEEKPQHSVIPKEVT PALCSLMSSYGSLSGSESEPEETPIKTEADVLAENQVLDSSAPKSPSQDVKATVRNFSEA KSENRKKSFEKTNPKRKKDYHNYQTLFEPRTHHPYLLEMLLAPDIRHERNVILQCVRYII KKDFFGLDTNSAKSKDV >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_3|774_bp atgaggaaggtgaagaagagaaactatccaactctggccaatattgaaaggaagaagaag ttaaaacttgaaaaggagaagagaggagcagtattgacaacaacacaatatggcaagatg aaggggatgtccagacattcacaaatggcaaagatcagaagtcctggcaagaatcacaaa tggaaaaacgacaattctagacagagagcagtcactggatcaggcagtcacttgtgtgat ttgaagctagaaggtccaccggaggcaaatgcagatcctcttggtgttttgataaacagt gattctgagtctgataaggaggagaaaccacaacattctgtgatacccaaggaagtgaca ccagccctatgctcactaatgagtagctatggcagtctttcagggtcagagagtgagcca gaagaaactcccatcaagactgaagcagacgttttggcagaaaaccaggttcttgatagc agtgctcctaagagtccaagtcaagatgttaaagcaactgttagaaatttttcagaagcc aagagtgagaaccgaaagaaaagctttgaaaaaacaaaccctaagaggaaaaaagattat cacaactatcaaacgttattcgaaccaagaacacaccatccatatctcttggaaatgctt ctagctccggacattcgacatgaaagaaatgtgattttgcagtgtgttcggtacatcatc aaaaaagacttttttggactggatactaattctgcgaaaagtaaagatgtatag >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_4|152_aa MKKKEGRRRRRKEEEKKGGRRRRKEKGGGGGGEGEGEEEEEKEEEEEKRRRRKRKRRKRK KKKKNNNNKRRRKKKQPIDRKETSRTNKTFFKKSSYLGSGVMRRKPCIYKEATYRSQGTS DTVLQNSYVKLLHGLSPFYVAITEYIIRLDNL >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_4|459_bp atgaaaaagaaggaaggaaggagaaggagaagaaaggaggaggagaagaaaggaggaagg agaagaagaaaggagaaaggaggaggaggaggaggcgaaggtgaaggtgaggaggaggag gagaaggaagaggaagaagagaagaggaggaggaggaagaggaagaggaggaagaggaag aagaaaaagaagaataacaacaacaagaggaggaggaagaagaaacaaccaatagataga aaagaaacaagcagaacaaacaaaacattttttaaaaagtccagctacttggggagtggt gtcatgaggagaaaaccttgtatctataaagaagctacctacagaagccaaggaacttca gataccgtgcttcaaaattcctatgtcaaactacttcatggtctcagtccattttatgtt gctataactgaatacatcataagattggataatttataa >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_5|367_aa MGGCDSEEGFDPAAGSEDTHICDQCPPPRMARDLIGPALPPGFKARGTAEDEERDPSPGP ALPPNYKSSSSDSSDSDEDSSSLYEEGNQESEEDDSGPTARKQRKNQDDDDDDDDGFFGP ALPPGFKKQDDSPPRPIIGPALPPGFIKSTQKSDKGRDDPGQQETDSSEDEDIIGPMPAK GPVNYNVTTEFEKRAQRMKEKLTKGDDDSSKPIVRESWMTELPPEMKDFGLGPRTFKRRA DDTSGDRSIWTDTPADRERKAKETQEARKSSSKKDEEHILSGRDKRLAEQVSSYNESKRS ESLMDIHHKKLKSKAAEDKNKPQERIPFDRDKDLKVNRFDEAQKKALIKKSRELNTRFSH GKGNMFL >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_5|1104_bp atggggggctgcgactcagaggaaggctttgacccggctgcgggaagcgaggacactcat atctgtgaccagtgtccgccaccgcggatggcaagagacctgatcggaccggccctgccg cccggcttcaaggcccgcggaacagcggaggacgaagagcgggacccgagccctggacca gctctgccccctaattataaaagcagtagttcagattcatcagacagcgatgaagacagt agttctttgtacgaagaaggaaatcaagaatctgaagaagatgacagtggtccaactgca agaaaacagaggaaaaatcaggatgatgacgatgatgatgatgatgggttttttggacca gcccttcctcctggatttaaaaagcaggatgattctcctccaaggcccataataggtcct gcattgccacctggtttcattaaatctacacagaaaagtgacaagggcagagatgatcca ggacaacaggaaacagacagcagtgaagatgaggatattattggaccaatgcctgcaaaa ggaccagttaactataatgtaacgacagagtttgaaaaaagggcccagagaatgaaagaa aaactgaccaaaggagatgatgattcatctaaacccattgtaagagagtcatggatgact gaacttcctccagaaatgaaagactttggtcttgggccaaggacttttaagagaagagct gatgacacatctggagatcgatcaatctggacagatactccagctgatagggaaaggaaa gctaaggaaacacaagaagcaaggaagtcatccagtaagaaagatgaagaacatatatta tcaggaagagataagagactggctgagcaggtatcttcatacaatgaatcaaaaagatca gaatctcttatggacatacatcataaaaagttaaagagtaaggctgctgaagacaaaaat aagcctcaagagagaataccatttgaccgtgataaagatctcaaggttaatcggtttgat gaagctcagaaaaaagccctaataaaaaaatctagagaactaaacaccagattttcacac ggcaaaggcaatatgtttttataa >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_6|219_aa MPSQSEVLIPSIGLGLGPIDGWAVYFGQAAQEVPTISALDACKCPEEAFVCGLILIAASQ VFPPALSVYLLLPEELFGRSQAAAAEPPSRTPHENIGFMSPTGNVYRVWRMMAGAQGVGS EVEKAGSPRLIHVIIQFLTPQKAFISFITQFLGARNRTDAAGLKEQRNLSEGYWMACRNL RGPEPGTEALQPGTIAHKKALETLLCWTPQVPPLTTRDQ >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_6|660_bp atgcccagccaatcagaagttttgattccatctataggtttggggttgggcccaatagat ggttgggccgtctattttgggcaagctgcccaggaggtgcctaccatctctgctctggat gcttgcaaatgtcccgaagaggctttcgtatgtggtctaatactgatagctgcttctcag gtttttccgccagcactgtctgtgtacctgcttctgcctgaagagctgtttggaaggtca caagcagcagcagctgagccaccatcccgaactccacatgagaacatcggcttcatgagc cccacaggcaacgtgtacagggtctggagaatgatggctggggcacaaggtgtgggttca gaagtggagaaagcagggtcaccaaggttaatacatgtcattattcagttcctcacacct caaaaggcttttatttcatttatcactcagtttttaggtgcaaggaacagaaccgacgct gctggacttaaggagcaaaggaatttatcagaaggatattggatggcttgcaggaacctg agaggcccagagccaggcactgaggctctgcagccaggaacaattgcccacaagaaagcc ttagaaacacttctgtgctggaccccacaagttccaccactgacaacacgggatcaataa >gi568815585f:44889655_45128000|GENSCAN_predicted_peptide_7|393_aa MASNSLVLITVFHGVSEDCPCIAWPPSPCVSVEETGTLAPAVSAQQSAYGKAGAQDFLDE GVNCPYLCFVKFTVRTTASHLLNVLSCGSNHVISEYDIILPRSQFKISRKKNSTECQAVA PQTVDHMEQIQLPEVYSCNLWIQVATLERGNPDSCAGLILKRKLSKARSPSSLESSSHWN IHSREHMSEQVRDLDGCFGCFQEQAACRPGSSAQVGVPATPEAPEGMLQCSFSSAVCGQC VLAGKERRLFASQRGLSFVPDARSSALRLLGSLLHPARLHRLQTHGRARGTRLDRRQTEH RSVASQAARLTKEDDHPDRGGLVFSQGYMYGCTPLLEPPKKLSRRAYGRPGVLLLRGHVG HEVPHPVAVAKFIVLPGNELDKVVVEGNASPSI >gi568815585f:44889655_45128000|GENSCAN_predicted_CDS_7|1182_bp atggcgtccaatagcctagtgcttatcacagtttttcacggtgtgtctgaggactgcccg tgcattgcctggccccccagcccatgtgtgagtgtggaggaaacaggaaccctggctcct gctgtcagtgcccagcagagtgcttatggcaaagcaggggctcaggattttttggatgaa ggagtgaattgtccatatctgtgtttcgtaaagttcacagttaggacaactgcttcacac ttgttaaatgtgttaagttgtggttctaaccatgtgatatcagagtatgacataattctt cctaggtcccagtttaaaatttccagaaaaaagaattctacagagtgtcaagcagttgcc ccccaaactgtggatcatatggaacaaattcagctgccagaggtctactcctgtaaccta tggatccaggtggcaaccctcgaaagagggaatcctgatagctgcgcagggcttatcctg aaacgcaagctcagcaaggcaagaagcccatcctccttggagagctctagtcactggaac attcacagcagggagcacatgagtgagcaagtgcgggatctggatggctgctttgggtgc ttccaggagcaggctgcgtgcaggcctggcagcagtgcccaggttggggtgcctgcaacc cctgaagctccagagggcatgttacagtgttcttttagctctgccgtctgtggacagtgt gttctggcaggtaaggaacgccggctcttcgcctctcagcgcggcttgtcctttgttccg gacgcccgctcctcagccctgcggctcctggggtcgctgctgcatcccgcacgcctccac cggctgcagacccatggccgagcgcggggaactcgacttgaccggcgccaaacagaacac aggagtgtggctagtcaagctgcacgcttgaccaaagaggatgaccatccggatagagga ggactggtcttcagtcaagggtatatgtatggttgtactcccctgctagaacctccaaag aagctttcaagaagagcttatggcaggccgggggtcttactccttagaggccatgtgggc catgaggtcccccaccctgttgctgtagccaaattcattgtcttaccaggaaatgaactt gacaaagtggtcgttgagggcaatgccagccccagcatctaa