GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:36:44 Sequence gi568815585r:44841209_45089436 : 248228 bp : 42.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1824 2001 178 1 1 41 80 134 0.214 5.96 1.02 Intr + 3200 3371 172 2 1 -11 64 89 0.116 -4.38 1.03 Intr + 9839 9994 156 1 0 66 14 135 0.492 3.19 1.04 Intr + 10334 10526 193 1 1 32 34 168 0.529 3.94 1.05 Intr + 20961 21612 652 1 1 63 39 324 0.003 15.04 1.06 Intr + 26438 26604 167 2 2 98 32 68 0.050 0.88 1.07 Intr + 30874 31098 225 2 0 52 77 101 0.003 2.43 1.08 Intr + 31624 31835 212 2 2 110 25 75 0.019 1.11 1.09 Intr + 34088 34160 73 1 1 131 68 15 0.014 1.86 1.10 Intr + 35649 35940 292 1 1 -34 -7 327 0.004 6.27 1.11 Intr + 37109 37236 128 2 2 73 63 55 0.868 0.90 1.12 Intr + 40726 40949 224 2 2 86 74 182 0.747 13.52 1.13 Term + 49055 49153 99 1 0 60 47 93 0.076 -0.45 1.14 PlyA + 49399 49404 6 1.05 2.05 PlyA - 49601 49596 6 -0.45 2.04 Term - 50803 50522 282 1 0 76 39 179 0.515 6.24 2.03 Intr - 52725 52579 147 0 0 114 61 13 0.417 0.71 2.02 Intr - 62730 62511 220 2 1 113 82 170 0.681 16.28 2.01 Init - 69594 69545 50 0 2 79 97 47 0.577 5.17 2.00 Prom - 69664 69625 40 -7.55 3.00 Prom + 76241 76280 40 -8.15 3.01 Init + 76783 76925 143 0 2 59 57 122 0.474 5.86 3.02 Intr + 83449 83738 290 1 2 42 10 154 0.022 -1.03 3.03 Term + 89131 89369 239 2 2 54 47 197 0.677 7.55 3.04 PlyA + 91263 91268 6 1.05 4.07 PlyA - 93152 93147 6 1.05 4.06 Term - 100114 99998 117 1 0 67 48 142 0.984 5.66 4.05 Intr - 102466 102234 233 2 2 53 54 206 0.996 10.27 4.04 Intr - 108630 108514 117 1 0 102 115 6 0.941 4.22 4.03 Intr - 118366 118173 194 0 2 56 107 167 0.321 13.61 4.02 Intr - 124728 124636 93 2 0 87 94 87 0.306 7.36 4.01 Init - 128472 128453 20 0 2 54 83 31 0.320 -1.33 4.00 Prom - 129788 129749 40 -7.55 5.00 Prom + 130018 130057 40 -4.15 5.01 Init + 133294 133299 6 0 0 65 106 0 0.586 0.48 5.02 Term + 135077 135529 453 1 0 -46 43 381 0.958 14.37 5.03 PlyA + 138545 138550 6 1.05 6.00 Prom + 144794 144833 40 -6.45 6.01 Init + 148023 148076 54 2 0 86 105 68 0.050 9.64 6.02 Intr + 148414 148534 121 0 1 -10 107 165 0.030 7.75 6.03 Intr + 163103 163229 127 0 1 47 86 208 0.008 15.32 6.04 Intr + 164994 165095 102 0 0 64 54 159 0.465 8.47 6.05 Intr + 167587 167671 85 1 1 67 69 43 0.290 -0.80 6.06 Intr + 173744 173875 132 1 0 44 95 152 0.993 11.42 6.07 Intr + 174224 174388 165 1 0 83 100 123 0.999 12.24 6.08 Intr + 179122 179220 99 0 0 97 101 47 0.901 6.29 6.09 Term + 186577 186795 219 0 0 104 42 147 0.993 7.76 6.10 PlyA + 187032 187037 6 1.05 7.05 PlyA - 187105 187100 6 1.05 7.04 Term - 205533 205381 153 0 0 45 48 151 0.090 3.74 7.03 Intr - 211880 211756 125 0 2 68 46 94 0.066 2.58 7.02 Intr - 213461 213162 300 0 0 36 21 289 0.087 12.78 7.01 Intr - 241917 241830 88 0 1 77 116 84 0.614 8.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 66144 65898 247 1 1 73 75 149 0.929 7.70 S.002 Init + 147880 148076 197 0 2 77 105 117 0.912 10.75 S.003 Init + 163072 163229 158 0 2 52 86 223 0.989 17.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_1|923_aa XTQGPGIACPSSPSPYQVCGGLSPEIILPRMHIAASVTGPLGPSGGERSARMCEMGCAYS EGKLTREEPFSIRTEETTTGILGYSHDFPKMSTCLYEVFCFVLASLCVPSDCSETFKGTH PTNGNNETYATIMSTIMRMFQEAKSHKYFDLETEWSAGPTHHLMDEDNEHLLAIDKIQHC NDETPSYYRTLGLSDPGTRLCAAEQHHQDTEAPSLIPPLLLDVGPLEPLAISCLLEVLAM AIRQEKEIKGIQLGKEEVKLSLCADDMIVFLENPIVSAQNLLKLISNFSKVSGYKTSVQK SQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLDKIKEDTN KWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTLFTELEKTTLKFIWNQKRARIA KTILSKKNKAGGIMLPDFKLYYKATVTKTAWAGEEKAWITTNHVEVSGSRVLCLLLPPPT PPHLCAGPGTQSAALPTHSLHFMAFPSEKSEEHREAWIGAPKQGRAAALGRHCPPRIYWG KMCLEVPRAPLWVGQEEAVWGFKLLSARELEADCWDTRGWEGLQCGQSSYLLSPFGSQGP PPSGTAGHSLLDSPTDPRILHGLGRFAAVTPQSSWSLAVHQAIRTPCGKPAWVTFDNAHS LKWSIQGISRGQHLVSGAAERQGHLGEVLGLLLGGAVCKADCWRHSRHAQVLRVSTEEPR QLMRHTLAESDMEQDMGGHLSDTLCSGDHPRGGTNHARLVSNSDTVWQLRQLQARSAVPM QLSPVFLKCCECELPAQRTAAVGSTLGQCCSIRYQSTNSEVGPNIHSSTRTTAVFADKPQ CESTGRKKPFPKGNIGCKDIPLPTMKTVFESLNRAAMTASSHSLSIKQPIQEPLGDVTPV ATAGPWTLDSRANSRNWTSYVSP >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_1|2772_bp ngtacccagggccctggaattgcctgcccaagcagcccaagcccttaccaggtctgtgga ggcctctctccagagatcatcctcccaaggatgcacatcgctgccagtgtgactggacct ctaggcccaagtggtggcgagcgatcagctcggatgtgtgaaatggggtgtgcatactcg gaggggaagcttaccagagaggagcccttcagcatccgcacggaggagaccacaacagga attctaggttatagccatgacttcccaaagatgtccacgtgtctgtatgaagtgttttgt tttgttctggcatctctctgtgtgccatctgattgttctgaaacttttaagggtactcat ccgacaaatggtaataacgaaacatatgctaccatcatgtctaccatcatgaggatgttt caagaagccaagagtcataaatatttcgatctggaaactgagtggtcagctggccccacc catcaccttatggatgaggacaatgagcatctgttggccattgataagatacagcattgc aatgatgagaccccaagctactaccgcactttggggctctctgatccagggactcgcctc tgtgctgctgaacaacatcaccaagacacagaagccccctcactgatccctcctctcctt ctggatgttggccccctggagcctctggctatctcatgtttgttggaagttctggccatg gcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattg tccctgtgtgcagatgacatgattgtatttctagaaaaccccatcgtctcagcccaaaat ctccttaagctgataagcaacttcagcaaagtctcaggatacaaaaccagtgtgcaaaaa tcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctgggaatccaacttacaagggatatg aaggacctcttcaaggagaactataaaccactgctcgacaaaataaaagaggatacaaac aaatggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggccata ctgcccaaggtaatttatagattcaatgctatccccatcaagctaccaatgactctcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgcc aagacaatcctaagcaaaaagaacaaagctgggggcatcatgctacctgacttcaaacta tactacaaggctacagtaaccaaaacagcatgggctggagaagaaaaggcctggatcaca acaaaccatgtcgaggtgtcagggtcaagagtgctgtgtctgctcctgcccccaccaacc cctccacacctatgcgctggtcccggtactcagtcagccgcactaccaacccacagcctc cacttcatggcttttccaagtgagaagtctgaagagcacagggaagcctggataggtgct cccaagcaagggagagctgctgcattgggaaggcactgccctcccaggatctactggggc aaaatgtgtctagaagttcccagggccccactctgggtagggcaggaggaggctgtgtgg ggcttcaagctgctgtccgcccgagagctagaggcagactgctgggacacgaggggctgg gaaggtcttcagtgtggtcaatccagttatctcctttctcccttcggctcccaggggcca cctccctccgggacagctggacacagcctgcttgactctcccactgacccccgcatcctg cacggactgggaagatttgctgcagtgactccacagagctcgtggtcacttgctgtacac caggcaattagaactccctgtggaaaacctgcttgggtaacctttgacaatgcccactct ttgaaatggtccatccaagggatttcaagaggtcagcatcttgtgtcaggcgctgcggaa cgccagggacatctgggcgaagtcctgggcttgctccttggtggagcagtgtgcaaggca gattgctggagacacagcaggcatgctcaggtattgcgagtgagcaccgaggaaccgagg cagctcatgcggcacaccttggctgagtccgacatggaacaagatatgggcggtcatctt tcagataccctctgtagcggggaccatcccaggggaggaacaaaccatgcaagactcgtc tccaacagcgacaccgtgtggcagctacgccagctgcaggcgaggtcagcagtgccaatg cagctttctcctgtcttcctcaaatgctgtgagtgtgaacttcctgcacagagaacagct gcagtggggtccaccctgggacagtgctgctcaatcaggtatcagtcaacaaacagcgaa gttggacctaatatacacagctccacaagaaccacagcagtgtttgcagacaagccacag tgtgagtccacaggcagaaagaagccgtttcccaaaggaaacattggctgcaaagatata cctctgcctaccatgaagacagtctttgaatccttaaacagggcagcaatgactgcttcc agccatagcctctcaattaaacagcccattcaggaaccacttggcgatgtcaccccagtg gccactgcaggaccctggacactagactccagagctaacagcaggaactggacgtcatat gtgtcaccatga >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_2|232_aa MKEEGLSIATNCLPKDRTSPQSPPMMSPGNCWSFPSRGLGHSCFDSKSFNAKASVAVRNR EGSNAAGQLPPGSVVASLDEERKLVLELPLGLKACFTSCLLSRSLSILQRRHPRAITNAP GVLLLPGHAFLINVYIKTQWGMGQQFLLAGDVCQCLETFLVSQQQLKVGVVLASSGQTEI RDATKHPTTHTATPTTKNQPAPNVHSAEVETPCSTTANCVFILSDKCLGLYC >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_2|699_bp atgaaggaagaaggattgtctattgccacaaactgtttgccaaaagataggacttctccc cagtcccctcctatgatgtcacctggcaactgctggagcttcccaagcagaggattagga cattcttgttttgactccaagagcttcaacgcaaaggcctctgtggctgttaggaacaga gagggcagtaatgcggctggccagctgccaccagggtcagtggtggccagcctggatgaa gaaaggaagctcgtgcttgagctgcctctgggcctcaaggcatgcttcacctcctgcctc ctgtccagatccctgtcaatcctgcaacgtagacatccgagggccatcactaatgcaccg ggtgtcctccttctgcctgggcatgcatttttaattaatgtttatatcaagactcagtgg gggatggggcagcaatttctcctcgcaggggatgtttgccaatgtctggagacatttttg gtttcacagcagcagttgaaggtgggagtggtattggcatccagtgggcagacagagatc agggatgctactaaacatcctacaacgcacacagcaacccccacaacaaagaatcagcca gccccaaatgtccacagtgccgaggttgagacaccctgttctactacagcaaactgtgtt tttatcctttcagacaaatgcttaggtctatattgttag >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_3|223_aa MWETAIATPSLDKLSLLGRLPPTPSRSSVGVKPAPAPARSLQRFKPFSCNLLGIYTWAIN EDLRIHIAKAELLRVHPQPHSPASSLCIFSIPVNSSFILPIVPSQNPGITLDSILSQPTS NPSENPTGSIMKIDPELDHFSHSPGGVLEEGSCWLPVPKSDHDHTTSSRQQQLLPVFPPF PEAASPPPPHLSLLRDTSTYWLASPPQHEASGPAKRDLPSPEG >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_3|672_bp atgtgggagacggcaatagcgactccaagcctagacaaattgagtcttctcggtcggctt ccgcccactccatcgcgttcatccgtaggcgtcaaacctgctcctgcgcctgcgcggagt ctgcagcggtttaaaccgttcagctgcaaccttcttggcatctacacatgggcaattaat gaggacctcaggattcacatagctaaagctgagctcctgagagtccatccccaaccgcac tccccagccagctccctctgcatcttctccattccagttaatagcagctttattctcccc attgttccaagccagaaccctggaatcaccctggattccattctctctcagcctacctcc aacccctcagaaaatcctactggctccatcatgaaaatagacccagaattggatcacttc tcccattcccctggaggggtactagaggaagggagctgttggcttcctgttcctaagagt gaccatgatcacaccacctcatcccggcagcagcagttgcttccagtgtttccaccattt ccagaagcagcctcaccacctcctccccatctctctctgctcagagacaccagcacctac tggctggcatcccctcctcagcatgaggcatcggggccagccaagcgggaccttcctagc ccagagggctga >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_4|257_aa MRKVKKRNYPTLANIERKKKLKLEKEKRGAVLTTTQYGKMKGMSRHSQMAKIRSPGKNHK WKNDNSRQRAVTGSGSHLCDLKLEGPPEANADPLGVLINSDSESDKEEKPQHSVIPKEVT PALCSLMSSYGSLSGSESEPEETPIKTEADVLAENQVLDSSAPKSPSQDVKATVRNFSEA KSENRKKSFEKTNPKRKKDYHNYQTLFEPRTHHPYLLEMLLAPDIRHERNVILQCVRYII KKDFFGLDTNSAKSKDV >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_4|774_bp atgaggaaggtgaagaagagaaactatccaactctggccaatattgaaaggaagaagaag ttaaaacttgaaaaggagaagagaggagcagtattgacaacaacacaatatggcaagatg aaggggatgtccagacattcacaaatggcaaagatcagaagtcctggcaagaatcacaaa tggaaaaacgacaattctagacagagagcagtcactggatcaggcagtcacttgtgtgat ttgaagctagaaggtccaccggaggcaaatgcagatcctcttggtgttttgataaacagt gattctgagtctgataaggaggagaaaccacaacattctgtgatacccaaggaagtgaca ccagccctatgctcactaatgagtagctatggcagtctttcagggtcagagagtgagcca gaagaaactcccatcaagactgaagcagacgttttggcagaaaaccaggttcttgatagc agtgctcctaagagtccaagtcaagatgttaaagcaactgttagaaatttttcagaagcc aagagtgagaaccgaaagaaaagctttgaaaaaacaaaccctaagaggaaaaaagattat cacaactatcaaacgttattcgaaccaagaacacaccatccatatctcttggaaatgctt ctagctccggacattcgacatgaaagaaatgtgattttgcagtgtgttcggtacatcatc aaaaaagacttttttggactggatactaattctgcgaaaagtaaagatgtatag >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_5|152_aa MKKKEGRRRRRKEEEKKGGRRRRKEKGGGGGGEGEGEEEEEKEEEEEKRRRRKRKRRKRK KKKKNNNNKRRRKKKQPIDRKETSRTNKTFFKKSSYLGSGVMRRKPCIYKEATYRSQGTS DTVLQNSYVKLLHGLSPFYVAITEYIIRLDNL >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_5|459_bp atgaaaaagaaggaaggaaggagaaggagaagaaaggaggaggagaagaaaggaggaagg agaagaagaaaggagaaaggaggaggaggaggaggcgaaggtgaaggtgaggaggaggag gagaaggaagaggaagaagagaagaggaggaggaggaagaggaagaggaggaagaggaag aagaaaaagaagaataacaacaacaagaggaggaggaagaagaaacaaccaatagataga aaagaaacaagcagaacaaacaaaacattttttaaaaagtccagctacttggggagtggt gtcatgaggagaaaaccttgtatctataaagaagctacctacagaagccaaggaacttca gataccgtgcttcaaaattcctatgtcaaactacttcatggtctcagtccattttatgtt gctataactgaatacatcataagattggataatttataa >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_6|367_aa MGGCDSEEGFDPAAGSEDTHICDQCPPPRMARDLIGPALPPGFKARGTAEDEERDPSPGP ALPPNYKSSSSDSSDSDEDSSSLYEEGNQESEEDDSGPTARKQRKNQDDDDDDDDGFFGP ALPPGFKKQDDSPPRPIIGPALPPGFIKSTQKSDKGRDDPGQQETDSSEDEDIIGPMPAK GPVNYNVTTEFEKRAQRMKEKLTKGDDDSSKPIVRESWMTELPPEMKDFGLGPRTFKRRA DDTSGDRSIWTDTPADRERKAKETQEARKSSSKKDEEHILSGRDKRLAEQVSSYNESKRS ESLMDIHHKKLKSKAAEDKNKPQERIPFDRDKDLKVNRFDEAQKKALIKKSRELNTRFSH GKGNMFL >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_6|1104_bp atggggggctgcgactcagaggaaggctttgacccggctgcgggaagcgaggacactcat atctgtgaccagtgtccgccaccgcggatggcaagagacctgatcggaccggccctgccg cccggcttcaaggcccgcggaacagcggaggacgaagagcgggacccgagccctggacca gctctgccccctaattataaaagcagtagttcagattcatcagacagcgatgaagacagt agttctttgtacgaagaaggaaatcaagaatctgaagaagatgacagtggtccaactgca agaaaacagaggaaaaatcaggatgatgacgatgatgatgatgatgggttttttggacca gcccttcctcctggatttaaaaagcaggatgattctcctccaaggcccataataggtcct gcattgccacctggtttcattaaatctacacagaaaagtgacaagggcagagatgatcca ggacaacaggaaacagacagcagtgaagatgaggatattattggaccaatgcctgcaaaa ggaccagttaactataatgtaacgacagagtttgaaaaaagggcccagagaatgaaagaa aaactgaccaaaggagatgatgattcatctaaacccattgtaagagagtcatggatgact gaacttcctccagaaatgaaagactttggtcttgggccaaggacttttaagagaagagct gatgacacatctggagatcgatcaatctggacagatactccagctgatagggaaaggaaa gctaaggaaacacaagaagcaaggaagtcatccagtaagaaagatgaagaacatatatta tcaggaagagataagagactggctgagcaggtatcttcatacaatgaatcaaaaagatca gaatctcttatggacatacatcataaaaagttaaagagtaaggctgctgaagacaaaaat aagcctcaagagagaataccatttgaccgtgataaagatctcaaggttaatcggtttgat gaagctcagaaaaaagccctaataaaaaaatctagagaactaaacaccagattttcacac ggcaaaggcaatatgtttttataa >gi568815585r:44841209_45089436|GENSCAN_predicted_peptide_7|221_aa EGSCTAFMSCFNRGRRSQESQFLTASVEEEPIPTRESESTRSCFSSSLSTTEGTGQSGTS SLTLIDEKSGVWKKQVRYRADDGSDRSRIIDPVLSVVELVGSSTEVFLRLSCGQLFLAAE PQCLALGLSAIILQTLYTLPVGLMKPMFSCGVRDGGSAAAACDLGKSMVSKVHMLATFQT ALQAEAGTQTVLAEKPEKQLSVLDHIRKPLRDICKHPEQRW >gi568815585r:44841209_45089436|GENSCAN_predicted_CDS_7|666_bp gaaggcagctgtacggctttcatgagttgtttcaacagagggagaagaagccaggagagc caatttctgacggcaagtgttgaagaagaacccattcccactagagaatcagaaagtacc agatcttgcttttccagctccttgagcacaactgaaggcacaggccagtcagggacatct tccctaaccctgattgatgagaaatcaggggtttggaagaagcaggtgcggtacagagca gatgatggcagtgaccgtagccgcattattgatcccgtgttgtcagtggtggaacttgtg gggtccagcacagaagtgtttctaaggctttcttgtgggcaattgttcctggctgcagag cctcagtgcctggctctgggcctctcagccatcattctccagaccctgtacacgttgcct gtggggctcatgaagccgatgttctcatgtggagttcgggatggtggctcagctgctgct gcttgtgacctgggaaaatcaatggtttccaaggttcatatgcttgctaccttccaaaca gctcttcaggcagaagcaggtacacagacagtgctggcggaaaaacctgagaagcagcta tcagtattagaccacatacgaaagcctcttcgggacatttgcaagcatccagagcagaga tggtag