GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:13:36

Sequence gi568815576r:29631228_29848544 : 217317 bp : 50.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5524   5685  162  0  0   74   60   153 0.761  11.27
 1.02 Intr +   7863   7985  123  2  0   78   88    43 0.935   4.08
 1.03 Intr +  10975  11058   84  0  0  114  102    94 0.999  13.42
 1.04 Intr +  23430  23498   69  2  0  115   90    64 0.994   8.68
 1.05 Intr +  24367  24449   83  0  2   87   94    21 0.943   1.04
 1.06 Intr +  26962  27037   76  1  1  137   65   117 0.902  13.82
 1.07 Intr +  29978  30112  135  1  0   38   75   177 0.952  12.16
 1.08 Intr +  33763  33837   75  0  0   86  119    65 0.986   9.11
 1.09 Intr +  37106  37219  114  1  0  105   99   -21 0.715   1.34
 1.10 Intr +  40599  40721  123  2  0  135   20   162 0.999  14.88
 1.11 Intr +  42042  42259  218  2  2   55   78   394 0.998  32.40
 1.12 Intr +  43609  43714  106  1  1  136  105   173 0.998  24.02
 1.13 Intr +  46969  47096  128  0  2   61   53   132 0.974   6.48
 1.14 Intr +  50212  50374  163  1  1   87   75   175 0.972  16.08
 1.15 Intr +  62872  63021  150  0  0   77   49    74 0.291   2.76
 1.16 Term +  63525  63575   51  2  0  122   43    74 0.391   3.73
 1.17 PlyA +  67349  67354    6                                1.05

 2.00 Prom +  68958  68997   40                               -7.56
 2.01 Init +  71662  71736   75  0  0   86   49   102 0.768   7.19
 2.02 Intr +  75799  75932  134  0  2   76   80    77 0.731   5.14
 2.03 Intr +  79105  79312  208  1  1   87   40    82 0.219   2.38
 2.04 Intr +  83562  83743  182  2  2   46   67   115 0.526   3.87
 2.05 Intr +  89098  89359  262  1  1   88   72   296 0.161  25.49
 2.06 Intr +  89425  89904  480  0  0    8  -15   259 0.299   0.83
 2.07 Intr +  91514  91622  109  1  1   62   60    73 0.634   1.76
 2.08 Intr +  96435  96578  144  1  0   79   89   342 0.969  33.65
 2.09 Intr +  97403  97515  113  0  2   94   63   132 0.998  11.40
 2.10 Intr +  97828  97981  154  0  1   40   70   406 0.937  33.65
 2.11 Term +  98215  98342  128  2  2   30   42   314 0.934  19.44
 2.12 PlyA + 100580 100585    6                               -0.45

 3.06 PlyA - 100654 100649    6                               -0.45
 3.05 Term - 102798 102636  163  2  1   82   53    83 0.747   1.61
 3.04 Intr - 107214 107103  112  1  1  123   92    76 0.960  10.94
 3.03 Intr - 109503 109423   81  1  0   90   83   132 0.994  12.51
 3.02 Intr - 111253 111191   63  2  0  111  103    74 0.987   9.89
 3.01 Init - 117317 117191  127  2  1   99   90   335 0.986  35.12
 3.00 Prom - 118591 118552   40                               -4.26

 4.00 Prom + 126488 126527   40                               -2.26
 4.01 Init + 127708 127756   49  0  1   99  103    28 0.973   6.51
 4.02 Intr + 131109 131180   72  1  0   10  105    80 0.470   1.28
 4.03 Term + 131224 131297   74  2  2   53   49    82 0.807  -1.13
 4.04 PlyA + 131506 131511    6                                1.05

 5.00 Prom + 132547 132586   40                               -7.26
 5.01 Init + 136172 136321  150  1  0  101   60   399 0.952  38.44
 5.02 Term + 142276 142431  156  0  0  108   43   114 0.873   6.83
 5.03 PlyA + 143285 143290    6                                1.05

 6.21 PlyA - 143631 143626    6                                1.05
 6.20 Term - 157957 157786  172  0  1   96   53   203 0.920  14.90
 6.19 Intr - 159321 159242   80  0  2  109   84    24 0.933   2.45
 6.18 Intr - 161308 161206  103  0  1   88   77   144 0.878  13.48
 6.17 Intr - 162263 162133  131  2  2  102   94   318 0.977  33.29
 6.16 Intr - 162449 162350  100  1  1   93   80   215 0.925  21.31
 6.15 Intr - 169883 169764  120  1  0   98  105   185 0.999  20.81
 6.14 Intr - 170981 170767  215  2  2   75   75   380 0.999  32.91
 6.13 Intr - 173603 173411  193  1  1  -12   87   215 0.166  10.89
 6.12 Intr - 175326 175258   69  2  0   98  105   121 0.999  13.10
 6.11 Intr - 175677 175570  108  2  0  102  115    56 0.969   9.10
 6.10 Intr - 176958 176884   75  2  0   65   61    68 0.572   0.43
 6.09 Intr - 182315 182203  113  2  2   46  110    50 0.957   2.18
 6.08 Intr - 183540 183430  111  0  0   77   95    42 0.946   4.38
 6.07 Intr - 184846 184779   68  2  2  125   81    49 0.999   6.52
 6.06 Intr - 191237 191108  130  2  1   84   75   151 0.992  13.67
 6.05 Intr - 194030 193860  171  2  0   99   80   234 0.999  23.84
 6.04 Intr - 194553 194395  159  0  0   52   92   155 0.910  12.48
 6.03 Intr - 196961 196891   71  0  2   38   48    21 0.084  -8.00
 6.02 Intr - 201115 201018   98  0  2  101   93   121 0.615  13.55
 6.01 Intr - 207045 206951   95  0  2   28  105   180 0.542  12.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 173572 173411  162  1  0   53   87   184 0.813  14.97
S.002 Term - 175063 174943  121  0  1  103   42   157 0.905  10.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_1|619_aa
MKWKGKDLFDLVCRTLGLRETWFFGLQYTIKDTVAWLKMDKKVGLELDETGGADVLDHDV
SKEEPVTFHFLAKFYPENAEEELVQEITQHLFFLQVKKQILDEKIYCPPEASVLLASYAV
QAKYGDYDPSVHKRGFLAQEELLPKRVINLYQMTPEMWEERITAWYAEHRGRARDEAEME
YLKIAQDLEMYGVNYFAIRNKKGTELLLGVDALGLHIYDPENRLTPKISFPWNEIRNISY
SDKEFTIKPLDKKIDVFKFNSSKLRVNKLILQLCIGNHDLFMRRRKADSLEVQQMKAQAR
EEKARKQMERQRLAREKQMREEAERTRDELERRLLQMKEEATMANEALMRSEETADLLAE
KAQITEEEAKLLAQKAAEAEQEMQRIKATAIRTEEEKRLMEQKVLEAEVLALKMAEESER
RAKEADQLKQDLQEAREAERRAKQKLLEIATKPTYPPMNPIPAPLPPDIPSFNLIGDSLS
FDFKDTDMKRLSMEIEKEKVEYMEKSKHLQEQLNELKTEIEALKLKERETALDILHNENS
DRGGSSKHNTIKKSATCVHQPSLPAQATLRKPLPPAHGEKHLLSCHVMSKFASGPPGKPG
VEGLTLQSAKSRVAFFEEL

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_1|1860_bp
atgaagtggaaagggaaggacctctttgatttggtgtgccggactctggggctccgagaa
acctggttctttggactgcagtacacaatcaaggacacagtggcctggctcaaaatggac
aagaaggttgggctagaactcgatgaaactggtggggctgacgtactggatcatgatgtt
tcaaaggaagaaccagtcacctttcacttcttggccaaattttatcctgagaatgctgaa
gaggagctggttcaggagatcacacaacatttattcttcttacaggtaaagaagcagatt
ttagatgaaaagatctactgccctcctgaggcttctgtgctcctggcttcttacgccgtc
caggccaagtatggtgactacgaccccagtgttcacaagcggggatttttggcccaagag
gaattgcttccaaaaagggtaataaatctgtatcagatgactccggaaatgtgggaggag
agaattactgcttggtacgcagagcaccgaggccgagccagggatgaagctgaaatggaa
tatctgaagatagctcaggacctggagatgtacggtgtgaactactttgcaatccggaat
aaaaagggcacagagctgctgcttggagtggatgccctggggcttcacatttatgaccct
gagaacagactgacccccaagatctccttcccgtggaatgaaatccgaaacatctcgtac
agtgacaaggagtttactattaaaccactggataagaaaattgatgtcttcaagtttaac
tcctcaaagcttcgtgttaataagctgattctccagctatgtatcgggaaccatgatcta
tttatgaggagaaggaaagccgattctttggaagttcagcagatgaaagcccaggccagg
gaggagaaggctagaaagcagatggagcggcagcgcctcgctcgagagaagcagatgagg
gaggaggctgaacgcacgagggatgagttggagaggaggctgctgcagatgaaagaagaa
gcaacaatggccaacgaagcactgatgcggtctgaggagacagctgacctgttggctgaa
aaggcccagatcaccgaggaggaggcaaaacttctggcccagaaggccgcagaggctgag
caggaaatgcagcgcatcaaggccacagcgattcgcacggaggaggagaagcgcctgatg
gagcagaaggtgctggaagccgaggtgctggcactgaagatggctgaggagtcagagagg
agggccaaagaggcagatcagctgaagcaggacctgcaggaagcacgcgaggcggagcga
agagccaagcagaagctcctggagattgccaccaagcccacgtacccgcccatgaaccca
attccagcaccgttgcctcctgacataccaagcttcaacctcattggtgacagcctgtct
ttcgacttcaaagatactgacatgaagcggctttccatggagatagagaaagaaaaagtg
gaatacatggaaaagagcaagcatctgcaggagcagctcaatgaactcaagacagaaatc
gaggccttgaaactgaaagagagggagacagctctggatattctgcacaatgagaactcc
gacaggggtggcagcagcaagcacaataccattaaaaagagtgccacctgtgtgcaccag
ccctccctgcctgcacaagccacactgagaaagccgttgcctccagctcatggagagaaa
caccttctcagctgccacgtcatgagcaagtttgcttcgggaccacctgggaaaccagga
gtagaagggctcaccttgcagagcgccaagtcccgagtggccttctttgaagagctctag

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_2|662_aa
MGVTGPGYRRSGYGYSHGREPEWNQPFSRPQFPLCTSEQQFSKGSTPRSRLAPSQHRRDS
MMDEDPAGGSWQGHTVRPPSVLVSLDADRAHGSVWRILTWPQQLHQHPQPSSLCPALSCL
SFPTCWVAEAAATRYHVPMGVPGKATKLTWQHLDILMTPLSRGGNQSGDGVSTKFKVILG
LLITALPEPRVLLGEKEEGSRAPRPLQPPPGPPPAHEPRPQSLRRAGGRGASKMPFHPVT
AALMYRGIYTVPNLLSEQRPVDIPEDELEGECPPGSPPRRPSYLCAQVGAPASSCAPRQD
LSAPRGAAVGVAQAESGPAPARVVPSAAETCRAPQRGHGAAQSAVTVRQRGGLGRGSALS
PQRRPGRSTRILIPSAAPARGSLLALTFRSCRFKNSICSGSSRRGEGDCPALARCGAAPV
PTCGRSPFPAALRGPPARRAGKAAREAGGLGETQALCREDAIRGGATCARKRELGSPGWN
AADKIREAFKVFDRDGNGFISKQELGTAMRSLGYMPNEVELEVIIQRLDMDGDGQVDFEE
FVTLLGPKLSTSGIPEKFHGTDFDTVFWKCDMQKLTVDELKRLLYDTFCEHLSMKDIENI
IMTEEESHLGTAEECPVDVETCSNQQIRQTCVRKSLICAFAIAFIISVMLIAANQVLRSG
MK

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_2|1989_bp
atgggagtgactggcccaggataccgcaggagtggctatggctacagccatggccgggag
ccggagtggaaccagcccttctccaggcctcagtttcccttgtgcaccagcgagcagcag
ttttctaaaggttccaccccgaggagccggctggcccccagccagcatcgcagggacagc
atgatggatgaggatcccgcggggggcagctggcaaggacacaccgtccgtccgcccagt
gtcctggtctccttggacgcagacagagcccacggaagcgtctggcgcatcctgacttgg
ccgcagcagctgcaccagcacccccagccctcttccctgtgcccagcgctgtcctgcctg
tccttccccacctgctgggtggccgaagcagccgccacacgctaccatgtccccatgggt
gtgccgggcaaggccacaaagctgacctggcagcacctggatattttaatgaccccattg
tccagaggaggaaatcaaagtggagacggtgtaagcaccaagttcaaggtcatcctggga
ctgttgatcacagccctacctgagccgcgggtcctgctgggagagaaggaggaggggagc
cgcgcgccccgcccgctccagccgcccccggggccgccaccggcccatgagccccggcct
caaagtttgcggcgggcgggcgggcgcggagcctccaagatgccgttccacccggtgacg
gcggcgttgatgtaccggggcatctacaccgtgcccaacctgctgtcggagcagcgcccg
gtggacatcccggaggacgagctggagggtgagtgtccgccgggatccccgccccggcgg
ccctcctacctgtgcgcccaggtgggcgccccagctagcagctgtgccccgcggcaagac
ctgtccgcaccccggggcgccgcggtgggggtcgctcaggcggagagcggcccagcccct
gcccgcgtggtccccagcgctgcggaaacttgccgggccccgcagcggggtcacggggcc
gcgcagtcggcggtgacggtgcggcaacgcggcggactggggcgggggtccgcgctgagc
ccccagcgccggcccggccggagcacccgcatcctgatcccctccgcggcgcccgcccgc
ggctctctgctcgcattgacattccgctcgtgtcgctttaaaaattcaatctgctcgggc
agcagcagaaggggagagggcgactgccctgctcttgcccgctgcggggccgcccccgtc
cccacctgcggccgtagccccttccctgcagccctgcggggacccccagcccggcgcgcc
gggaaggcggcccgggaggcgggcggtctgggcgagacccaggccctctgccgggaggac
gccattcgcggaggagccacatgtgccaggaagagggagctgggcagcccgggatggaat
gctgcagacaagatccgagaggccttcaaggtgtttgaccgtgacggcaatggcttcatc
tccaagcaggagctgggcacagccatgcgctcactgggttacatgcccaacgaggtggag
ctggaggtcatcatccagcggctggacatggatggtgatggtcaagtggactttgaggag
tttgtgacccttctgggacccaaactctccacctcagggatcccagagaagttccatggc
accgactttgatactgtcttctggaagtgcgacatgcagaagctgacggtggatgagctg
aagcggctgctctacgacaccttctgcgagcacctgtccatgaaggacatagagaacatc
atcatgacggaggaggagagccacctgggcacagccgaggagtgtcccgtggatgtggag
acctgctccaaccagcagatccgccagacttgcgtgcgcaagagtctcatctgcgccttc
gccatcgccttcatcatcagtgtcatgctcattgcggccaaccaggtgctgcgcagtggc
atgaagtag

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_3|181_aa
MGKRYFCDYCDRSFQDNLHNRKKHLNGLQHLKAKKVWYDMFRDAAAILLDEQNKRPCRKF
LLTGQCDFGSNCRFSHMSERDLQELSIQVEEERRAREWLLDAPELPEGHLEDWLEKRAKR
LSSAPSSRTAAGKMQESGSCPHAQDKQVTREATSRVVPDTHLNVPKALAHVSGALAVCQC
V

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_3|546_bp
atggggaagcgatacttctgtgactactgcgaccgctccttccaggacaacctccacaac
cgcaagaagcacctgaacgggctgcagcacctcaaggccaagaaggtctggtacgacatg
ttccgagatgcagctgccatcttgctggatgagcagaacaagcggccctgcaggaagttt
ctactgacaggccagtgcgactttggctccaactgcagattttcccacatgtcagagcga
gacctgcaggagctgagcatccaggtggaggaggagaggcgagccagggagtggctacta
gatgctcctgagctccccgagggccatctggaggactggctggagaagagagccaagcgg
ctgagctcagccccaagtagcaggaccgcggctgggaaaatgcaagagagtggctcctgt
cctcacgcacaggacaaacaggtcaccagagaagctacgagcagagttgtgccagacacg
cacctcaatgtccccaaggctttggctcacgtgtctggggccctggcagtgtgccagtgt
gtgtga

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_4|64_aa
MEGSRDLVHSDEPARGGDPNPQAPERYGSMACQELGRTAGATPHCSHYRLSFVSCQIIGS
IRFS

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_4|195_bp
atggagggttccagagacctggttcactcggatgaaccagccaggggtggggaccccaac
ccccaggccccggaacggtacgggtccatggcctgtcaggagctgggccgcacagcagga
gccactccccattgctcgcattaccgcctgagcttcgtctcctgtcagatcatcggcagc
attagattctcatag

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_5|101_aa
MAAATLTSKLYSLLFRRTSTFALTIIVGVMFFERAFDQGADAIYDHINEGPDVIQPELDY
PESSLMEEDSGQETHGEVGETLEIVQFFLPEVYLCEDEVEA

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_5|306_bp
atggcggccgcgacgttgacttcgaaattgtactccctgctgttccgcaggacctccacc
ttcgccctcaccatcatcgtgggcgtcatgttcttcgagcgcgccttcgatcaaggcgcg
gacgctatctacgaccacatcaacgaggggcctgatgtgattcagccagagctggattat
ccagaatcttctctgatggaggaagattcaggtcaggaaacccatggagaggtaggagag
accctagaaattgtccagtttttcctacctgaggtgtatctttgtgaagacgaagttgaa
gcctag

>gi568815576r:29631228_29848544|GENSCAN_predicted_peptide_6|793_aa
FDPGSAGAQGAVTVVGGGGGGGGTEPVVEPPRRVTQHNASSAPGPTPDHPQGPEDRKAED
FTSAVLILECLEPWRKSACSGKPSLILQHPEQKADRYFVLYKPPPKDNIPALVEEYLERA
TFVANDLDWLLALPHDKFWCQVIFDETLQKCLDSYLRYVPRKFDEGVASAPEVVDMQKRL
HRSVFLTFLRMSTHKESKDHFISPSAFGEILYNNFLFDIPKILDLCVLFGKGNSPLLQKM
IGNIFTQQPSYYSDLDETLPTILQVFSNILQHCGLQGDGANTTPQKLEERGRLTPSDMPL
LELKDIVLYLCDTCTTLWAFLDIFPLACQTFQKHDFCYRLASFYEAAIPEMESAIKKRRL
EDSKLLGDLWQRLSHSRKKLMEIFHIILNQICLLPILESSCDNIQGFIEEFLQIFSSLLQ
EKRDETRTAYILQAVESAWEGVDRRKATDAKDPSVIEEPNGEPNGVTVTAEAVSQASSHP
ENSEEEECMGAAAAVGPAMCGVELDSLISQVKDLLPDLGEGFILACLEYYHYDPEQVINN
ILEERLAPTLSQLDRNLDREMKPDPTPLLTSRHNVFQNDEFDVFSRDSVDLSRVHKGKST
RKEENTRSLLNDKRAVAAQRQRYEQYSVVVEEVPLQPGESLPYHSVYYEDEYDDTYDGNQ
VGANDADSDDELISRRPFTIPQVLRTKVPREGQEEDDDDEEDDADEEAPKPDHFVQDPAV
LREKAEARRMAFLAKKGYRHDSSTAVAGSPRGHGQSRETTQERRKKEANKATRANHNRRT
MADRKRSKGMIPS

>gi568815576r:29631228_29848544|GENSCAN_predicted_CDS_6|2382_bp
tttgaccccggaagtgcgggcgctcagggagctgtcaccgtggtcggcggcggcggcggc
ggcggcggcacagagccggtggtggagccgccgaggagggtcacgcagcacaatgccagc
tctgcccctggaccaactccagatcacccacaaggacccgaagacaggaaagctgaggac
ttcaccagcgctgtccttatcctcgaatgcttggagccttggaggaagtctgcttgctct
gggaagccgtcccttatcctccagcaccccgagcagaaggcagaccggtattttgtgtta
tacaaaccgccccctaaagacaacattcccgccctagtggaggagtacctggaacgcgcc
accttcgtagccaatgacctcgactggctcctggccttgcctcacgataaattctggtgc
caggtgatctttgacgagactctacagaagtgcctggactcctacctgcgctatgtcccc
cgcaaattcgacgagggggtggcctcagcccctgaggttgttgacatgcagaagcgcctc
catcgaagtgtttttctcaccttcctccgcatgtccactcacaaggaatccaaagatcac
ttcatttccccttctgcgtttggagaaatcctctacaataacttcctctttgacattcca
aagatcctggacctctgcgtgctctttggaaaaggcaactcaccactgctccagaagatg
ataggaaacatctttacacagcagccaagttactacagtgacctggatgaaaccctgcct
accatccttcaggtcttcagcaatatcctccagcactgtggtttgcaaggggacggggcc
aataccacaccccagaagcttgaggagaggggccgattgacccccagtgacatgcctctc
ctggaattaaaggacattgttctctacctttgtgatacctgcaccacactttgggccttt
ctggatatcttccctttggcttgccagaccttccagaagcacgacttttgttacagacta
gcttccttctacgaagcagcaattcccgaaatggagtctgcaattaagaagaggaggctt
gaagatagcaagcttcttggtgacctgtggcagaggctctcccattccaggaagaagcta
atggagattttccacatcatcctgaaccagatctgcctccttcccatcctagaaagcagc
tgtgacaacattcagggcttcatcgaagagttccttcagatcttcagctccttgctgcag
gagaagagggacgagacgcggactgcctacatcctccaggcagtcgagagtgcatgggaa
ggggtggacagacggaaagccacagatgctaaagacccatcggtgattgaggagcctaat
ggggagcctaacggggtcacggtgacagcagaggcagtcagtcaagcatcatcacatccg
gagaactcggaggaagaggagtgcatgggagcagccgcggctgtgggccctgccatgtgt
ggggtggaactggactctctcatctcccaagtgaaggacctgctgccagaccttggtgag
ggcttcatcctggcctgcctggagtactaccactacgacccagagcaggtgatcaacaat
atcctggaggagcggctggcccccaccctcagccagctggaccgcaacctagacagagaa
atgaaaccagaccctacacccctgctgacgtctcgccacaacgtcttccagaatgacgag
tttgatgtgttcagcagggactcagtagacctgagccgggtgcacaagggcaagagcacc
aggaaggaggaaaacacgcggagtttgctgaacgacaagcgtgcagtggcggcacagcgg
cagcgctacgagcagtacagcgtggtggtggaggaggtgccactgcagccaggcgagagc
ctgccctaccacagtgtctactacgaggatgagtacgatgacacatacgatggcaaccag
gtgggcgccaatgatgcagactctgatgacgagctcatcagccgcaggccattcaccatc
cctcaggtgctgagaaccaaagtgcctagagaagggcaggaggaggatgacgacgatgag
gaagacgatgctgacgaggaggctcccaagcccgaccattttgttcaggaccctgcagtg
ctgagagagaaggcagaagccaggcgcatggcctttctcgccaagaaagggtaccggcat
gacagctcaacagcagtggccggcagcccccgaggccatgggcagagccgcgagacaacc
caggaacgcaggaagaaggaagccaacaaggcgacaagagccaaccacaaccggagaacc
atggccgaccgcaagaggagcaaaggcatgatcccatcctga