GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:03:26 Sequence gi568815595r:195468903_195679461 : 210559 bp : 45.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 2379 2328 52 1 1 111 71 55 0.020 4.07 1.03 Intr - 50283 50116 168 1 0 62 72 152 0.886 10.92 1.02 Intr - 54884 54790 95 1 2 46 53 124 0.999 4.31 1.01 Init - 55990 55917 74 1 2 62 100 100 0.932 9.14 1.00 Prom - 59428 59389 40 -3.56 2.05 PlyA - 59567 59562 6 1.05 2.04 Term - 69171 69163 9 0 0 98 43 0 0.173 -5.31 2.03 Intr - 74264 74002 263 0 2 7 83 388 0.566 27.41 2.02 Intr - 81301 81232 70 1 1 59 86 48 0.542 0.45 2.01 Init - 96957 96727 231 0 0 48 98 115 0.698 6.76 2.00 Prom - 97600 97561 40 -5.06 3.05 PlyA - 98252 98247 6 1.05 3.04 Term - 100233 99998 236 1 2 102 41 281 0.843 21.28 3.03 Intr - 102463 102375 89 0 2 74 78 68 0.991 4.01 3.02 Intr - 105069 104948 122 0 2 78 105 100 0.995 9.99 3.01 Init - 110559 110437 123 0 0 93 93 201 0.980 19.29 3.00 Prom - 121235 121196 40 -4.66 4.00 Prom + 139735 139774 40 -3.36 4.01 Sngl + 149755 151215 1461 0 0 60 49 1046 0.285 92.63 4.02 PlyA + 152236 152241 6 1.05 5.10 PlyA - 153974 153969 6 1.05 5.09 Term - 156052 155779 274 0 1 77 43 118 0.236 1.14 5.08 Intr - 157460 157357 104 2 2 53 94 35 0.413 -0.43 5.07 Intr - 157879 157748 132 1 0 102 76 42 0.613 5.24 5.06 Intr - 159605 159470 136 1 1 12 90 95 0.197 2.67 5.05 Intr - 160371 160186 186 2 0 125 -9 118 0.205 4.60 5.04 Intr - 178020 177820 201 2 0 95 46 203 0.524 15.20 5.03 Intr - 179607 178612 996 2 0 71 -66 548 0.106 27.63 5.02 Intr - 180191 179737 455 2 2 27 -15 299 0.195 5.56 5.01 Init - 181541 181200 342 2 0 66 -34 265 0.178 9.34 5.00 Prom - 183958 183919 40 -8.86 6.00 Prom + 185421 185460 40 -5.66 6.01 Init + 186232 186473 242 0 2 74 86 175 0.494 13.25 6.02 Intr + 189146 189263 118 2 1 42 101 122 0.415 9.37 6.03 Intr + 193669 193716 48 0 0 83 54 51 0.471 0.08 6.04 Intr + 194722 194865 144 0 0 56 49 240 0.951 17.38 6.05 Intr + 195184 195348 165 0 0 115 86 70 0.998 9.66 6.06 Intr + 197164 197312 149 0 2 97 53 129 0.608 9.33 6.07 Intr + 198416 198641 226 2 1 16 67 95 0.518 -1.81 6.08 Intr + 199559 199739 181 1 1 58 91 188 0.921 15.54 6.09 Intr + 202230 202398 169 1 1 60 100 146 0.975 12.00 6.10 Intr + 203582 203777 196 2 1 81 64 296 0.982 25.92 6.11 Intr + 204916 205085 170 0 2 133 119 92 0.989 15.54 6.12 Term + 205918 205999 82 1 1 110 39 74 0.460 1.87 6.13 PlyA + 206221 206226 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 85812 86063 252 2 0 51 47 182 0.932 5.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_1|130_aa MGDDEDACSDTEATEAMAPDILARKLAAAEGLEPKYRIQEQESSGEEDSDLSPEEREKKR QFEMKRKLHYNEGLNIKLARQLISKDLHDDDEDEEMLETADGESMNTEESNQDLDTKQTC WIVKVRKDAS >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_1|390_bp atgggggatgatgaagatgcctgtagtgacaccgaggccactgaagccatggcgccagac atcttagccaggaaattagctgcagctgaaggcttggagccaaagtatcggattcaggaa caagaaagcagtggagaggaggatagtgacctctcacctgaagaacgagaaaaaaagcga caatttgaaatgaaaaggaagcttcactacaatgaaggactcaatatcaaactagccaga caattaatttcaaaagacctacatgatgatgatgaagatgaagaaatgttagagactgca gatggagaaagcatgaatacggaagaatcaaatcaagatctggacacaaagcaaacttgc tggattgtcaaggtacgcaaggatgcaagn >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_2|190_aa MCPPELATANPQAEQKPLTSMASLDGNDKLRQSNKSLTLSSLERGSLKALVLQGHVACSG HYRPCTTEVMCAEQKRRPRERKEGLESCLENHLIKTPDNTAVSEAAVAAALALSGICGCL RVSAVPTLLFADPTPSSDPEPTAGAPGNGGLDGLAPAHQGDLEEQDLYDFLYGGVGRTAP RECRRGAEGY >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_2|573_bp atgtgtccacctgagctagcaacagccaatccacaggctgaacagaaacctctgacttct atggcctcgctggatggaaatgataaactgagacaatcaaataagagtttgactttaagc agtctggaaagaggatcattgaaagctctggtactgcaaggtcatgtggcttgtagtggc cactaccggccatgcactactgaagtgatgtgtgcggaacagaaacgcaggccacgggag cgcaaagagggtcttgagtcctgtctagagaaccacttgataaaaacacctgataacaca gccgtttccgaggcagcagttgcggccgctttagccctgagcgggatctgcggctgcctg cgagtctctgctgtgccgacccttctcttcgcggaccccacgccaagcagcgaccctgag ccgacagccggagcgcccggcaatggcggcctcgacggcctcgcaccggcccatcaaggg gatcttgaagaacaagacctctacgacttcctctatggtggcgtcggccgaacagccccg cgggaatgtcgacgaggagctgagggatactga >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_3|189_aa MVMLLLLLSALAGLFGAAEGQAFHLGKCPNPPVQENFDVNKYLGRWYEIEKIPTTFENGR CIQANYSLMENGKIKVLNQELRADGTVNQIEGEATPVNLTEPAKLEVKFSWFMPSAPYWI LATDYENYALVYSCTCIIQLFHVDFAWILARNPNLPPETVDSLKNILTSNNIDVKKMTVT DQVNCPKLS >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_3|570_bp atggtgatgctgctgctgctgctttccgcactggctggcctcttcggtgcggcagaggga caagcatttcatcttgggaagtgccccaatcctccggtgcaggagaattttgacgtgaat aagtatctcggaagatggtacgaaattgagaagatcccaacaacctttgagaatggacgc tgcatccaggccaactactcactaatggaaaacggaaagatcaaagtgttaaaccaggag ttgagagctgatggaactgtgaatcaaatcgaaggtgaagccaccccagttaacctcaca gagcctgccaagctggaagttaagttttcctggtttatgccatcggcaccgtactggatc ctggccaccgactatgagaactatgccctcgtgtattcctgtacctgcatcatccaactt tttcacgtggattttgcttggatcttggcaagaaaccctaatctccctccagaaacagtg gactctctaaaaaatatcctgacttctaataacattgatgtcaagaaaatgacggtcaca gaccaggtgaactgccccaagctctcgtaa >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_4|486_aa MTTDDTEVPAMTLAPGHAALETQTLSAETSSRASTPAGPIPEAETRGAKRISPARETRSF TKTSPNFMVLIATSVETSAASGSPEGAGMTTVQTITGSDPREAIFDTLCTDDISEEAKTL TMDILTLAHTSTEAKGLSSESSASSDGPHPVITPSRASESSASSDGPHPVITPSRASESS ASSDGPHPVITPSRASESSASSDGPHPVITPSRASESSASSDGLHPVITPSRASESSASS DGLHPVITPSRASESSASSDGPHPVITPSWSPGSDVTLLAEALVTVTNIEVINCSITEIE TTTSSIPGASDTDLIPTEGVKASSTSDPPALPDSTNTKPHITEVTASAETLSTAGTTESA APDATIGTPLPTNSTIEREVTAPGATTLSGALATGNPLEETSALSVETPSYVKVSGAAPV SIEAGSAVGKTTSFAGSSASSYSPLEAALKNFTPSETLTTDIATKGPFPTSRAPLPSVPP TTTNSS >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_4|1461_bp atgacaacggacgacacagaagtgcccgctatgactctagcaccgggccacgccgctctg gaaactcaaacgctgagcgctgagacctcttctagggcctcaaccccagccggccccatt ccagaagcagagaccaggggagccaagagaatttcccctgcaagagagaccaggagtttc acaaaaacatctcccaacttcatggtgctgatcgccacctccgtggagacatcagccgcc agtggcagccccgagggagctggaatgaccacagttcagaccatcacaggcagtgatccc agggaagccatctttgacaccctttgcaccgatgacatctctgaagaggcaaagacactc acaatggacatattgacattggctcacacctccacagaagctaagggcctgtcctcagag agcagcgcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagc agcgcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagc gcctcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagcgcc tcttccgacggcccccatccagtcatcaccccgtcacgggcctcagagagcagcgcctct tccgacggcctccatccagtcatcaccccgtcacgggcctcagagagcagcgcctcttcc gacggcctccatccagtcatcaccccgtcacgggcctcagagagcagcgcctcttccgac ggcccccatccagtcatcaccccctcatggtccccgggatctgacgtcactctcctcgct gaagccctggtgactgtcacaaacatcgaggttattaattgcagcatcacagaaatagaa acaacgacttccagcatccctggggcctcagacacagatctcatccccacggaaggggtg aaggcctcgtccacctccgatccaccagctctgcctgactccactaacacaaaaccacac atcactgaggtcacagcctctgccgagaccctgtccacagccggcaccacagagtcagct gcacctgatgccacgattgggaccccactccccaccaacagcaccatagaaagagaagtg acagcacccggggccacgaccctcagtggagctctggccacagggaatcccctggaagaa acctcagccctctctgttgagacaccaagttacgtcaaagtctcaggagcagctccggtc tccatagaggctgggtcagcagtgggcaaaacaacttcctttgctgggagctctgcttcc tcctacagccccttggaagccgccctcaagaacttcaccccttcagagacactgaccacg gacatcgcaaccaaggggcccttccccaccagcagggcccctcttccttctgtccctccg actacaaccaacagcagctga >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_5|941_aa MVPERKNQEKESDDASTVNEETSEENNEMEESDVSQAEKDLLHSEGSENEGPESSGSSDC RETEELVGSNSSKTGEILSESSMDNDDEATEVTDEPLEQDYLETFTCSILHTVLGPKTRT SAPAWVASSTTLKLEHQIPTGLQLQGPQTGTSAPHQISSCTDLKLEHQLPAGSPAALPAN WNMSSLPGLQLHGPQTGTSAPHQIASCTALKLEYQLHPGAPGAQPSTCNISSPLGLQMNG PQPATSAPHRVSRCMALKLEHQLPTGCTASNCNISSPLGLQQHGPQPGTSAPPNPGLQLH SPQPATLATNWVSRCMALKLEHQLHPRYPAAQPSNWNISSLPGLQVHGPQTGTSAPHQVS SRTALILEHQLPTRSPAAQLSNRNISSPQGLQLHGSQTRTSAPHRVSSCTALNLQHWLPT GSPDARPSNCNISSPRAYSCMALNWNISSPLGLQEHGPQTGTSAPCQVTSCMALKLEHHL PARSPAAWPSNCNISSHQSLQLHGHQTGTSAPPRVSSCTDLKLEHQLPAGSSTAWPSNWN ISSTPGSPVARPYNWNISFPLGLRLHSPTTGTSAPCRVSSCTALKLEHQLPAEFKLFQCT AIKLEHQLPGPQTGTSAPRRISSCTAVNISSSPSPQLHDPQVRTSASPQVFSCVTLNLEH QFLYRAVCLGPGVRNGQFFNTVSGSGRPEGSQALSPWRAFCGLSQGSRQEPGENRCVCSR RALMPLRDTERPTVCFAGTVQCSQGKSCIRHAGAVAAPAENQSLIFGVRKVEAPSPNTIA LGVRISTDEFRGTRTFKPSQSPTHQGESVWREDIREPPGLWSPWSVVLAYGSPTAPVRWE ETADTMRRHHTLRRKPPGGLWGLRSVPGVPGSCLNAHSGSCPAASAPRPRPRAGSAPWCR LGSVQGRAGCAEHLQAPPWVTEDRGADQVFYRSVGDQQLDA >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_5|2826_bp atggtcccagaaagaaaaaaccaagaaaaagaatctgatgatgcctcaactgtgaatgaa gagacttctgaggaaaataatgaaatggaggaatctgatgtgtctcaagctgagaaagat ttactacattctgaaggtagtgaaaacgaaggccctgaaagtagtggttcttctgactgc cgtgaaacagaagaattagtaggatccaattccagtaaaactggagagattctttcagaa tcatccatggataatgatgacgaagccacagaagtcaccgatgaaccactggaacaagac tatttagaaacatttacatgcagtattttacacacagttctgggccctaaaactagaaca tcagctcccgcctgggtcgccagcagcaccaccctcaaactggaacatcagatccccacg ggtctccagctgcagggccctcaaactggaacatcagctccccaccagatctccagctgc acggacctcaaactggaacatcagctccccgccgggtctccagctgcactgcctgcaaac tggaacatgagctccctgcccggtctccagctgcatggccctcaaactggaacatcagct ccccaccagattgccagctgcacggccctcaaactggaatatcagctccaccccggggct ccaggtgcacagccctcaacctgcaacatcagctccccactgggtctccagatgaatggc cctcaacctgcaacatcagctccccaccgggtctccagatgcatggccctcaaactggaa catcagctccccaccggctgcaccgcctcaaactgcaacatcagctccccgctgggtctc cagcagcatggccctcaacctggaacatcagctccccccaacccgggtctccaactccac agccctcaacctgcaacactggctaccaactgggtctccagatgcatggccctcaaactg gaacatcagctccacccccggtatccagctgcacagccctcaaactggaacatcagctcc ctgccgggtctccaggtgcacggccctcaaactggaacatcagctccccaccaggtctcc agccgcacggccctcatactggaacatcagctccccaccagatctccagctgcacagctc tcaaacaggaacatcagctccccacagggtctccagctgcacggctctcaaacaagaaca tcagctccccacagggtctccagctgcacggccctcaacctgcaacactggctccccacc gggtctcccgatgcacggccctcaaactgcaacatcagttccccccgggcatacagctgc atggccttaaactggaacatcagctccccgctaggtctccaggagcacggtcctcaaact ggaacatcagctccctgccaggtcaccagctgcatggccctcaaactggaacatcacctc cccgccaggtctccagctgcatggccctcaaattgcaacatcagctcccatcagagcctc cagctgcatggccatcaaactggaacatcagctcccccgcgggtctccagctgcacagac ctcaaacttgaacatcagctccccgccgggtcatcaactgcatggccctcaaactggaac atcagctccacccctgggtctccagtagcacggccctacaactggaacatcagcttcccc ctgggtctccggctgcacagccctacaaccggaacatcagctccctgccgggtctccagc tgcacagccctcaaactggaacatcagctccccgctgagttcaaactattccagtgcacg gccatcaaactggaacatcagctccccggccctcaaaccggaacatcagctccccgccgg atctccagctgcacagctgtcaacatcagctcctccccgagtcctcagctgcacgaccct caagttagaacatcagcttctccccaagtcttcagctgcgtgaccctcaatctagaacat cagttcctctacagagccgtgtgtctagggcccggggtgcggaacgggcagttcttcaac acagtgagcggcagcgggcgtcccgaaggttctcaggccttgtctccgtggagggcattt tgtggcctctcccagggcagccggcaggagccaggcgagaacagatgcgtctgtagcagg agggcgttgatgcctctgagagatacagagcgacccaccgtctgctttgctgggactgtc cagtgttcgcagggaaagtcctgcatccggcacgctggggcggtggctgcaccagctgag aaccagtctctgattttcggggtgagaaaagtagaggccccatctcctaacaccatcgcc ttgggggtgaggatttcaacagatgaatttcggggaacacggacattcaaaccctcacaa agccccacccaccaaggagagtctgtctggagagaagacattagggagccaccagggttg tggagcccctggtctgtggtgctggcttatggcagccctactgccccagtgagatgggaa gagacagcagacacaatgagaagacaccacacgctcaggcggaaaccccccgggggcctc tgggggctgcggtcggtgccaggggtcccgggcagctgcctgaacgcgcacagcggctcc tgccccgcagcctccgccccgcgcccgcgtcctcgggccggcagcgccccctggtgccgc ctcgggtctgtgcagggccgggcgggctgcgcggagcacctgcaggcgcctccatgggtg acggaagacagaggggctgaccaggtcttctacaggtcagtgggcgatcaacagctggac gcgtag >gi568815595r:195468903_195679461|GENSCAN_predicted_peptide_6|629_aa MEKLMGPPMDQDSASTKTCPETHRKSGPRSREQHGALGAANAHDTVRDGERYDRRSRPRR REQHGALGAANAHDTVRDGERRSLRRDWRDCAAATTDVSGVRGLSRLPSARRLALALAKA ISAQYPVVDHEFDAVVVGINAALGNMEEDNWRWHFYDTVKGSDWLGDQDAIHYVTEQAPT AMVEVENYGMPFSRTEDGKIYQRAFGGHSLKFGKGRQAHRCCCVADRTGHSILHTLYGRS LRYDTSCFVEYFALDLLMENGECRGVFALCIQDGSIHRIRAKNTIVATGYENEGNPVMCD NVDEPKVIMLSETTQAQKDRYCMSLICGSKTVTTHRNRIGQWLPGAGGMQTVAMLIKGVT SRHKTGGPDVGHCVQSLLSVVSIGYGRTYLSCTSAHTSTSDGTAMITRAGLPCQDLEFVQ FHPTGTYGAGCLITEGCRGEGGILINSQGERFMERYAPIAKDLASRDVVSRWMTLEIREG RGCGPEKDHVYLQLHHLPPEQLAMPLPGISETAMIFAGVDVTKEPIPVLPTVHYNMDGIP TSYEGQVLRHGNGQDQIVPSLYACGEAACASAHGVNRLGANSLLDLVVWSGMCPEHRRVV QAWIASNNVQEKEYRVWECESYVHEEQDS >gi568815595r:195468903_195679461|GENSCAN_predicted_CDS_6|1890_bp atggagaagctcatgggacccccgatggaccaggacagtgccagcactaagacgtgccct gaaactcacaggaagagcggaccaagaagccgggaacagcacggggcactgggagctgca aacgcccacgatactgtgagagacggagaaaggtatgacaggaggagcagaccaagaaga cgggaacagcacggcgcactgggagctgcaaatgcccacgataccgtgagagatggagaa aggcgcagtctgcgcagggactggcgggactgcgcggcggcgactacagacgtgtcgggg gtccggggcctgtcgcggttgccaagcgctcggcgcttggcgctggcgctggccaaggcg atttctgctcagtatccagtagtggatcatgaatttgatgcagtggtggttggaatcaat gctgctctggggaacatggaggaggacaactggaggtggcatttctatgacaccgtgaag ggctccgactggctgggggaccaggatgccatccactacgtgacggagcaggcccccact gccatggtcgaggtagaaaattatggcatgccgtttagcagaactgaagatgggaagatt tatcagcgtgcatttggcggacacagcctcaagtttggaaagggcaggcaggcccatcgg tgctgctgtgtggctgatcggaccggccactcaatattgcacaccttatatgggaggtct ctgcgatatgataccagctgttttgtggagtattttgccttggatctcctgatggagaat ggggagtgccgtggtgtcttcgcactgtgcatacaggacgggtccatccatcgcataaga gcaaagaatactattgttgccacaggctatgaaaacgaaggaaatcctgtcatgtgtgac aacgtggatgaacccaaagtcattatgttaagtgaaacgacccaggcacagaaagacaga tactgcatgtcactcatatgtggatctaaaactgtcacaactcacagaaacagaatagga cagtggttgccaggggctgggggaatgcagactgtggcgatgctgattaaaggtgtaact tcccgtcacaagacaggaggtccggacgtgggccactgtgtgcagtcactgctctctgtt gtttccataggctacgggcgcacctacttgagctgcacgtctgcccacaccagcaccagc gacggcacggccatgatcaccagggcaggccttccttgccaggacctcgagtttgttcag ttccaccccacaggcacatatggtgctggttgtctcattacggaaggatgtcgtggagag ggaggcattctcattaacagtcaaggcgaaaggtttatggagcgatacgcccccatcgcg aaggacctggcgtctagagatgtggtgtctcggtggatgactctggagatccgcgaagga agaggctgtggccctgagaaagatcacgtctacctgcagctgcaccacctacctccagag cagctggccatgcccttgcccggcatttcagagacagccatgatcttcgctggtgtggac gtcacgaaggagccgatccctgtcctccccaccgtgcattataacatggacggcattccc accagctacgaggggcaggtcctgaggcacgggaatggccaggatcagattgtgcccagc ctgtacgcctgtggggaggccgcctgtgcctctgcacatggtgtcaaccgcctcggggca aactcgctgttggacctggttgtctggtcaggcatgtgccctgagcatcgcagagtcgtg caggcctggatagcgtcaaataatgtgcaggaaaaggaataccgtgtgtgggagtgtgag tcttatgtgcacgaagaacaggacagttag