GENSCAN 1.0    Date run: 6-Nov-116    Time: 15:04:17

Sequence gi568815587r:77519631_77737714 : 218084 bp : 40.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    916   1126  211  0  1   77   44   147 0.136   8.19
 1.02 Intr +  21934  22025   92  2  2   81   66    55 0.013   1.39
 1.03 Intr +  31943  32131  189  1  0   99   33   101 0.800   4.56
 1.04 Intr +  32770  32848   79  0  1   78  110   119 0.522  11.31
 1.05 Term +  34584  34741  158  1  2   98   49   127 0.994   6.91
 1.06 PlyA +  34885  34890    6                               1.05

 2.00 Prom +  36686  36725   40                              -5.45
 2.01 Init +  40067  40225  159  1  0   77   57   100 0.476   5.67
 2.02 Intr +  41178  41319  142  2  1   51   86    78 0.513   2.91
 2.03 Intr +  48134  48186   53  0  2   65   85    24 0.369  -2.49
 2.04 Intr +  48763  49173  411  0  0   73   40   261 0.063  13.26
 2.05 Intr +  49348  49372   25  0  1   94   61    21 0.061  -3.32
 2.06 Intr +  51885  52055  171  1  0   85   91    91 0.075   8.09
 2.07 Intr +  53237  53485  249  0  0  -16   19   229 0.060   2.09
 2.08 Term +  54587  54762  176  0  2   58   45   168 0.524   6.44
 2.09 PlyA +  55292  55297    6                               1.05

 3.00 Prom +  56231  56270   40                              -1.75
 3.01 Init +  70363  71036  674  0  2   90   87   515 0.565  44.30
 3.02 Intr +  86053  86143   91  1  1   55   96    43 0.070   0.88
 3.03 Term +  92081  92194  114  1  0  118   48    62 0.673   2.79
 3.04 PlyA +  92940  92945    6                               1.05

 4.09 PlyA -  96544  96539    6                               1.05
 4.08 Term - 100065  99998   68  1  2   45   48   112 0.544   0.02
 4.07 Intr - 103043 102870  174  0  0   64  113   109 0.987   9.99
 4.06 Intr - 105440 105333  108  0  0   23   49   173 0.979   6.24
 4.05 Intr - 106188 106087  102  1  0   61   67   181 0.992  12.53
 4.04 Intr - 110269 110133  137  0  2   67   95    82 0.653   6.09
 4.03 Intr - 118110 117960  151  1  1   72   75   159 0.609  11.30
 4.02 Intr - 121788 121665  124  0  1   26   93    53 0.082  -1.16
 4.01 Init - 137116 136877  240  1  0   44   98   169 0.529  11.42
 4.00 Prom - 146351 146312   40                              -6.95

 5.09 PlyA - 146619 146614    6                               1.05
 5.08 Term - 147861 147287  575  1  2   51   47   887 0.989  74.53
 5.07 Intr - 152600 152412  189  0  0   63  116   264 0.992  25.34
 5.06 Intr - 155626 155406  221  0  2   61   58   279 0.978  19.22
 5.05 Intr - 157369 157162  208  2  1   59   57   398 0.988  31.01
 5.04 Intr - 158523 158456   68  2  2   70   87   186 0.999  14.33
 5.03 Intr - 171608 171529   80  2  2  104   63   111 0.984   7.63
 5.02 Intr - 179063 178857  207  2  0   34   41   293 0.728  17.55
 5.01 Init - 182842 181091 1752  1  0   63   93  1736 0.480 161.63
 5.00 Prom - 200588 200549   40                              -6.35

 6.02 PlyA - 201517 201512    6                               1.05
 6.01 Sngl - 215396 214845  552  2  0  109   46   634 0.936  57.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  29275  29295   21  0  0   83  105    22 0.835   3.15

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_1|242_aa
MSGFWVPYPKPNPKPTSLRFGNAVWRMHPRGVSRTMETQLPVSEERTEEEKGKRRQFFEE
VPGLQDTFERGLQTQTELTTCSPGSSARRQHIMGLLSLYNCAAPRQAKPDLNSSSASAPP
PYNPSITSPPHTRSGLQFRSVASPPLPAQQFPLTEVAGAEGIVKLRTDAAQSPRKPPGPS
QTLQVTATVEGHTWFIDGSSSRPSHQSPVKAGYAIVSSTSIIESMALPASTTSQQAELIA
LT

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_1|729_bp
atgtcaggcttctgggttccctaccccaagcccaatcctaagccaaccagtttaaggttt
gggaatgcagtttggaggatgcatccgagaggagtgtcccgtactatggagacacaatta
cctgtaagtgaagagaggacagaggaggaaaaaggaaaaagaaggcagtttttcgaagaa
gtcccagggcttcaggatacatttgaaaggggccttcaaactcagactgaattaaccacc
tgctctcctggttcttcagctcgaagacagcatatcatgggccttctcagcctctataat
tgcgctgctcctcgccaggccaagccagatctcaattcttcctcagcctctgctcctcca
ccctataatccttctatcacctcccctcctcacacccggtccggcttacagtttcgttcc
gtggctagccctcccctacctgcccaacaatttcctcttacagaggtggctggagctgaa
ggcatagtcaagctgaggactgacgctgcccaatcgcctcgcaagcctcctggaccatca
cagacacttcaggtaactgctacagtggagggccacacttggtttattgatggtagttcc
tccaggcccagtcaccaatcaccagtaaaggcaggctatgctatagtgtcttccacatct
atcattgagtctatggccctgcccgcttccactacctctcaacaagctgaactcattgcc
ttaacttga

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_2|461_aa
MPAPTCCDNQNCLQTMPNAPRGGGVEGGANLLPPPSTENYFPGVCKALGQCVWYQWNPSL
SSAGRIDEWPICTRVLWIGGVTVYTGKSQWTGPHMQPKPKVLLRDSRDWWLVLYLDHSGG
KVPLALGVSKDATGSQSLEPGTLEILSTAAELASKPQNKVLPPLPSPFHKQESLFMATAA
LSPWQVLPGYHQCSLNVQGLFNQLAVNAVRPGTHPSGKRTPLWPRAGPEMPSMSQDLELG
TPRACLVLYPTVAELAVGSLLAQGGWSPGPRETCDPRCRRENFSPEGKSALRDPAPLPSS
ATDPERSYRMILETAHFRNHVALVGISTIKVYGFGATGWMVTMARFTACKSCTGIYSSDL
GPSSGFHTGKSGVFRGTGIGLRSHIYRAVLNVCRHPVQPFGELGVVHWIHHSRLKLTTQD
KWTSQQDPDHPTRLILRRDQTAAEDDRPVLVTPEANQPTHG

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_2|1386_bp
atgcccgctcccacctgctgtgacaatcaaaactgtctacagacaatgccaaatgcccca
aggggtggtggggtggaggggggtgcaaacttgctgccaccaccttccacagaaaactat
ttccctggagtgtgcaaggccctgggccagtgtgtgtggtatcagtggaacccttccctt
tcatcagcaggcagaatagatgagtggcccatctgcacaagagtgctctggataggaggt
gtcacagtgtacacaggcaagagtcagtggacaggcccacacatgcagcccaagccaaaa
gttttgctaagagacagccgggactggtggctggtgctttatttagatcattcgggtggt
aaggtccccctggccctgggtgtgtccaaagatgccactgggagccagagtctggagcca
ggaaccttagaaatcttatctactgctgctgagctggcatccaagccacaaaacaaagtc
cttccccctcttccctcccctttccacaaacaggagtctctcttcatggctactgctgcc
ctgagcccgtggcaagtactgccaggctaccaccaatgttcactcaacgtccaagggctc
ttcaatcagcttgcggtgaatgctgtgaggcccgggactcacccttcagggaaaaggact
cccctctggcccagggcaggtccagaaatgccatccatgagccaagacctggaactgggg
acaccaagagcctgcttggtgctctatcccactgtggctgagctggcagttggttccctt
ctggcccaggggggctggagccctgggccaagggagacctgtgaccccaggtgccgccgg
gagaacttcagcccagaggggaaatcagctctccgtgacccagcacccctacccagcagc
gcaacagatcctgagaggagctacaggatgattctagaaacagcgcacttcaggaatcat
gtggcacttgttggaataagtaccattaaagtatatggatttggtgccacgggatggatg
gtcaccatggcccggtttacggcatgcaaatcttgcaccggtatctactcatcagacctt
ggtcccagcagcggctttcatactggcaaaagtggagtattccgggggactggcatcgga
ttaagatcccatatttatagagccgttctaaatgtttgcagacaccctgtacagcctttt
ggggaactgggtgttgtgcattggatccaccacagtcggctgaaactgacaactcaagac
aagtggaccagccagcaggacccagatcatccaactcggctgatcctgagacgggaccaa
actgctgccgaagacgaccgccctgttctggtcactccggaggctaaccagcctacgcat
ggctga

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_3|292_aa
MSPLLGLRSELQDTCTSLGLMLSVVLLMGLARVVARQQLHRPVAHAFVLEFLATFQLCCC
THELQLLSEQHPAHPTWTLTLVYFFSLVHGLTLVGTSSNPCGVMMQMMLGGMSPETGAVR
LLAQLVSALCSRYCTSALWSLGLTQYHVSERSFACKNPIRVDLLKAVITEAVCSFLFHSA
LLHFQEVRTKLRIHLLAALITFLVYAGLSFSPNTWHFQSLYDMRLKPVPGAKKVEDLCVK
ELFHPLLERTLKSSQILVPVSLSTITTVILLGDRYKDDPSSCTALEFTGSLL

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_3|879_bp
atgtcgccgctgctggggctccggtccgagctgcaggacacctgcacctcgctgggactg
atgctgtcggtggtgctgctcatggggctggcccgcgtagtcgcccggcagcagctgcac
aggccggtggcccacgccttcgtcctggagtttctagccaccttccagctctgctgctgc
acccacgagctgcaactgctgagcgaacagcaccccgcgcaccccacctggacgctgacg
ctcgtctacttcttctcgcttgtgcatggcctgactctggtgggcacgtccagcaacccg
tgcggcgtgatgatgcagatgatgctggggggcatgtcccccgagacgggtgcggtgagg
ctattggctcagctggttagtgccctgtgcagcaggtactgcacaagcgccttgtggagc
ttgggtctgacccagtatcacgtcagcgagaggagcttcgcttgcaagaatcccatccga
gtcgacttgctcaaagcggtcatcacagaggccgtctgctcctttctcttccacagcgct
ctgctgcacttccaggaagtccgaaccaagcttcgtatccacctgctggctgcactcatc
acctttttggtctatgcaggtttgtcattctcaccaaatacttggcacttccagagcctc
tatgacatgaggctgaaaccggtccctggtgctaaaaaggttgaggatctctgtgttaag
gaattatttcatccattacttgaaagaacattaaaaagtagccagatcttagttcctgtc
tcactctctaccatcaccacagttatacttcttggtgacagatacaaagatgatccttcc
agctgcacggccttggagttcactggatctctcctttga

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_4|367_aa
MKVHSTVWKLEKQRLKGPDTESSQVQVPPRGFPLATSCSPHVNEVVALNRSDWLWKATNQ
RLKWRVQRSLSSFCNQSEAKKKKKKELRELKTLICGALKSERELTRNPSCCDRSKDTVDS
AGPVLPHSAAMSFLKSFPPPGPAEGLLRQQPDTEAVLNGKGLGTGTLYIAESRLSWLDGS
GLGFSLEYPTISLHALSRDRSDCLGEHLYVMVNAKFEEESKEPVADEEEEDSDDDVEPIT
EFRFVPSDKSALEAMFTAMCECQALHPDPEDEDSDDYDGEEYDVEAHEQGQGDIPTFYTY
EEGLSHLTAEGQATLERLEGMLSQSVSSQYNMAGVRTEDSIRDYEDGMEVDTTPTVAGQF
EDADVDH

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_4|1104_bp
atgaaagtacactccacagtgtggaagctggagaagcagcgactcaagggcccggataca
gaatcttctcaggtccaagtaccccctagaggttttccattggccacttcctgctcacct
catgtaaatgaagtggtagctctcaatcggtctgattggttgtggaaagcaaccaatcaa
agactgaagtggagggtacaacgctcactctcctctttttgcaaccaatcagaggctaag
aaaaaaaaaaaaaaggaactcagagagctcaaaacactaatctgtggagctctgaaatct
gagagagaacttaccaggaaccccagctgctgtgacagatcaaaggacacagtggattct
gcagggcctgtgttgccgcactctgctgctatgagcttcctcaaaagtttcccgccgcct
gggccagcggaggggctcctgcggcagcagccagacactgaggctgtgctgaacgggaag
ggcctcggcactggtaccctttacatcgctgagagccgcctgtcttggttagatggctct
ggattaggattctcactggaataccccaccattagtttacatgcattatccagggaccga
agtgactgtctaggagagcatttgtatgttatggtgaatgccaaatttgaagaagaatca
aaagaacctgttgctgatgaagaagaggaagacagtgatgatgatgttgaacctattact
gaatttagatttgtgcctagtgataaatcagcgttggaggcaatgttcactgcaatgtgc
gaatgccaggccttgcatccagatcctgaggatgaggattcagatgactacgatggagaa
gaatatgatgtggaagcacatgaacaaggacagggggacatccctacattttacacctat
gaagaaggattatcccatctaacagcagaaggccaagccacactggagagattagaagga
atgctttctcagtctgtgagcagccagtataatatggctggggtcaggacagaagattca
ataagagattatgaagatgggatggaggtggataccacaccaacagttgctggacagttt
gaggatgcagatgttgatcactga

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_5|1099_aa
MKSEEQPMDLENRSTANVLEETTVKKEKEDEKELVKLPVIVKLEKPLPENEEKKIIKEES
DSFKENVKPIKVEVKECRADPKDTKSSMEKPVAQEPERIEFGGNIKSSHEITEKSTEETE
KLKNDQQAKIPLKKREIKLSDDFDSPVKGPLCKSVTPTKEFLKDEIKQEEETCKRISTIT
ALGHEGKQLVNGEVSDERVAPNFKTEPIETKFYETKEESYSPSKDRNIITEGNGTESLNS
VITSMKTGELEKETAPLRKDADSSISVLEIHSQKAQIEEPDPPEMETSLDSSEMAKDLSS
KTALSSTESCTMKGEEKSPKTKKDKRPPILECLEKLEKSKKTFLDKDAQRLSPIPEEVPK
STLESEKPGSPEAAETSPPSNIIDHCEKLASEKEVVECQSTSTVGGQSVKKVDLETLKED
SEFTKVEMDNLDNAQTSGIEEPSETKGSMQKSKFKYKLVPEEETTASENTEITSERQKEG
IKLTIRISSRKKKPDSPPKVLEPENKQEKTEKEEEKTNVGRTLRRSPRISRPTAKVAEIR
DQKADKKRGEGEDEVEEESTALQKTDKKEILKKSEKDTNSKVSKVKPKGKVRWTGSRTRG
RWKYSSNDESEGSGSEKSSAASEEEEEKESEEAILADDDEPCKKCGLPNHPELKLLCEKL
EEQLQDLDVALKKKERAERRFDEFDEAIDEAIEDDIKEADGGGVGRGKDISTITGHRGKD
ISTILDEERKENKRPQRAAAARRKKRRRLNDLDSDSNLDEEESEDEFKISDGSQDEFVVS
DENPDESEEDPPSNDDSDTDFCSRRLRRHPSRPMRQSRRLRRKTPKKKYSDDDEEEESEE
NSRDSESDFSDDFSDDFVETRRRRSRRNQKRQINYKEDSESDGSQKSLRRGKEIRRVHKR
RLSSSESEESYLSKNSEDDELAKESKRSVRKRGRSTDEYSEADEEEEEEEGKPSRKRLHR
IETDEEESCDNAHGDANQPARDSQPRVLPSEQESTKKPYRIESDEEEDFENVGKVGSPLD
YSLVDLPSTNGQSPGKAIENLIGKPTEKSQTPKDNSTASASLASNGTSGGQEAGAPEEEE
DELLRVTDLVDYVCNSEQL

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_5|3300_bp
atgaaaagtgaggagcagcctatggatttagaaaaccgttctacagccaatgttctagaa
gagactactgtgaaaaaagaaaaagaagatgaaaaggaacttgtgaaactgccagtcata
gtgaagctagaaaaacctttgccagaaaatgaagaaaaaaagattatcaaagaagaaagt
gattccttcaaggaaaatgtcaaacccattaaagttgaggtgaaggaatgtagagcagat
cctaaagataccaaaagtagcatggagaagccagtggcacaggagcctgaaaggatcgaa
tttggtggcaatattaaatcttctcacgaaattactgagaaatctactgaagaaactgag
aaacttaaaaatgaccagcaggccaagataccactaaaaaaacgagaaattaaactgagt
gatgattttgacagtccagtcaagggacctttgtgtaaatcagttactccaacaaaagag
tttttgaaagatgaaataaaacaagaggaagagacttgtaaaaggatctctacaatcact
gctttgggtcatgaagggaaacagctggtaaatggagaagttagtgatgaaagggtagct
ccaaattttaagacagaaccaatagagacaaagttttatgagacaaaggaagagagctat
agcccctctaaggacagaaatatcatcacggagggaaatggaacagagtccttaaattct
gtcataacaagtatgaaaacaggtgagcttgagaaagaaacagcccctttgaggaaagat
gcagatagttcaatatcagtcttagagatccatagtcaaaaagcacaaatagaggaaccc
gatcctccagaaatggaaacttctcttgattcttctgagatggcaaaagatctctcttca
aaaactgctttatcttccaccgagtcgtgtaccatgaaaggtgaagagaagtctcccaaa
actaagaaggataagcgcccaccaatcctagaatgtcttgaaaagttagagaagtccaaa
aagacttttcttgataaggacgcacaaagattgagtccaataccagaagaagttccaaag
agtactctagagtcagaaaagcctggctctcctgaggcagctgaaacttctccaccatct
aatatcattgaccactgtgagaaactagcctcagaaaaagaagtggtagaatgccagagt
acaagtactgttggtggccagtctgtgaaaaaagtagacctagaaaccctaaaagaggat
tctgagttcacaaaggtagaaatggataatctggacaatgcccagacctctggcatagag
gagccttctgagacaaagggttctatgcaaaaaagcaaattcaaatataagttggttcct
gaagaagaaaccactgcctcagaaaatacagagataacctctgaaaggcagaaagagggc
atcaaattaacaatcaggatatcaagtcggaaaaagaagcccgattctccccccaaagtt
ctagaaccagaaaacaagcaagagaagacagaaaaggaagaggagaaaacaaatgtgggt
cgtactttaagaagatctccaagaatatctagacccactgcaaaagtggctgagatcaga
gatcagaaagctgataaaaaaagaggggaaggagaagatgaggtggaagaagagtcaaca
gctttgcaaaaaactgacaaaaaggaaattttgaaaaaatcagagaaagatacaaattct
aaagtaagcaaggtaaaacccaaaggcaaagttcgatggactggttctcggacacgtggc
agatggaaatattccagcaatgatgaaagtgaagggtctggcagtgaaaaatcatctgca
gcttcagaagaggaggaagaaaaggaaagtgaagaagccatcctagcagatgatgatgaa
ccatgcaaaaaatgtggccttccaaaccatcctgagctaaaactgctctgtgaaaaatta
gaggaacagttgcaggatttggatgttgccttaaagaagaaagagcgtgccgaacgaaga
tttgatgagtttgatgaagcaattgatgaagctattgaagatgacatcaaagaagccgat
ggaggaggagttggccgaggaaaagatatctccaccatcacaggtcatcgtgggaaagac
atctctactattttggatgaagaaagaaaagaaaataaacgaccccagagggcagctgct
gctcgaaggaagaaacgccggcgattaaatgatctggacagtgatagcaacctggatgaa
gaagagagcgaggatgaattcaagatcagtgatggatctcaagatgagtttgttgtgtct
gatgaaaacccagatgaaagtgaagaagatccgccatctaatgatgacagtgacactgac
ttttgtagccgtagactgaggcgacacccctctcggccaatgaggcagagcaggcgtttg
cgaagaaagaccccaaagaaaaaatattccgatgatgatgaagaggaggaatctgaggag
aatagtagagactctgaaagtgacttcagtgatgattttagtgatgattttgtagaaact
cggcgaaggcggtcaaggagaaatcagaaaagacaaattaactacaaagaagactcagaa
agtgacggttcccagaagagtttgcgacgtggtaaagaaataaggcgagtacacaagcga
agactttccagctcagagagtgaagagagctatttgtccaagaactctgaagatgatgag
ctagctaaagaatcaaagcggtcagttcgaaagcggggccgaagcacagacgagtattca
gaagcagatgaggaggaggaggaagaggaaggcaaaccatcccgcaaacggctacaccgg
attgagacggatgaggaggagagttgtgacaatgctcatggagatgcaaatcagcctgcc
cgtgacagccagcctagggtcctgccctcagaacaagagagcaccaagaagccctaccgg
atagaaagtgatgaggaagaggactttgaaaatgtaggcaaagtggggagcccattggac
tatagcttagtggacttaccttcaaccaatggacagagccctggcaaagccattgagaac
ttgattggcaagcctactgagaagtctcagacccccaaggacaacagcacagccagtgca
agcctagcctccaatgggacaagtggtgggcaggaggcaggagcaccagaagaggaggaa
gatgagcttttgagagtgactgaccttgttgattatgtctgtaacagtgaacagttataa

>gi568815587r:77519631_77737714|GENSCAN_predicted_peptide_6|183_aa
MRTTSTSQVRQNYHQDSEAAINRQINLELCASYVYLSMSYCFDRDDVALKNFAKYFLHQS
HEEREHAEKLMKLQNQRGGRIFLQDIKKPDCDDWESGLNVMECALHLEKNVNQSLLELHK
LATDKNDPHLCDFIETHYLNEQVKAIKELDDHVTNLHKMGALESGLAEYLFDKHTLGDSD
NES

>gi568815587r:77519631_77737714|GENSCAN_predicted_CDS_6|552_bp
atgaggaccacgtccacctcacaggtgcgccagaactaccaccaggactcagaggccgcc
atcaaccgccagatcaacctagagctctgtgcctcctacgtttacctgtccatgtcttac
tgctttgaccgtgatgatgtggctttgaagaactttgccaaatactttcttcaccaatct
catgaggagagggagcatgctgagaaactgatgaagctgcagaaccaacgaggtggccga
atcttccttcaggatatcaaaaaaccagactgtgatgactgggagagcgggctgaatgtg
atggagtgtgcattacatttggaaaaaaatgtgaatcagtcactactggaactgcacaaa
ttggccactgacaaaaatgacccccatttgtgtgacttcattgagacacattacctgaat
gagcaggtgaaagccatcaaagaattggatgaccatgtgaccaacttgcacaagatggga
gcactcgaatctggcttggcagaatatctctttgacaagcacaccctgggagacagtgat
aatgaaagctaa