GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:53:14 Sequence gi568815597r:200307021_200509594 : 202574 bp : 41.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 22255 22380 126 0 0 71 48 197 0.658 14.21 1.02 Term + 24330 24431 102 2 0 101 41 102 0.686 4.10 1.03 PlyA + 25336 25341 6 1.05 2.00 Prom + 46746 46785 40 -2.25 2.01 Init + 54334 54400 67 0 1 93 72 18 0.802 1.20 2.02 Term + 55394 55698 305 0 2 82 45 150 0.795 4.45 2.03 PlyA + 57002 57007 6 1.05 3.09 PlyA - 64346 64341 6 1.05 3.08 Term - 65227 65171 57 1 0 111 49 27 0.019 -2.19 3.07 Intr - 102703 100055 2649 1 0 128 95 1796 0.203 170.68 3.06 Intr - 105633 105592 42 0 0 114 107 12 0.222 3.22 3.05 Intr - 147646 147561 86 2 2 33 89 86 0.039 1.82 3.04 Intr - 171134 170993 142 2 1 71 113 27 0.391 2.51 3.03 Intr - 185041 184967 75 1 0 49 113 47 0.716 2.09 3.02 Intr - 186527 186438 90 2 0 121 45 76 0.897 5.87 3.01 Init - 187073 187023 51 2 0 35 115 34 0.365 2.01 3.00 Prom - 194403 194364 40 -3.25 4.00 Prom + 195288 195327 40 -8.15 4.01 Init + 198063 198539 477 2 0 36 78 212 0.778 10.46 4.02 Term + 199350 199724 375 2 0 61 32 252 0.928 10.65 4.03 PlyA + 199915 199920 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:200307021_200509594|GENSCAN_predicted_peptide_1|75_aa MHKKVVLFCLSGDENIIVEEGRGTLAGEVGQAVDNSYTVVVKVPTSTPEKGWLLSLAHML PPQLPKKGEGVTLTE >gi568815597r:200307021_200509594|GENSCAN_predicted_CDS_1|228_bp atgcataagaaggtggtgctcttctgcctgagtggggacgagaacatcatcgttgaggag ggcagggggaccctggcaggtgaagtggggcaggctgttgacaactcctacaccgtggtt gtcaaggtgcccacatcaaccccggaaaaaggctggctgctcagcttagctcacatgctt cctccacagcttccaaaaaaaggagagggtgtgacactgacggaataa >gi568815597r:200307021_200509594|GENSCAN_predicted_peptide_2|123_aa MGKNEPLVFPPNLFPLHILATPGGSASELVLPGPLKKPSLLSAFSDPILGPEKRNHSNVV GLKSIVPDPKACKQLRLKPCLLEACAVPLCCLLESFHALVLPNTGHCSTRSNEPNSMSNQ GVE >gi568815597r:200307021_200509594|GENSCAN_predicted_CDS_2|372_bp atgggcaaaaatgaacccttggtgttccccccaaatctgtttcctctgcatatccttgcc accccaggaggctcggcctctgaactagttcttcctgggcctctgaaaaaaccctccttg ctgagtgccttctctgaccccatattaggccctgagaaaaggaaccattctaatgttgta gggctcaaatcaattgttccagatcccaaggcttgtaaacaattgagactgaaaccatgc ctgctagaagcctgtgctgtcccactgtgctgcttgctggagagcttccatgcccttgtc ctgcccaacacaggccattgttctacccgtagtaacgagcccaattccatgtctaatcag ggagtagaatga >gi568815597r:200307021_200509594|GENSCAN_predicted_peptide_3|1063_aa MSAMAAETVHSPPQLHKALKHLLSCGISVNYLLRRQDAIEAAGSENLAWVTAVVGIVSKA VSSNVDFVHSEKANTLRPEATERGLSEAGSLQSPGFQSSSSCEALGTGHCSAALLACLQE GTLKNHVWPEISKWRCRAGSWTFVSGAQGLFIFLIYGVYNTEPGLLRGMKIGSGFLSGGG GTGSSGGSGSGGGGSGGGGGGGSSGRRAEMEPTFPQGMVMFNHRLPPVTSFTRPAGSAAP PPQCVLSSSTSAAPAAEPPPPPAPDMTFKKEPAASAAAFPSQRTSWGFLQSLVSIKQEKP ADPEEQQSHHHHHHHHYGGLFAGAEERSPGLGGGEGGSHGVIQDLSILHQHVQQQPAQHH RDVLLSSSSRTDDHHGTEEPKQDTNVKKAKRPKPESQGIKAKRKPSASSKPSLVGDGEGA ILSPSQKPHICDHCSAAFRSSYHLRRHVLIHTGERPFQCSQCSMGFIQKYLLQRHEKIHS REKPFGCDQCSMKFIQKYHMERHKRTHSGEKPYKCDTCQQYFSRTDRLLKHRRTCGEVIV KGATSAEPGSSNHTNMGNLAVLSQGNTSSSRRKTKSKSIAIENKEQKTGKTNESQISNNI NMQSYSVEMPTVSSSGGIIGTGIDELQKRVPKLIFKKGSRKNTDKNYLNFVSPLPDIVGQ KSLSGKPSGSLGIVSNNSVETIGLLQSTSGKQGQISSNYDDAMQFSKKRRYLPTASSNSA FSINVGHMVSQQSVIQSAGVSVLDNEAPLSLIDSSALNAEIKSCHDKSGIPDEVLQSILD QYSNKSESQKEDPFNIAEPRVDLHTSGEHSELVQEENLSPGTQTPSNDKASMLQEYSKYL QQAFEKSTNASFTLGHGFQFVSLSSPLHNHTLFPEKQIYTTSPLECGFGQSVTSVLPSSL PKPPFGMLFGSQPGLYLSALDATHQQLTPSQELDDLIDSQKNLETSSAFQSSSQKLTSQK EQKNLESSTGFQIPSQELASQIDPQKDIEPRTTYQIENFAQAFGSQFKSGSRVPMTFITN SNGEVDHRVRTSVSDFSGYTNMMSDRRAKTIDFSGVAKHVTKG >gi568815597r:200307021_200509594|GENSCAN_predicted_CDS_3|3192_bp atgtctgcaatggcagcagagacagtacattctcctccccagctgcataaggctttaaag cacctcctttcctgtggaatttctgtaaattatttgctcagacgccaagatgccatagaa gcagctggctctgagaatctggcctgggtgactgcggtggttggcattgtttcaaaggct gtatcctcaaatgtagactttgttcattctgagaaggccaacaccttgaggccagaggca acagaaagaggattgagtgaagcaggaagcctgcaaagccctggattccaatccagctct agctgcgaggcgctgggcacgggtcactgctctgcggctctgcttgcttgtctgcaagag ggaaccctgaaaaaccatgtgtggcctgagataagcaagtggagatgccgagctggcagc tggacatttgtgtctggtgctcaaggtctcttcatcttcctgatatatggggtgtacaac acagagccaggcctcctccggggtatgaaaatcggcagtgggttcctgagtggcggcgga ggtaccggcagtagcggtggtagcggctccggcggcggtggtagtggcggcggcggcggc ggcggcagcagcggcaggagggcagagatggaacccacctttccccagggtatggttatg ttcaaccaccgtcttcccccggtcaccagcttcacccggccggcggggtcggccgcccct cccccgcaatgcgtgttatcctcctctacctccgcagccccggccgctgagcccccccct ccgccagccccggacatgactttcaagaaggagccggcggcgtcagccgcggccttcccc tcgcagaggacctcctgggggttcttgcagtctttggttagcatcaaacaggagaaaccc gcggatcctgaggagcagcagtcccaccaccaccatcaccaccaccactatggggggctg ttcgctggagctgaagagaggtctccaggcctaggaggcggtgaaggggggagtcacggc gtcatccaggacctcagtattctccaccagcatgtccagcagcaaccagcccagcaccac cgtgacgtattactcagcagcagtagcaggactgatgaccaccatggcactgaggagcca aagcaggacactaatgtcaaaaaggcaaaaaggccaaagccagaatctcagggaatcaaa gccaagaggaagccaagtgcatcttccaaaccttctttggttggagatggagaaggtgcc atcctctccccaagtcagaaacctcatatctgtgatcactgtagtgctgctttccgaagc tcctatcacctgcggagacatgtcctcattcatacaggagaaagacctttccagtgcagc cagtgtagtatgggtttcattcagaaatacctactacagagacatgagaaaattcatagt agagagaagccatttggatgtgatcagtgcagcatgaagtttattcagaagtaccatatg gagagacacaagaggacacatagtggagaaaagccatataagtgtgacacttgccaacag tatttttcaaggactgatagattgttgaagcacaggcgcacatgtggtgaagtcatagtt aaaggagccactagtgcagaacctgggtcatcaaaccataccaatatgggtaatctggct gtgttgtctcagggaaatacaagttcttcaaggagaaaaacaaagtcaaaaagcatagct attgaaaataaggaacagaagaccggtaaaacaaatgaatcgcaaatttcaaataatata aacatgcagagttactcagtagaaatgcctaccgtgtcttccagtggaggcataattggc actggaatagatgaactgcagaagagggtgccaaaattgatctttaagaaaggaagcaga aagaatacagataaaaactaccttaactttgtgtcaccattaccagacatagtaggacag aaatccttgtctggaaaaccaagtggctcacttggcatagtatcaaataatagtgtggag accattggtcttctccaaagtacaagtggcaaacaaggtcagataagtagtaattatgat gatgccatgcagttttcaaagaaaagaagatatttaccaactgccagcagcaacagtgcc ttttctataaacgtaggacacatggtctcccaacagtctgtcattcagtctgcaggtgtc agtgttttggacaatgaggcaccattgtcacttattgactcctcagctctaaatgctgaa attaaatcttgtcatgacaagtctggaattcctgatgaggttttacaaagtattttggat caatactccaacaaatcagaaagccagaaagaggatcctttcaatattgcagaaccacga gtggatttacacacctcaggagaacactcagaattggttcaagaagaaaatttgagccca ggcacccaaacaccttcaaatgataaagcaagtatgttgcaagaatactccaaatacctc caacaggcttttgaaaaatccactaatgcaagttttactcttggacacggtttccaattt gtcagtttgtcttcacctctccacaaccacactttgtttccagaaaaacaaatatacact acgtctcctttggagtgtggtttcggccaatctgttacctcagtgttgccatcttcattg ccaaagcctccttttgggatgttgtttggatctcagccaggtctttatttgtctgctttg gatgctacacatcagcagttgacaccttcccaggagctggatgatctgatagattctcag aagaacttagagacttcatcagccttccagtcctcatctcagaaattgactagccagaag gaacagaaaaacttagagtcttcaacaggctttcagattccatctcaggagttagctagc cagatagatcctcagaaagacatagagcctagaacaacgtatcagattgagaactttgca caagcgtttggttctcagtttaagtcgggcagcagggtgccaatgacctttatcactaac tctaatggagaagtggaccatagagtaaggacttcagtgtcagatttctcagggtataca aatatgatgtctgatagaagagctaagactattgacttttcaggagttgcaaaacatgtt acaaaaggatag >gi568815597r:200307021_200509594|GENSCAN_predicted_peptide_4|283_aa MIAPLHSNPGDRVRPCKRERKKKGRKERKEGKKERKKERKKERKKERKERKKERKKERKK EKEGKKERKKEREGRKERKKEKEKRKKERREGRKEEESKKKKKERTRERRKKERKKEKER KRKKRERKGRERGRKEGRKEGRKEKRKEEKKEKREKGRKNKLLKIMDKRIWLWWTKESER EDIWRWFLFVFGEEKINKAEGGEERCGRRKTCRLVFGHKELVLATPISFWEFAMAALRHS TKLLLLPGCRQPPPSSATGGVVNVFYVLCCSLQPHRPQAVVIS >gi568815597r:200307021_200509594|GENSCAN_predicted_CDS_4|852_bp atgattgcaccactgcattccaacccaggtgacagagtgagaccttgtaagagagaaaga aaaaagaaaggaaggaaagaaagaaaggaaggaaagaaagaaagaaagaaagaaagaaag aaagaaagaaagaaagaaagaaaggaaagaaagaaggaaagaaagaaggaaagaaagaaa gagaaggaaggaaagaaagaaagaaagaaagaaagagaaggaaggaaggaaagaaagaaa gaaaaagaaaaaagaaagaaagaaaggagggaaggaaggaaggaggaagaaagcaagaag aaaaagaaagaaagaacgagagaaagaagaaagaaagaaagaaagaaagagaaagagaga aagagaaagaagagagaaaggaagggaagggagagaggaaggaaggaaggaaggaaggaa ggaaggaaagaaaaaaggaaggaagaaaagaaagagaagagagagaaaggaaggaaaaat aaattattgaagatcatggataagagaatttggctttggtggacaaaagagagtgagaga gaagacatttggcgatggtttctttttgtctttggtgaagaaaagataaacaaggcagaa ggaggtgaggagagatgtggaaggagaaagacctgcagattggtatttggacacaaggag ctcgttttagcgacaccaatatccttttgggagtttgcaatggcggctctcaggcacagc acaaaactgctcctgttgcctggatgccgccagccgccacccagctctgccacgggaggt gtcgtaaatgtgttttatgtgttgtgttgctcgctgcagcctcatcgtccccaggctgtt gtcatctcctaa