GENSCAN 1.0	Date run: 8-Nov-116	Time: 12:05:52

Sequence gi568815581r:49601461_49807833 : 206373 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -    106    101    6                               1.05
 1.08 Term -    547    404  144  1  0   70   40   193 0.237  10.61
 1.07 Intr -   5912   5790  123  2  0   35  116   164 0.999  14.68
 1.06 Intr -   6469   6414   56  2  2   74   95    44 0.333   2.40
 1.05 Intr -   9997   9820  178  1  1  103   77   111 0.977  10.99
 1.04 Intr -  17648  17521  128  0  2  115  105   147 0.993  19.50
 1.03 Intr -  17925  17774  152  2  2   97   34   135 0.998   8.81
 1.02 Intr -  20652  20486  167  0  2  117   77    92 0.869   9.76
 1.01 Init -  21314  21273   42  2  0   75  111     9 0.655   2.33
 1.00 Prom -  51965  51926   40                              -2.76

 2.03 PlyA -  52000  51995    6                               1.05
 2.02 Term -  57501  57394  108  0  0   76   39    77 0.887   0.11
 2.01 Init -  61711  61649   63  1  0   59   99    79 0.807   5.26
 2.00 Prom -  67312  67273   40                              -4.46

 3.04 PlyA -  68370  68365    6                               1.05
 3.03 Term -  69911  69829   83  0  2   97   42    54 0.679  -0.44
 3.02 Intr -  76545  76473   73  0  1   82  109    54 0.917   5.88
 3.01 Init -  76705  76625   81  1  0   77   70    78 0.666   3.92
 3.00 Prom -  77027  76988   40                              -6.16

 4.11 PlyA -  80579  80574    6                               1.05
 4.10 Term -  81012  80891  122  1  2  105   47    72 0.097   3.44
 4.09 Intr - 101551 101398  154  1  1   95  106   145 0.998  16.65
 4.08 Intr - 101834 101728  107  0  2   51  109   113 0.999   9.63
 4.07 Intr - 102565 102383  183  2  0   95   67    25 0.606   0.86
 4.06 Intr - 102766 102640  127  1  1   87   72   133 0.997  11.85
 4.05 Intr - 103821 103664  158  1  2   84   56    99 0.978   6.03
 4.04 Intr - 104436 104406   31  0  1   57  115    21 0.865  -0.60
 4.03 Intr - 104874 104744  131  1  2  102   67    66 0.884   6.31
 4.02 Intr - 105608 105505  104  1  2   95   65   102 0.881   8.42
 4.01 Init - 106484 106270  215  2  2   35   50   340 0.781  21.12
 4.00 Prom - 108699 108660   40                              -6.76

 5.11 PlyA - 108888 108883    6                               1.05
 5.10 Term - 110095 109795  301  0  1  102   55   160 0.954   8.79
 5.09 Intr - 114855 114705  151  1  1   66   94   136 0.509  11.22
 5.08 Intr - 116254 116053  202  1  1   90  121   155 0.915  17.86
 5.07 Intr - 118434 118300  135  0  0  109   96    86 0.983  12.26
 5.06 Intr - 118976 118866  111  2  0   63   75    65 0.896   3.28
 5.05 Intr - 121134 121039   96  0  0  116   55    56 0.468   5.31
 5.04 Intr - 131260 131091  170  2  2   67   95   153 0.543  13.57
 5.03 Intr - 155968 155778  191  0  2   73   72    86 0.222   4.73
 5.02 Intr - 161775 161700   76  1  1   92    3    70 0.076  -2.53
 5.01 Init - 162627 162432  196  0  1  102   99   149 0.855  14.49
 5.00 Prom - 175790 175751   40                              -4.36

 6.00 Prom + 181491 181530   40                              -1.96
 6.01 Init + 184093 184098    6  0  0   57   91     0 0.447  -1.83
 6.02 Intr + 187225 187389  165  0  0   26  100   162 0.933  11.36
 6.03 Term + 187541 187657  117  1  0   76   38    73 0.632  -0.36
 6.04 PlyA + 189373 189378    6                               1.05

 7.00 Prom + 192767 192806   40                              -5.86
 7.01 Init + 193883 194186  304  1  1   96   69   282 0.781  24.54
 7.02 Intr + 195290 195466  177  0  0   -9  105   168 0.782   8.69
 7.03 Intr + 196859 197098  240  0  0   79   68   206 0.438  15.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100050  99998   53  1  2  120   49    55 0.923   2.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_1|329_aa
MSSGPVAESWCYTQVVHFLFNCFLFFYQQIKVVKFSYMWTINNFSFCREEMGEVIKSSTF
SSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAM
ESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMN
MVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESK
KNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNL
SVENAAEILILADLHSADQLKTQAVDFIN
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_1|990_bp
atgtcgagtggccccgtagctgagagttggtgctacacacaggttgtccacttcctattt
aattgcttcctgtttttctatcaacagatcaaggtagtgaaattctcctacatgtggacc
atcaataactttagcttttgccgggaggaaatgggtgaagtcattaaaagttctacattt
tcatcaggagcaaatgataaactgaaatggtgtttgcgagtaaaccccaaagggttagat
gaagaaagcaaagattacctgtcactttacctgttactggtcagctgtccaaagagtgaa
gttcgggcaaaattcaaattctccatcctgaatgccaagggagaagaaaccaaagctatg
gagagtcaacgggcatataggtttgtgcaaggcaaagactggggattcaagaaattcatc
cgtagagattttcttttggatgaggccaacgggcttctccctgatgacaagcttaccctc
ttctgcgaggtgagtgttgtgcaagattctgtcaacatttctggccagaataccatgaac
atggtaaaggttcctgagtgccggctggcagatgagttaggaggactgtgggagaattcc
cggttcacagactgctgcttgtgtgttgccggccaggaattccaggctcacaaggctatc
ttagcagctcgttctccggtttttagtgccatgtttgaacatgaaatggaggagagcaaa
aagaatcgagttgaaatcaatgatgtggagcctgaagtttttaaggaaatgatgtgcttc
atttacacggggaaggctccaaacctcgacaaaatggctgatgatttgctggcagctgct
gacaagtatgccctggagcgcttaaaggtcatgtgtgaggatgccctctgcagtaacctg
tccgtggagaacgctgcagaaattctcatcctggccgacctccacagtgcagatcagttg
aaaactcaggcagtggatttcatcaactag
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_2|56_aa
MLARPSTSHLLCAWFLTGHGSRDKRAEKNSEVGVWEVEKVASRSLLRCGSMDEHMK
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_2|171_bp
atgcttgctcgcccgtccacctctcacctcctgtgcgcctggttcctaacaggccatggg
tcgagggataagagggctgagaaaaattcagaagttggtgtctgggaagtggagaaagtt
gcaagcagatctttgctaaggtgtgggagcatggatgagcacatgaagtag
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_3|78_aa
MGRRPRGVGSGGTRWLRRRADAMTPRLVPDVADSMRQAQGDGDQQLSPPLSVNYEVGAKV
IAVLSFNGKNLNYFCTNL
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_3|237_bp
atggggaggaggccgcgcggggtggggtctggcggtacgcgctggctgcgtcgacgtgct
gacgccatgacgccccggctggtcccggatgttgcggacagtatgaggcaagcgcagggg
gacggggaccagcagctgtcgccgccgctctcagtaaattatgaggttggtgcaaaagta
attgcggttttgtcatttaatggtaaaaacctcaattacttttgcaccaacctctaa
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_4|443_aa
MRPLPPVGDVRLELSPPPPLLPVPVVSGSPVGSSGRLMASSSSLVPDRLRLPLCFLGVFV
CYFYYGILQEKITRGKYGEGAKQETFTFALTLVFIQCVINAVFAKILIQFFDTARVDRTR
SWLYAACSISYLGAMVSSNSALQFVNYPTQVLGKSCKPIPVMLLGVTLLKKKYPLAKYLC
VLLIVAGVALFMYKPKKVVGIEEHTVGYGELLLLLSLTLDGLTGVSQDHMRAHYQTGSNH
MMLNINLWSTLLLGMAVSCPDQGPELVPRCPFVQALEKPSWKNLHQQDLFATCYWHNEGE
SCVSCHGKTLSSKTQGGILFTGELWEFLSFAERYPAIIYNILLFGLTSALGQSFIFMTVV
YFGPLTCSIITTTRKFFTILASVILFANPISPMQWVGTVLVFLELQGKPHLCLLDCKMEP
KILVGISEPLGKIPHPDLAHNSQ
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_4|1332_bp
atgaggcccctgccgccggtcggcgatgtccggctggagctgtcgcctccgccgccgctg
ctgccggtgccggttgtgagcgggtctccagtcggctcctctgggcgtctcatggcctct
agcagctccctggtgcccgaccggctgcgcctgccgctctgcttcctgggtgtctttgtc
tgctatttttactatgggatcctgcaggaaaagataacaagaggaaagtatggggaagga
gccaagcaggagacgttcacctttgccttaactttggtcttcattcaatgtgtgatcaat
gctgtgtttgccaagatcttgatccagttttttgacactgccagggtggatcgtacccgg
agctggctctatgctgcctgttctatctcctatctgggtgccatggtctccagcaattca
gcactacagtttgtcaactacccaactcaggtccttggtaaatcctgcaagccaatccca
gtcatgctccttggggtgaccctcttgaagaagaagtacccgttggccaagtacctgtgt
gtgctgttaattgtggctggagtggcccttttcatgtacaaacccaagaaagttgttggg
atagaagaacacacagtcggctatggagagctactcttgctattatcgctgaccctggat
ggactgactggtgtttcccaggaccacatgcgggctcattaccaaacaggctccaaccac
atgatgctgaacatcaacctttggtcgacattgctgctgggaatggctgttagttgccca
gaccaagggcctgaattagttccaagatgcccttttgttcaagcccttgagaaaccttcc
tggaaaaacctgcaccagcaggatttgtttgctacttgttactggcacaatgaaggagag
tcgtgtgtcagttgtcatgggaagacactcagtagtaaaacacagggaggaatcctgttc
actggggagctctgggagttcttgagctttgctgaaaggtaccctgccatcatctataac
atcctgctctttgggctgaccagtgccctgggtcagagcttcatctttatgacggttgtg
tattttggtcccctgacctgctccatcatcactacaactcgaaagttcttcacaattttg
gcctctgtgatcctcttcgccaatcccatcagccccatgcagtgggtgggcactgtgctt
gtgttcctggagcttcaagggaaaccacacctttgcttgctggactgtaaaatggagccg
aagatcctcgttggcatctcagagcctttggggaagatcccgcacccagacctggctcat
aacagccaatga
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_5|542_aa
MAGAAAGGRGGGAWGPGRGGAGGLRRGCSPPAPAGSPRAGLQPLRATIPFQLQQPHQRRD
GGGRAEVVRSQILDLLNMAVSFQPSFVYERVPTYALIDPYCVGALQLIKGGFNELKRMLS
KYFIIEKLQVQALGDLELKSRPCTDIYPALLRHTASVPCSVAPEKSVCRPQPLQVRRTFS
LDTILSSYLLGQWPRDADGAFTCCTNDKATQTPLSWQELEGERASSCAHKRSASWGSTDH
RKEISKLKQQLQRTKLSRSGKEKERGSPLLGDHAVRGALRASPPSFPSGSPVLRLSPCLH
RSLEGLNQELEEVFVKEQGEEELLRILDIPDGHRAPAPPQSGSCDHPLLLLEPGNLASSP
SMSLASPQPCGLASHEEHRGAAEELASTPNDKASSPGHPAFLEDGSPSPVLAFAASPRPN
HSYIFKREPPEGCEKVRVFEEATSPGPDLAFLTSCPDKNKVHFNPTGSAFCPVNLMKPLF
PGMGFIFRNCPSNPGSPLPPASPRPPPRKDPEASKASPLPFEPWQRTPPSEEPVLFQSSL
MV
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_5|1629_bp
atggcgggggccgcagcgggcggcagaggcggaggtgcctgggggccggggcgcggaggg
gccggggggctccggcggggctgctctcccccagcccccgccggctccccccgggctggg
ctgcagccgctcagggccacgatccccttccagctgcagcagccgcaccagcgccgggac
gggggtggccgtgcagaagttgtacgttcccagattcttgacctgcttaatatggctgtc
tcattccaaccatcgtttgtgtacgagcgtgtgcccacatatgccttgatcgacccttac
tgtgtgggagcactgcagctaatcaagggtggctttaatgaacttaaaagaatgctatcc
aagtatttcataatagaaaaacttcaagtccaggccctgggggacctggaactgaagtcc
aggccatgtacagacatttaccctgccctgctcaggcatacagccagcgtcccatgctcg
gtggccccagaaaagtcagtgtgtaggcctcagccacttcaggtccggcgtacattctcc
ctggacaccatcctcagctcctaccttctgggccagtggccacgagatgctgatggggcc
ttcacctgctgcaccaatgacaaggccacccagacgcccctgtcctggcaagagctagaa
ggtgagcgtgccagttcctgtgcacacaagcgctcagcatcctggggcagcacagaccac
cgaaaagagatttccaagttgaagcaacaactgcagaggacgaagctgagccgcagtggg
aaagagaaggagcgaggttcaccactcctaggggaccacgcagtgcggggagcactgagg
gcgtcccctcccagcttcccctcagggtcccctgtcttgcgactcagcccctgcctgcac
aggagcctggaagggctcaaccaagagctggaggaggtatttgtgaaggagcagggagaa
gaggagctgctgaggatccttgatatccctgatgggcaccgggccccagctcctccccag
agtggcagctgtgatcatcccctcctcctcctggagcctggcaaccttgccagctctcct
tccatgtccttggcatctccccagccttgtggcctggccagtcatgaggaacatcggggt
gccgccgaggagctggcatccacccccaacgacaaagcctcctctccaggacacccagcc
tttcttgaagatggcagcccatctccagtccttgcctttgctgcctcccctcgacctaat
catagctacatcttcaaacgggagcccccagaaggctgtgagaaagtgcgtgtgtttgaa
gaagccacgtctccaggtcctgacctggccttcctgacttcctgtcctgacaagaacaaa
gtccatttcaacccgactggctcagccttctgccccgtcaacctgatgaagcccctcttc
cccggcatgggcttcatcttccgtaactgcccctcaaacccgggatctccccttcccccg
gccagccccaggccaccacctcggaaggatccggaagcctccaaggcctccccactgcca
ttcgagccatggcagcgcaccccaccatcagaagagcctgtgcttttccagagctccctg
atggtctga
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_6|95_aa
MQNAPDAERQEALGIVRRIGTDTEAATEPAGATVPAAAAAARIGTVGPQPPAMPRRKKNV
GPTLASAYACNYSRPLLAWIGGVLINWSLKRLEAY
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_6|288_bp
atgcagaacgctccagacgctgagaggcaggaggcactagggatcgtccgcaggattggg
actgatacagaggccgccacggagcccgccggagccaccgttcctgctgctgccgccgct
gcccgaatcggaaccgtcgggccgcagccgccggcaatgccgcgaaggaagaaaaatgtg
ggcccgaccctggcctcggcctatgcttgcaactattcccgccctctgcttgcctggatc
ggaggcgttcttattaattggtcgctgaaacgtctggaagcatattga
>gi568815581r:49601461_49807833|GENSCAN_predicted_peptide_7|241_aa
MVLLESEQFLTELTRLFQKCWTSGSIYITLKKYGGGTKPVPKKGSVEGFEPSDKCLLRAI
DGKKKVSTVVSSNQVNKFQMAYSNLPSANMDGLKKRDKKNKNSSPVRNLQSFGTEEPAYS
TRRVTRSQQQPTPVTPKKYPLRQTRSSGSETEQVVDFSDRETKNTADHDESPPRTPTGNA
PSSESDIDISSPNVSHDESIAKDMSLKDSGSDLSHRPKRRRFHESYNFNMKCPTPGCNSL
X
>gi568815581r:49601461_49807833|GENSCAN_predicted_CDS_7|723_bp
atggtgttgttggagagtgagcagttcctgacggagctgaccagacttttccagaagtgc
tggacatcgggcagcatctatatcaccttgaagaagtatggtggtggaaccaaacccgtt
ccaaagaaaggttctgtggagggctttgagccctcagacaagtgtctgttaagagctatc
gatgggaaaaagaaggtcagcactgtggtgagctccaaccaagtgaataagtttcagatg
gcttattcaaacctaccgagtgctaacatggatgggctgaagaaaagggacaaaaagaac
aaaaattccagtcctgttcgaaatctgcagtcttttggcactgaggagcctgcttactct
accagaagagtgacccgtagtcagcagcagcctaccccagtgacaccgaaaaaataccct
cttcggcagactcgttcatctggttcagaaactgagcaagtggttgatttttcagataga
gaaactaaaaatacagctgatcatgatgagtcaccgcctcgaactccaactggaaatgcg
ccttcttctgagtctgacatagacatctccagccccaatgtatctcacgatgagagcatt
gccaaggacatgtccctgaaggactcaggcagtgatctctctcatcgccccaagcgccgt
cgcttccatgaaagctacaacttcaatatgaagtgtcctacaccaggctgtaactctcta
gnn