GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:00:31 Sequence gi568815597f:39959347_40171530 : 212184 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 3381 4058 678 2 0 28 46 945 0.789 80.49 1.02 PlyA + 4098 4103 6 -0.45 2.00 Prom + 4860 4899 40 -8.36 2.01 Init + 5276 5301 26 1 2 81 57 -9 0.350 -5.54 2.02 Intr + 5865 5988 124 0 1 127 113 184 0.993 25.39 2.03 Intr + 6125 6203 79 1 1 64 78 77 0.969 3.42 2.04 Intr + 6511 6668 158 2 2 99 89 41 0.993 5.03 2.05 Intr + 7255 7345 91 0 1 70 94 86 0.997 6.97 2.06 Intr + 7465 7586 122 2 2 97 101 192 0.999 21.61 2.07 Intr + 7740 7823 84 2 0 62 101 63 0.970 5.02 2.08 Intr + 8282 8365 84 1 0 113 70 61 0.986 6.82 2.09 Intr + 8458 8570 113 0 2 84 82 132 0.999 11.38 2.10 Intr + 8988 9131 144 0 0 74 64 176 0.787 13.20 2.11 Intr + 9223 9399 177 1 0 73 105 278 0.999 27.03 2.12 Term + 10159 10222 64 1 1 131 49 50 0.993 2.76 2.13 PlyA + 14539 14544 6 1.05 3.00 Prom + 37701 37740 40 -2.36 3.01 Init + 95450 95457 8 1 2 114 91 0 0.662 3.40 3.02 Intr + 99991 100112 122 1 2 65 121 77 0.416 8.84 3.03 Intr + 100721 100824 104 0 2 73 75 100 0.744 7.09 3.04 Intr + 102389 102466 78 1 0 91 100 45 0.980 5.75 3.05 Intr + 104881 105024 144 0 0 99 89 105 0.997 12.18 3.06 Intr + 105128 105213 86 1 2 101 75 91 0.891 7.72 3.07 Intr + 107026 107069 44 1 2 124 37 6 0.293 -3.02 3.08 Intr + 108288 108371 84 1 0 117 101 36 0.360 7.59 3.09 Intr + 110344 110528 185 2 2 22 89 129 0.405 5.91 3.10 Intr + 110813 110936 124 1 1 60 95 125 0.999 10.56 3.11 Intr + 111084 111166 83 1 2 114 93 104 0.999 12.86 3.12 Intr + 111490 111633 144 0 0 124 78 123 0.999 15.38 3.13 Term + 112104 112187 84 2 0 130 47 91 0.999 6.85 3.14 PlyA + 113243 113248 6 1.05 4.07 PlyA - 113384 113379 6 1.05 4.06 Term - 114837 114715 123 0 0 115 47 79 0.987 4.88 4.05 Intr - 117567 117496 72 0 0 101 93 -4 0.657 1.00 4.04 Intr - 119312 119214 99 2 0 95 78 186 0.988 18.61 4.03 Intr - 121141 121051 91 0 1 113 102 106 0.999 14.50 4.02 Intr - 130166 130064 103 0 1 31 92 133 0.443 7.23 4.01 Init - 137892 137769 124 0 1 106 92 175 0.948 18.03 4.00 Prom - 153061 153022 40 -2.16 5.04 PlyA - 157440 157435 6 1.05 5.03 Term - 174066 173813 254 1 2 40 38 208 0.309 6.80 5.02 Intr - 175592 175416 177 0 0 2 4 243 0.197 7.19 5.01 Init - 181872 181851 22 0 1 74 77 21 0.228 -1.16 5.00 Prom - 188206 188167 40 -4.26 6.04 PlyA - 188366 188361 6 1.05 6.03 Term - 202230 202012 219 0 0 96 41 179 0.129 11.04 6.02 Intr - 203766 203638 129 0 0 85 93 5 0.131 1.59 6.01 Init - 207911 207894 18 2 0 78 116 43 0.245 5.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 202054 202290 237 0 0 101 94 236 0.806 21.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_1|225_aa MPVTKLGRLVKDMKIKSLQEIYLFSLPIKGSEIIDFFLGASLKDEVLKIMPVQKQTRAGQ RTRFKAFVAIGDYNGHVGLGVKCSKEVATAIRGAIILAKLSIVPVRRGYWGNKISKPHTV PCKVTGRCGSVLVRLIPAPRSTGIVSTPVPKKLLMMAGIDDCYTSARGCTATLGSFAKGT FDAISKTYSYLTPDLWKETVFTKSPDQEFTDHLIKTHQAPAVATT >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_1|678_bp atgcctgtcaccaagctgggccgcttggtgaaggacatgaagatcaagtccctgcaggag atctatctcttctctctgcccattaagggatctgagatcattgactttttcctgggggcc tctctcaaggatgaggttttgaagattatgccagtgcagaagcagacccgtgccggccag cgcaccaggttcaaggcgtttgttgctatcggagactacaatggccatgtcggtctgggt gttaagtgctccaaggaggtggccaccgccatccgtggggccatcatcctggccaagctc tccattgtccccgtgcgcagaggctactgggggaacaagatcagcaagccccacaccgtc ccttgcaaggtgacaggccgctgcggctctgtgctggtgcgtctcatccctgcacccagg agcactggcatcgtctccacacctgtgcccaagaagctgctcatgatggctggtatcgat gactgctacacctcagcccggggctgcactgccaccctgggcagctttgccaagggcacc tttgatgccatctctaagacctacagctacctgacccccgacctctggaaggagactgta ttcaccaagtctcccgatcaggaattcactgaccacctcatcaagacccaccaggctcca gctgtggctacaacatag >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_2|421_aa MPASLCVNGIIFSTPLAVIAYFLIWFVPDFPHGQTYWYLLFYCLFETMVTCFHVPYSALT MFISTEQTERDSATAYRMTVEVLGTVLGTAIQGQIVGQADTPCFQDLNSSTVASQSANHT HGTTSHRETQKAYLLAAGVIVCIYIICAVILILGVREQREPYEAQQSEPIAYFRGLRLVM SHGPYIKLITGFLFTSLAFMLVEGNFVLFCTYTLGFRNEFQNLLLAIMLSATLTIPIWQW FLTRFGKKTAVYVGISSAVPFLILVALMESNLIITYAVAVAAGISVAAAFLLPWSMLPDV IDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLGISTLSLDFAGYQTRGCSQPERVKFT LNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKALQALRDEASSSGCSETDSTELASI L >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_2|1266_bp atgccagcttccctgtgtgtgaatgggatcatcttctccacgcccctggccgtcattgcc tacttcctcatctggttcgtgcccgacttcccacacggccagacctattggtacctgctt ttctattgcctctttgaaacaatggtcacgtgtttccatgttccctactcggctctcacc atgttcatcagcaccgagcagactgagcgggattctgccaccgcctatcggatgactgtg gaagtgctgggcacagtgctgggcacggcgatccagggacaaatcgtgggccaagcagac acgccttgtttccaggacctcaatagctctacagtagcttcacaaagtgccaaccataca catggcaccacctcacacagggaaacgcaaaaggcatacctgctggcagcgggggtcatt gtctgtatctatataatctgtgctgtcatcctgatcctgggcgtgcgggagcagagagaa ccctatgaagcccagcagtctgagccaatcgcctacttccggggcctacggctggtcatg agccacggcccatacatcaaacttattactggcttcctcttcacctccttggctttcatg ctggtggaggggaactttgtcttgttttgcacctacaccttgggcttccgcaatgaattc cagaatctactcctggccatcatgctctcggccactttaaccattcccatctggcagtgg ttcttgacccggtttggcaagaagacagctgtatatgttgggatctcatcagcagtgcca tttctcatcttggtggccctcatggagagtaacctcatcattacatatgcggtagctgtg gcagctggcatcagtgtggcagctgccttcttactaccctggtccatgctgcctgatgtc attgacgacttccatctgaagcagccccacttccatggaaccgagcccatcttcttctcc ttctatgtcttcttcaccaagtttgcctctggagtgtcactgggcatttctaccctcagt ctggactttgcagggtaccagacccgtggctgctcgcagccggaacgtgtcaagtttaca ctgaacatgctcgtgaccatggctcccatagttctcatcctgctgggcctgctgctcttc aaaatgtaccccattgatgaggagaggcggcggcagaataagaaggccctgcaggcactg agggacgaggccagcagctctggctgctcagaaacagactccacagagctggctagcatc ctctag >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_3|429_aa MPRWSIMADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYVQAFDSLLAGP VAEYLKISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAPISEQIKE VITFREKNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNRVLKEYKD VMGSTGFLDFSTTKVFSTISCSYESASRSSLFAQINQGESITHALKHVSDDMKTHKNPAL KAQSGPVRSGPKPFSAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTE LKQVAYIYKCVNTTLQIKGKINSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVP TISINKTDGCHAYLSKNSLDCEIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKL VTTVTEIAG >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_3|1290_bp atgcccaggtggtccattatggctgacatgcaaaatctggtagaaagattggagagggca gtgggccgcctggaggcagtatctcatacctctgacatgcaccgtgggtatgcagacagt ccttcaaaagcaggagcagctccatatgtgcaggcatttgactcgctgcttgctggtcct gtggcagagtacttgaagatcagtaaagagattgggggagacgtgcagaaacatgcggag atggtccacacaggtttgaagttggagcgagctctgttggttacagcttctcagtgtcaa cagccagcagaaaataagctttccgatttgttggcacccatctcagagcagatcaaagaa gtgataacctttcgggagaagaaccgaggcagcaagttgtttaatcacctgtcagctgtc agcgaaagtatccaggccctgggctgggtggctatggctcccaagcctggcccttatgtg aaagaaatgaatgatgccgccatgttttatacaaaccgagtcctcaaagagtacaaagat gtcatggggtccacaggtttcttagacttcagcactaccaaagtcttctctaccatttca tgctcatatgagtctgcttcccgctcatcactgttcgcgcagattaatcagggggagagc attacacatgccctgaaacatgtatctgatgacatgaagactcacaagaaccctgccctg aaggctcagagtggtccagtacgcagtggccccaaaccattctctgcacctaaaccccaa accagcccatcccccaaacgagccacaaagaaggagccagctgtacttgaactggagggc aagaagtggagagtggaaaatcaggaaaatgtttccaacctggtgattgaggacacagag ctgaaacaggtggcttacatatacaagtgtgtcaacacgacattgcaaatcaagggcaaa attaactccattacagtagataactgtaagaaacttggcctggtattcgatgacgtggtg ggcattgtggagataatcaacagtaaggatgtcaaagttcaggtaatgggtaaagtgcca accatatccatcaacaaaacagatggctgccatgcttacctgagcaagaattccctggat tgtgaaatagtcagtgccaaatcttccgagatgaatgtcctcattcctacagaaggcggt gactttaatgaattcccagttcctgagcagttcaagaccctatggaacgggcagaagttg gtcaccacagtgacagaaattgctggataa >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_4|203_aa MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGVFGLPRCPGESSHICDFI RKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALK KFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFL ATEGDHLQLSEEWFYAHIIPFLG >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_4|612_bp atggcgtcgcccggctgcctgtggctcttggctgtggctctcctgccatggacctgcgct tctcgggcgctgcagcatctggacccgccggcgccgctgccgttggtgatctggcatggg atgggtgtttttggactccctcgatgcccaggagagagctctcacatctgtgacttcatc cgaaaaacactgaatgctggggcgtactccaaagttgttcaggaacgcctcgtgcaagcc gaatactggcatgaccccataaaggaggatgtgtatcgcaaccacagcatcttcttggca gatataaatcaggagcggggtatcaatgagtcctacaagaaaaacctgatggccctgaag aagtttgtgatggtgaaattcctcaatgattccattgtggaccctgtagattcggagtgg tttggattttacagaagtggccaagccaaggaaaccattcccttacaggagacctccctg tacacacaggaccgcctggggctaaaggaaatggacaatgcaggacagctagtgtttctg gctacagaaggggaccatcttcagttgtctgaagaatggttttatgcccacatcatacca ttccttggatga >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_5|150_aa MAKPAAQAWVTEQDPVEEEKKEEKETEKEKGKEEEEEEEDEETYYEIQVILSMRKKKRRR RRRRRPSHEASVGSLGDKGSHNLKAPKAEGPEQGGSVLVVLVEANEDMVGAQLLLGELLE NCKAILAPLGQRTTRNLRCRGCRRSARLAS >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_5|453_bp atggcaaagcctgcagctcaagcctgggtgacagaacaagaccctgttgaagaagaaaag aaggaggagaaggagacagagaaggagaaggggaaagaggaggaggaggaagaggaggat gaggagacctactatgagattcaagtgattcttagtatgaggaagaagaagaggaggagg aggaggaggaggagacctagccatgaagcaagtgtcgggtctcttggggacaaggggtct cacaatctcaaagccccaaaagctgaaggtccggagcaaggcggctctgtcctcgtggtt cttgtggaagcaaatgaagacatggtcggcgctcagctgctcctcggcgaactgctggag aactgcaaagctatcctcgctcccctggggcagcgcaccaccaggaatctccgatgcaga ggctgccgccgctcagcacggctcgccagttaa >gi568815597f:39959347_40171530|GENSCAN_predicted_peptide_6|121_aa MAINLKLIVFPIRISHEGSLLTIVSTTPPFPSLPKPHWKYCHSYHLPPTLCLQLPQTRPQ SRGSRRWRYGAMTPNHGLSLDSISGSRYRRSLSPSPGDSGGVSLSVRHLPTAHQRVGNQA T >gi568815597f:39959347_40171530|GENSCAN_predicted_CDS_6|366_bp atggccatcaacctcaagttaattgtctttcccatcaggatatcccatgaaggctctctc ctcaccattgtctctacaacacctccttttccatcattaccaaaaccgcactggaaatac tgtcattcctaccacctcccaccaactctctgtctccagctgccacagacacggccgcag tcccgaggctcccggcgctggagatacggggcgatgaccccgaaccatggactcagtctc gactccatctccggctcccgctaccgccggagcctcagccccagccccggcgacagcggc ggcgtctccctttccgtccgccatcttcccacggcccaccagcgcgtaggcaaccaagcc acgtga