GENSCAN 1.0	Date run: 4-Nov-116	Time: 14:17:57

Sequence gi568815581f:36225412_36434441 : 209030 bp : 45.81% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  13772  13774    3  1  0  113   81     0 0.703   1.80
 1.02 Intr +  16288  16377   90  0  0  110    0    65 0.075   0.09
 1.03 Term +  24162  24293  132  2  0  105   43   163 0.842  11.59
 1.04 PlyA +  25232  25237    6                                1.05

 2.14 PlyA -  25508  25503    6                                1.05
 2.13 Term -  27183  27026  158  1  2   95   39    96 0.006   3.50
 2.12 Intr -  29111  28899  213  0  0  101   49    81 0.005   4.19
 2.11 Intr -  30226  30074  153  2  0   85   80    82 0.020   7.14
 2.10 Intr -  30696  30597  100  0  1   91   89    78 0.022   7.88
 2.09 Intr -  32102  32050   53  0  2  100   38    72 0.016   2.03
 2.08 Intr -  32538  32418  121  0  1   92  110   146 0.994  17.27
 2.07 Intr -  33130  33082   49  0  1   97  117    62 0.987   8.68
 2.06 Intr -  34488  34379  110  0  2  102   82   146 0.996  14.48
 2.05 Intr -  34925  34818  108  2  0   99  101   160 0.975  18.88
 2.04 Intr -  35449  35369   81  1  0   52   92    42 0.602   0.83
 2.03 Intr -  35971  35932   40  0  1   80  116   -10 0.939  -0.77
 2.02 Intr -  37340  37255   86  2  2   81   79   118 0.923   8.82
 2.01 Init -  37574  37503   72  2  0   71   82    55 0.663   4.27
 2.00 Prom -  43250  43211   40                               -3.26

 3.00 Prom +  43539  43578   40                               -6.96
 3.01 Init +  47328  47330    3  2  0   71  101     0 0.685  -0.40
 3.02 Intr +  48788  48872   85  1  1  131  116    54 0.961  11.89
 3.03 Intr +  71898  71975   78  1  0   88  110    55 0.971   7.22
 3.04 Intr +  81886  82007  122  2  2   80   64    50 0.736   2.01
 3.05 Intr +  88026  88226  201  2  0   92   84   171 0.941  16.58
 3.06 Intr +  94545  94649  105  2  0   73   93   108 0.803  10.21
 3.07 Intr + 100235 100320   86  1  2   81   79   118 0.909   8.82
 3.08 Intr + 101604 101643   40  0  1   80  116   -10 0.919  -0.77
 3.09 Intr + 102126 102206   81  2  0   52   92    42 0.589   0.83
 3.10 Intr + 102650 102757  108  1  0  102  101   162 0.993  19.38
 3.11 Intr + 103087 103196  110  0  2  102   82   150 0.999  14.88
 3.12 Intr + 104445 104493   49  0  1   97  117    62 0.988   8.68
 3.13 Intr + 105037 105157  121  0  1   92  110   146 0.994  17.27
 3.14 Intr + 105473 105525   53  0  2  100   38    77 0.355   2.53
 3.15 Intr + 106879 106978  100  0  1   91   89    78 0.429   7.88
 3.16 Intr + 107349 107501  153  1  0   85   80    82 0.436   7.14
 3.17 Intr + 108465 108677  213  1  0  101   49    81 0.305   4.19
 3.18 Term + 110393 110550  158  0  2   95   39   129 0.375   6.80
 3.19 PlyA + 113850 113855    6                                1.05

 4.00 Prom + 131813 131852   40                               -4.06
 4.01 Init + 137655 137657    3  2  0  113   81     0 0.577   1.80
 4.02 Term + 148054 148185  132  0  0  105   43   126 0.556   7.89
 4.03 PlyA + 149124 149129    6                                1.05

 5.14 PlyA - 149400 149395    6                                1.05
 5.13 Term - 151075 150918  158  2  2   95   39   129 0.048   6.80
 5.12 Intr - 153003 152791  213  1  0  101   49    81 0.041   4.19
 5.11 Intr - 154137 153985  153  1  0   85   80    82 0.058   7.14
 5.10 Intr - 154607 154508  100  2  1   91   89    78 0.062   7.88
 5.09 Intr - 156013 155961   53  2  2  100   38    77 0.048   2.53
 5.08 Intr - 156449 156329  121  2  1   92  110   146 0.994  17.27
 5.07 Intr - 157041 156993   49  2  1   97  117    62 0.988   8.68
 5.06 Intr - 158399 158290  110  2  2  102   82   150 0.999  14.88
 5.05 Intr - 158836 158729  108  1  0  102  101   153 0.993  18.48
 5.04 Intr - 159360 159280   81  0  0   52   92    42 0.605   0.83
 5.03 Intr - 159882 159843   40  2  1   80  116   -10 0.949  -0.77
 5.02 Intr - 161250 161165   86  0  2   81   79   118 0.907   8.82
 5.01 Init - 161484 161413   72  0  0   71   82    55 0.670   4.27
 5.00 Prom - 162077 162038   40                              -14.47

 6.00 Prom + 162303 162342   40                              -14.86
 6.01 Init + 162748 162799   52  0  1  123  105    85 0.998  15.02
 6.02 Intr + 165162 165194   33  1  0   80  110    35 0.887   3.09
 6.03 Intr + 165881 166019  139  0  1   70   59   112 0.801   6.02
 6.04 Intr + 169188 169256   69  0  0   87  102    28 0.275   2.40
 6.05 Intr + 193868 193883   16  2  1  138   55     2 0.052  -2.55
 6.06 Term + 199135 199266  132  0  0  105   43   126 0.747   7.89
 6.07 PlyA + 200205 200210    6                                1.05

 7.08 PlyA - 200481 200476    6                                1.05
 7.07 Term - 202156 201999  158  2  2   95   39   133 0.212   7.20
 7.06 Intr - 204085 203873  213  2  0  101   49    85 0.181   4.59
 7.05 Intr - 205219 205067  153  2  0   85   80    86 0.204   7.54
 7.04 Intr - 205689 205590  100  0  1   91   89    78 0.222   7.88
 7.03 Intr - 207095 207043   53  0  2  100   38    77 0.140   2.53
 7.02 Intr - 207531 207411  121  0  1   92  110   146 0.994  17.27
 7.01 Intr - 208123 208075   49  0  1   97  117    85 0.984  10.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  32102  31945  158  0  2  100   48   123 0.977   7.60
S.002 Init +  38838  38889   52  2  1  123  105    85 0.939  15.02
S.003 Term - 156013 155856  158  2  2  100   48   128 0.937   8.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_1|74_aa
MVETELKLICGDVLDVLDKHLIPAATTGKSKAVCEMFDVRGKQHIQIPKLYTSSVTRHLH
HFRLMQDSQPLDRS

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_1|225_bp
atggttgagactgagctaaagttaatctgtggcgacgttctggatgtactggacaaacac
ctcattccagcagctacaactggcaagtccaaggcagtctgtgagatgtttgatgtccga
ggcaaacagcacattcagatccccaagctctacacctccagtgtgaccaggcacctgcac
cacttcaggctcatgcaggactcacagcctttggaccgcagctaa

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_2|447_aa
MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL
TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM
KMKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY
NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAQAPAAIGAHEWADQAQ
ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE
DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR
FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNSIVNARRRNLTVRPGFS
RVARLLGDGCDPEDRAQASVMPGWNEL

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_2|1344_bp
atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg
aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga
agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg
actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat
atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag
ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg
aagatgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag
cacatccagcgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagg
gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat
aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat
cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc
ctgcaggctgcccaggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag
atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg
ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc
aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag
gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac
ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca
cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg
ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt
cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca
ggcgtcaactccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc
agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta
atgccaggatggaatgagctgtga

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_3|621_aa
MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL
KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFENVIDRKPYPDDENLVEVKFARTPV
TSTYLVAFVVGEYDFVETRSKDGVCVCVYTPVGKAEQGKFALEVSVGHPSEVDEICDAIS
YSKGASVIRMLHDYIGDKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAK
QIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNIEEMKLKNPG
RYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEYNPEVGY
CRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISLGLT
LRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKH
LRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIW
SASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVARLL
GDGCDPEDRAQASVMPGWNEL

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_3|1866_bp
atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca
gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat
gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacgggaacctta
aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat
actaccccttctggagaggtgcgctatgctgctgtaacacagtttgagaatgtaattgac
cggaaaccataccctgatgatgaaaatttagtggaagtgaagtttgcccgcacacctgtt
acatctacatatctggtggcatttgttgtgggtgaatatgactttgtagaaacaaggtca
aaagatggtgtgtgtgtctgtgtttacactcctgttggcaaagcagaacaaggaaaattt
gcattagaggtcagtgtgggccatccatctgaggttgatgagatatgtgatgctatatca
tatagcaaaggtgcatctgtcatccgaatgctgcatgactacattggggataagggacac
cgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaacaacaacgtc
gatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgggaggcgaag
caaattcggcgggagatcagccgaaagagcaagtgggtggatatgctgggagactgggag
aaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgcccatgaacatc
cggggcccgatgtggtcagtcctcctgaacattgaggaaatgaagttgaaaaaccccgga
agataccagatcatgaaggagaagggcaagaggtcatctgagcacatccagcgcatcgac
cgggacataagcgggacattaaggaagcatatgttcttcagggatcgatacggaaccaag
cagcgggaactactccacatcctcctggcatatgaggagtataacccggaggtgggctac
tgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggaggatgca
ttctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcccgggct
cctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctcgggctcacc
ctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccgataacaaga
atcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccgtgggca
cgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctcaagcat
cttagggcctctatgaagaaactaacaagaaagcagggggacctgccacccccagccaaa
cccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcgggaagaccctc
tgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcggcccatttgg
tcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggctgtccgg
gaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgccattgtt
aatgcacggaggaggaacctgactgttagacctgggttttccagggttgcacggcttctg
ggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatggaatgag
ctgtga

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_4|44_aa
MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_4|135_bp
atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc
tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct
ttggacctcagctaa

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_5|447_aa
MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL
TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM
KLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEY
NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQ
ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE
DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR
FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFS
RVARLLGDGCDPEDRAQASVMPGWNEL

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_5|1344_bp
atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg
aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga
agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg
actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat
atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag
ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg
aagttgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag
cacatccagcgcatcgaccgggacataagcgggacattaaggaagcatatgttcttcagg
gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat
aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat
cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc
ctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag
atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg
ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc
aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag
gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac
ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca
cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg
ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt
cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca
ggcgtcaacgccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc
agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta
atgccaggatggaatgagctgtga

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_6|146_aa
METKEPIVYTGSVERAPGPFGALATSSPDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG
EQGIPHTPHFEDAGRGWNICPEKANTSIGHVSLSPFATPLVMYAVCEMFHVRGKQHIQIP
KLYTSSVTRHLHHFRLMQDSQPLDLS

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_6|441_bp
atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt
ggagccctggccacgtcctccccagatcacacgatggaagctggcagccccgtgggcacc
actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga
gagcagggcataccccacacccctcattttgaggatgctgggaggggatggaatatttgt
cctgagaaggccaatacatccatcggacacgtgtctctatccccatttgctacgcctttg
gttatgtatgcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatcccc
aagctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactca
cagcctttggacctcagctaa

>gi568815581f:36225412_36434441|GENSCAN_predicted_peptide_7|282_aa
XQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAR
APAAIGAHEWADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPW
ARFCNRFVDTWARDEDTVLKHLRASMKKLTRKKGDVPPPAKPEQGSSASRPVPASRGGKT
LCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAI
VNARRRNLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWDEL

>gi568815581f:36225412_36434441|GENSCAN_predicted_CDS_7|849_bp
nngcagcgggaactactccacatcctcctggcatatgaggagtacaacccggaggtgggc
tactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggaggat
gcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcccgg
gctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctcgggctc
accctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccgataaca
agaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccgtgg
gcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctcaag
catcttagggcctctatgaagaaactaacaagaaagaagggggacgtgccacccccagcc
aaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcgggaagacc
ctctgcaagggggacagacaggcccctccaggcccaccagcccggttcccgcggcccatt
tggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggctgtc
cgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgccatt
gttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgcacggctt
ctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatgggat
gagctgtga