GENSCAN 1.0    Date run: 5-Nov-116    Time: 10:17:59

Sequence gi568815595f:194036381_194238230 : 201850 bp : 46.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2744   2904  161  1  2   34   49   119 0.158   1.90
 1.02 Intr +   7386   7450   65  0  2   96   97    20 0.132   2.06
 1.03 Term +  15071  15237  167  0  2  105   38    66 0.146   1.38
 1.04 PlyA +  16416  16421    6                               1.05

 2.03 PlyA -  19237  19232    6                               1.05
 2.02 Term -  21767  21743   25  1  1  131   54    50 0.111   3.60
 2.01 Init -  27540  27407  134  0  2   73   58   116 0.103   6.72
 2.00 Prom -  43245  43206   40                              -3.66

 3.03 PlyA -  44387  44382    6                               1.05
 3.02 Term -  63236  63088  149  0  2   70   47   163 0.984   8.46
 3.01 Init -  65881  65707  175  1  1   54   72    86 0.400   3.11
 3.00 Prom -  70979  70940   40                              -5.66

 4.00 Prom +  71438  71477   40                              -4.16
 4.01 Init +  71581  71782  202  0  1   91   20   187 0.222  11.24
 4.02 Term +  79645  79700   56  2  2   99   41    74 0.450   1.52
 4.03 PlyA +  80051  80056    6                               1.05

 5.03 PlyA -  80421  80416    6                               1.05
 5.02 Term -  82727  82569  159  2  0  107   48   115 0.546   7.34
 5.01 Init -  92076  92041   36  0  0   84  105    38 0.178   3.28
 5.00 Prom -  94321  94282   40                              -7.36

 6.00 Prom +  96486  96525   40                              -4.66
 6.01 Init + 100001 100108  108  1  0   58  116   160 0.803  16.02
 6.02 Intr + 100237 100332   96  0  0   54   95    79 0.975   5.41
 6.03 Intr + 100581 100668   88  2  1   88   76   129 0.966  11.24
 6.04 Term + 101303 101853  551  0  2  121   55   756 0.998  70.06
 6.05 PlyA + 102206 102211    6                               1.05

 7.00 Prom + 114008 114047   40                              -5.26
 7.01 Sngl + 118028 118216  189  1  0   93   37   173 0.711   7.88
 7.02 PlyA + 118970 118975    6                               1.05

 8.08 PlyA - 121150 121145    6                               1.05
 8.07 Term - 130229 130149   81  2  0  101   48    35 0.055  -1.51
 8.06 Intr - 139303 139251   53  2  2  123   95    -8 0.175   2.03
 8.05 Intr - 140415 140283  133  0  1   25   30   150 0.252   3.12
 8.04 Intr - 141503 141371  133  1  1   93   44    88 0.271   5.55
 8.03 Intr - 188012 187890  123  1  0   14   60   119 0.162   1.40
 8.02 Intr - 188955 188871   85  1  1   73   76    78 0.839   3.98
 8.01 Init - 190663 190573   91  1  1   73   81    84 0.909   6.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_1|130_aa
MSDGSPERRADGHSIYNQQCPVQEEQRLCELVALPDISLSAFSCKNLGNQLCYRIQTLQS
KYNTEKSFSTQLSTFELAWELPMAIPTWPLLVTSLARRSLLPKGYSLREHENQCVNNGSM
HQETSPQLQP

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_1|393_bp
atgagtgacgggtcccctgagagaagagcggatggccacagcatctacaaccagcagtgc
ccagtgcaggaggaacagcggctttgcgagcttgtggctcttccagacatttccttatca
gctttctcctgcaagaatctgggcaatcaactgtgttacagaattcagactcttcagagc
aaatacaacaccgagaaatcattctccactcagctaagcacattcgaacttgcatgggaa
ttacccatggccattcccacatggccactcctggtcacatctctggccagaaggtctctc
ctccccaagggctacagcttaagggaacatgaaaatcagtgtgtgaacaatggcagcatg
caccaggaaaccagccctcagcttcagccctga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_2|52_aa
MCKEQWGVRGTDVTFRVRVSQIIWEGHTYNRDPVKDEPETVLTGICYVVNRD

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_2|159_bp
atgtgcaaggagcagtggggagtcagagggacagatgtgacctttcgtgtcagggtgtcc
cagattatttgggaaggccacacctacaaccgtgacccagtcaaagacgagcctgaaaca
gtactgactgggatatgttatgtggtcaacagggactga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_3|107_aa
MWCSANGQTTAVEMPLRAQRKGTLVHLGEATPFLSQPGEWELYLLKAIMMWSLCRVTVAL
RQRDVLMAANPKGAVPSDDATFSTRELFSFLAVAAAREESTGNVHQL

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_3|324_bp
atgtggtgctcagccaatgggcagacaacagcagtggagatgccactgagggcccagaga
aaaggcactttggtacatctgggagaggccactcccttcctgagccagccaggagagtgg
gagctgtacctcttgaaagccatcatgatgtggagtctctgccgggtcacagtggctctt
cggcagagggatgtgctgatggctgccaatccgaagggtgctgtaccatctgacgatgcc
accttctcaacacgggagctgttctctttcctggctgttgctgcggccagggaagagagc
acaggaaatgtgcaccaactctga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_4|85_aa
MALDNPASTSVNYELLDSQISLTKSGNMANMKEKEDYGLIEGMLSNIVNGVLKDGLAEDI
KMASFCNSCCDYQAVRSLEVAAKRS

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_4|258_bp
atggctttagacaatccagcaagcaccagtgtgaactatgagcttcttgatagccaaata
tcactgacaaaaagtggaaatatggctaacatgaaagagaaagaagactatggcctgata
gaagggatgttgagtaacatagtgaatggggtcctcaaggatggcttagcagaagacatc
aagatggccagcttttgtaatagctgctgtgactatcaggctgttcggagtctggaagtg
gctgctaaacgctcgtga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_5|64_aa
MNFIAKTLSHLLATTTPTPTSPLNTCCLGHGQQLGSASRVQPLEEGTTSNAAAASEALKH
KNQH

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_5|195_bp
atgaattttatagctaagacactgagccatctgctggcaaccaccaccccaacacccact
agccctctgaacacctgctgtctaggtcatggacagcaactaggctctgcttcacgtgtg
cagcccctggaggaaggtacaacctcaaatgccgctgcagcctcagaggcccttaaacac
aagaaccagcactga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_6|280_aa
MPADIMEKNSSSPVAATPASVNTTPDKPKTASEHRKSSKPIMEKRRRARINESLSQLKTL
ILDALKKDSSRHSKLEKADILEMTVKHLRNLQRAQMTAALSTDPSVLGKYRAGFSECMNE
VTRFLSTCEGVNTEVRTRLLGHLANCMTQINAMTYPGQPHPALQAPPPPPPGPGGPQHAP
FAPPPPLVPIPGGAAPPPGGAPCKLGSQAGEAAKVFGGFQVVPAPDGQFAFLIPNGAFAH
SGPVIPVYTSNSGTSVGPNAVSPSSGPSLTADSMWRPWRN

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_6|843_bp
atgccagctgatataatggagaaaaattcctcgtccccggtggctgctaccccagccagt
gtcaacacgacaccggataaaccaaagacagcatctgagcacagaaagtcatcaaagcct
attatggagaaaagacgaagagcaagaataaatgaaagtctgagccagctgaaaacactg
attttggatgctctgaagaaagatagctcgcggcattccaagctggagaaggcggacatt
ctggaaatgacagtgaagcacctccggaacctgcagcgggcgcagatgacggctgcgctg
agcacagacccaagtgtgctggggaagtaccgagccggcttcagcgagtgcatgaacgag
gtgacccgcttcctgtccacgtgcgagggcgttaataccgaggtgcgcactcggctgctc
ggccacctggccaactgcatgacccagatcaatgccatgacctaccccgggcagccgcac
cccgccttgcaggcgccgccaccgcccccaccgggacccggcggcccccagcacgcgccg
ttcgcgccgccgccgccactcgtgcccatccccgggggcgcggcgccccctcccggcggc
gccccctgcaagctgggcagccaggctggagaggcggctaaggtgtttggaggcttccag
gtggtaccggctcccgatggccagtttgctttcctcattcccaacggggccttcgcgcac
agcggccctgtcatccccgtctacaccagcaacagcggcacctccgtgggccccaacgca
gtgtcaccttccagcggcccctcgcttacggcggactccatgtggaggccgtggcggaac
tga

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_7|62_aa
MADPSIMAEPSLPAITAKAPDMRKPSLVLQLVESQDDYSTANAMWSRRVAQLSPLSPQHD
ER

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_7|189_bp
atggctgatcccagcatcatggctgagcccagcctgccggccatcactgccaaggcacca
gacatgaggaagccatctttggtgttgcagctggtggagtcccaggatgactacagcaca
gccaatgccatgtggagcagaagagtcgcccagctgagcccactcagccctcagcatgat
gaaagataa

>gi568815595f:194036381_194238230|GENSCAN_predicted_peptide_8|232_aa
MDWVAYKQQKFITVLEAAKSNIEALADQVSGSSLAFPESWRRGLAAVKLIASEIRKACRL
GGGGAAAGSGVDVRAAEEEWREEQLGVLLEEEAEPGQWGWVAGVDAESGEGEAADPILAE
ATAQEQRSCWPAEALGYGEEVIVLLGWLCFELSRTSDNSSTVPQQEKRLAQQPPADTGFS
YLNQNKMPGISRGPMRHSRGEKTLYLTPKAVRNLVANQKEETQALGPKCTQL

>gi568815595f:194036381_194238230|GENSCAN_predicted_CDS_8|699_bp
atggactgggtggcttataaacaacagaagtttatcacagttctggaggctgcgaagtcc
aacatcgaggcactggcagatcaagtgtctggctcctctcttgcattccctgagtcctgg
cggcggggcctggcagccgtcaagctcatcgcctctgagatcagaaaggcttgcagatta
ggtggtggtggagctgctgctggaagtggtgtggacgtacgggcagcggaggaggagtgg
agagaggagcagctgggggtgttgctggaggaggaagcagagccggggcagtggggctgg
gttgcaggggtggacgcagaatctggggagggagaggctgctgaccctatccttgcagag
gccacagcccaggagcagaggagctgctggccggcagaggccctggggtatggtgaggag
gtcattgtgctgctgggctggctctgcttcgagttatcccgcacctctgacaacagctcc
acggtgccccagcaggaaaagcgcttagctcagcagccgcctgcagacaccggcttttca
tacttaaatcaaaacaagatgccagggataagtcgcggtcccatgcgacacagcaggggt
gaaaagactttgtacctgacccctaaagctgtgaggaatcttgttgctaatcagaaggaa
gaaacccaggctcttggtcctaaatgcacacaattatga