GENSCAN 1.0	Date run: 8-Nov-116	Time: 06:21:37

Sequence gi568815596f:98269976_98497252 : 227277 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    707    908  202  2  1  126   90   125 0.993  15.79
 1.02 Intr +   6916   7083  168  0  0   67   50    62 0.235   0.44
 1.03 Intr +   8472   8600  129  2  0   70   30    73 0.132   0.59
 1.04 Intr +  20536  20647  112  0  1  114   68    52 0.738   5.75
 1.05 Intr +  27932  28056  125  0  2   92   80   113 0.990  11.20
 1.06 Intr +  30104  30241  138  1  0  104   89   184 0.999  20.76
 1.07 Intr +  32087  32177   91  1  1  108   52    44 0.529   2.37
 1.08 Intr +  36097  36229  133  2  1   64   72    14 0.128  -2.90
 1.09 Intr +  41598  41816  219  0  0   31   53   186 0.840   6.82
 1.10 Intr +  41844  42057  214  0  1  101  -15   210 0.668  10.72
 1.11 Term +  43874  43990  117  1  0   46   39   162 0.981   5.64
 1.12 PlyA +  44750  44755    6                                 1.05

 2.00 Prom +  48270  48309   40                                -1.96
 2.01 Sngl +  50585  50989  405  1  0   48   44   154 0.895   3.28
 2.02 PlyA +  52231  52236    6                                 1.05

 3.05 PlyA -  52274  52269    6                                 1.05
 3.04 Term -  73091  72995   97  1  1  101   45    88 0.814   3.24
 3.03 Intr -  77114  76873  242  2  2   97   86    68 0.540   3.85
 3.02 Intr -  78918  78874   45  0  0   90   82    37 0.632   1.91
 3.01 Init -  87498  87430   69  0  0   98   94    23 0.734   4.95
 3.00 Prom -  90061  90022   40                                -6.56

 4.00 Prom +  91151  91190   40                                -6.46
 4.01 Init + 100001 100101  101  1  2   88  110    48 0.739   6.74
 4.02 Intr + 107712 107825  114  0  0   47   93   131 0.920   9.06
 4.03 Intr + 110200 110379  180  1  0   96  116    97 0.883  12.28
 4.04 Intr + 113413 113466   54  1  0  127   83    62 0.820   7.69
 4.05 Intr + 119683 119799  117  1  0   67  115   219 0.589  22.08
 4.06 Intr + 121889 121995  107  2  2   52  109   202 0.956  18.56
 4.07 Term + 125869 127280 1412  2  2  119   48  2182 0.999 208.52
 4.08 PlyA + 128603 128608    6                                 1.05

 5.05 PlyA - 128981 128976    6                                 1.05
 5.04 Term - 135467 135361  107  0  2   10   48    90 0.056  -4.23
 5.03 Intr - 135732 135649   84  1  0   68  110    21 0.098   2.09
 5.02 Intr - 147416 147314  103  2  1   48   82    95 0.150   4.65
 5.01 Init - 160693 160607   87  1  0   68   69    64 0.204   3.14
 5.00 Prom - 171246 171207   40                                -4.06

 6.02 PlyA - 171679 171674    6                                 1.05
 6.01 Sngl - 175498 174650  849  1  0   67   42   314 0.909  19.59
 6.00 Prom - 194853 194814   40                                -3.96

 7.11 PlyA - 196802 196797    6                                 1.05
 7.10 Term - 197665 197486  180  1  0   -6   38   137 0.170  -3.09
 7.09 Intr - 200546 200414  133  1  1  110   89    65 0.990   9.45
 7.08 Intr - 202201 202014  188  1  2   65   37   120 0.792   3.09
 7.07 Intr - 204744 204685   60  0  0   64   94    46 0.492   1.73
 7.06 Intr - 207396 207218  179  1  2   74  -18   138 0.217   1.44
 7.05 Intr - 211351 211257   95  0  2   82   60    41 0.089   0.31
 7.04 Intr - 214511 214393  119  2  2   92   53    25 0.150  -1.34
 7.03 Intr - 219219 219158   62  1  2  145   65    42 0.483   6.05
 7.02 Intr - 220367 220207  161  1  2  110   82    -1 0.734   1.13
 7.01 Init - 220868 220735  134  2  2   22   94    88 0.541   2.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_1|549_aa
XLDANKPIQYLENKTVLNQALERLNWPISLKELSMLESEILAGKMYIQQAMELQEAAKKN
YANKAPGELPCRFLSLGQLHDFSVSKNQRWEGDTAANAVSREPVPAFYAPASTGSSWLAQ
MRPKLLFSMQEESGHTNKLKMVNTGNFIADESDSQQDGELERGWSGKQQKLQGNPTKKTK
SKRPDPLKGQKVIARCDENGFYFPGVVKKCVSRTQALVGFSYGDTKVVSTSFITPVGGAM
PCPLLQVGDYVFAKIVIPKGFDFYVPAIVIALPNKHVATEKFYTVLKCNNRRGVCPQDPS
VAPAGSMSLTSSQVEDSLLPECGLCAFTRLSILPGQLFLALSHLGNSKTELKRQHHHVMS
QHPSPPRTGPYTQVLRAVGGSQTPKARGNELSGPKPSLKHWVHSAPETPFHGKRDVGKEL
RSSRGDGLDISVDPAAIKAGEDVEARNSAFLFWPLKEADTQDSREPRREKPRRKKRPAKQ
PLQQAAPSDSDGSSHGISSHGSCQGTHPEPRCTRPSFDVFNDSAYFSVMEEVAITTVNFH
LQPIQNGSN

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_1|1650_bp
nngttagatgcaaacaaaccaatacagtacttggaaaacaaaacagttttaaaccaggct
ttagaacggttgaattggcccatttcactgaaagagctgtcgatgctggaaagtgaaatc
ctagctgggaaaatgtacatccagcaggccatggaactccaggaggctgccaagaagaat
tatgcaaacaaggccccgggagagctgccttgccgattcctaagtttggggcagcttcat
gatttttctgtcagtaagaaccagcggtgggaaggagacacggcagcaaatgctgtgtcc
agggagcctgtgcctgctttttatgctccagcttccacgggcagttcctggctggcccag
atgagacctaaactcttgttcagcatgcaggaagaatcaggtcacacgaacaaattgaag
atggtaaatacagggaattttattgctgatgaaagtgactctcagcaggatggagagcta
gaaaggggatggagtgggaagcaacagaaattgcaaggaaatccaacaaagaaaaccaaa
tcaaaaagaccagatcccctcaaaggacagaaggttattgcaagatgtgatgaaaatggc
ttttattttccaggggttgtgaagaagtgtgtgagccgcacccaagcactggtgggcttc
agttacggagacaccaaggtcgtgtccacctccttcatcacgcctgtggggggcgccatg
ccctgcccgctgctccaggttggagattatgtgtttgccaaaattgtgatacccaaagga
tttgacttctatgtccctgccattgtcatagcacttcccaataagcatgtggccacagaa
aaattctacacagttttgaagtgtaacaaccggagaggagtatgcccccaggaccccagt
gtggccccagccggctccatgagcttgacctccagccaggtggaagactcactgcttcct
gaatgcggtctctgtgccttcaccagattaagtattctccctggacagctattccttgcc
ctcagccacctggggaactccaagactgagctcaaacgtcaacaccaccatgtgatgtcc
cagcaccctagcccccccaggacgggtccgtatactcaagttctccgtgccgtgggaggc
tcccaaacaccaaaggctcgtggaaatgaactgagtggacctaagcccagcctaaagcat
tgggttcacagcgctccagagactcctttccatgggaagagggacgtgggcaaggagctg
agaagcagccgtggggatggcttggacatctctgtagaccccgcagccatcaaggcagga
gaggatgtggaggcgaggaactctgctttcctcttctggccactgaaagaagcggacacg
caggattccagagagccaagacgagagaagcccaggaggaaaaagaggcccgccaagcag
ccactccagcaggcggcgccctcggactcggacggctcctcccacggcatcagctcccat
gggtcctgccaggggacacaccccgagcccaggtgcacaaggccatcatttgatgtgttc
aatgatagtgcctacttcagtgtaatggaggaagttgcaattacaacggtgaacttccac
ctccaaccaatacagaatggcagcaactag

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_2|134_aa
MDNKVQAVVDSDGDEEIVGNCSKGCLCYAKRLAAFCLCSRDLWNFEIERDDLGYLVEEIS
KQQSTQEVTERRSSENLQPNDAIGEKTPFSGEKFQPTAEICISNEEPNANHQDNGENVSR
PCQRPSWQPLPSQA

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_2|405_bp
atggacaataaagtccaggctgtggtggactcagatggagatgaggaaattgttgggaac
tgtagtaaaggttgcctttgctatgcaaagagactggcagcattttgcctctgctctaga
gatctgtggaactttgaaattgagagagatgatttagggtatctggtggaagaaatttct
aagcagcaaagcactcaagaggtgacagagcgtagaagttcggaaaatttgcagcccaat
gatgcaataggagagaaaacgccattttctggggagaaattccagccaactgcagaaata
tgcataagtaatgaggagccaaatgctaatcaccaagacaatggggaaaatgtctccagg
ccctgtcagagaccttcatggcagcccctcccatcacaggcctag

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_3|150_aa
MGAMQDGEEILNSALGTIKEASQGSVITSIYTRNAGHLATDSTSPGAARTREDAAPAPDL
EPGKWVLPGLAVRGQRDRRKCMGWGRGCKPYPSPAGASPAPSASRPDGSAGSAQRGCGTS
ALQTVAIGAALKTPVPAFHSPNFYCLTINA

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_3|453_bp
atgggtgcaatgcaggatggggaggaaatccttaattctgccttagggaccattaaggag
gcttcccagggctctgttatcacctccatttacacccggaatgcaggccacctggccact
gactcaacctcccccggcgcggcgcggactcgagaagacgcggcgcccgctcccgacttg
gagccggggaagtgggtccttcccggactggcggtccggggacagcgtgacaggaggaaa
tgtatggggtggggtcgcggctgcaagccctacccgtccccagcgggagccagccctgct
ccttctgcctcccgccccgacggctcagcgggctcggcccagcgcggctgtgggacgtca
gctttgcaaacagtcgctataggggctgcccttaagaccccagtgccagctttccactct
ccaaatttctactgtctgaccatcaatgcctga

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_4|694_aa
MAKINTQYSHPSRTHLKVKTSDRDLNRAENGLSRAHSSSEETSSVLQPGIAMETRGLADS
GQGSFTGQGIARLSRLIFLLRRWAARHVHHQDQGPDSFPDRFRGAELKEVSSQESNAQAN
VGSQEPADRGRSAWPLAKCNTNTSNNTEEEKKTKKKDAIVVDPSSNLYYRWLTAIALPVF
YNWYLLICRACFDELQSEYLMLWLVLDYSADVLYVLDVLVRARTGFLEQGLMVSDTNRLW
QHYKTTTQFKLDVLSLVPTDLAYLKVGTNYPEVRFNRLLKFSRLFEFFDRTETRTNYPNM
FRIGNLVLYILIIIHWNACIYFAISKFIGFGTDSWVYPNISIPEHGRLSRKYIYSLYWST
LTLTTIGETPPPVKDEEYLFVVVDFLVGVLIFATIVGNVGSMISNMNASRAEFQAKIDSI
KQYMQFRKVTKDLETRVIRWFDYLWANKKTVDEKEVLKSLPDKLKAEIAINVHLDTLKKV
RIFQDCEAGLLVELVLKLRPTVFSPGDYICKKGDIGKEMYIINEGKLAVVADDGVTQFVV
LSDGSYFGEISILNIKGSKSGNRRTANIRSIGYSDLFCLSKDDLMEALTEYPEAKKALEE
KGRQILMKDNLIDEELARAGADPKDLEEKVEQLGSSLDTLQTRFARLLAEYNATQMKMKQ
RLSQLESQVKGGGDKPLADGEVPGDATKTEDKQQ

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_4|2085_bp
atggccaagatcaacacccaatactcccacccctccaggacccacctcaaggtaaagacc
tcagaccgagatctcaatcgcgctgaaaatggcctcagcagagcccactcgtcaagtgag
gagacatcgtcagtgctgcagccggggatcgccatggagaccagaggactggctgactcc
gggcagggctccttcaccggccaggggatcgccaggctgtcgcgcctcatcttcttgctg
cgcaggtgggctgccaggcatgtgcaccaccaggaccagggaccggactcttttcctgat
cgtttccgtggagccgagcttaaggaggtgtccagccaagaaagcaatgcccaggcaaat
gtgggcagccaggagccagcagacagagggagaagcgcctggcccctggccaaatgcaac
actaacaccagcaacaacacggaggaggagaagaagacgaaaaagaaggatgcgatcgtg
gtggacccgtccagcaacctgtactaccgctggctgaccgccatcgccctgcctgtcttc
tataactggtatctgcttatttgcagggcctgtttcgatgagctgcagtccgagtacctg
atgctgtggctggtcctggactactcggcagatgtcctgtatgtcttggatgtgcttgta
cgagctcggacaggttttctcgagcaaggcttaatggtcagtgataccaacaggctgtgg
cagcattacaagacgaccacgcagttcaagctggatgtgttgtccctggtccccaccgac
ctggcttacttaaaggtgggcacaaactacccagaagtgaggttcaaccgcctactgaag
ttttcccggctctttgaattctttgaccgcacagagacaaggaccaactaccccaatatg
ttcaggattgggaacttggtcttgtacattctcatcatcatccactggaatgcctgcatc
tactttgccatttccaagttcattggttttgggacagactcctgggtctacccaaacatc
tcaatcccagagcatgggcgcctctccaggaagtacatttacagtctctactggtccacc
ttgacccttaccaccattggtgagaccccaccccccgtgaaagatgaggagtatctcttt
gtggtcgtagacttcttggtgggtgttctgatttttgccaccattgtgggcaatgtgggc
tccatgatctcgaatatgaatgcctcacgggcagagttccaggccaagattgattccatc
aagcagtacatgcagttccgcaaggtcaccaaggacttggagacgcgggttatccggtgg
tttgactacctgtgggccaacaagaagacggtggatgagaaggaggtgctcaagagcctc
ccagacaagctgaaggctgagatcgccatcaacgtgcacctggacacgctgaagaaggtt
cgcatcttccaggactgtgaggcagggctgctggtggagctggtgctgaagctgcgaccc
actgtgttcagccctggggattatatctgcaagaagggagatattgggaaggagatgtac
atcatcaacgagggcaagctggccgtggtggctgatgatggggtcacccagttcgtggtc
ctcagcgatggcagctacttcggggagatcagcattctgaacatcaaggggagcaagtcg
gggaaccgcaggacggccaacatccgcagcattggctactcagacctgttctgcctctca
aaggacgatctcatggaggccctcaccgagtaccccgaagccaagaaggccctggaggag
aaaggacggcagatcctgatgaaagacaacctgatcgatgaggagctggccagggcgggc
gcggaccccaaggaccttgaggagaaagtggagcagctggggtcctccctggacaccctg
cagaccaggtttgcacgcctcctggctgagtacaacgccacccagatgaagatgaagcag
cgtctcagccaactggaaagccaggtgaagggtggtggggacaagcccctggctgatggg
gaagttcccggggatgctacaaaaacagaggacaaacaacagtga

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_5|126_aa
MTQRKEDLRKSLDVVHGEEESQQMAKVTQACTATANEEKIMGQSHSDTDTTSAHLISLTS
TVAAISALGGTPIPGMLWFLQTLRCITLVVSDLKPAQYLVMPKASSDHCPATTYIHSRPK
GYTISR

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_5|381_bp
atgacccagagaaaggaagatttaagaaaaagcctagacgttgttcatggagaagaagag
agtcagcaaatggcaaaagtcacacaggcttgcacagccactgcaaatgaggagaaaatc
atgggacagtcacactcagacaccgacaccacttctgctcacctcatcagcctaacttca
acggtggctgccatatctgctttagggggcaccccaatcccgggaatgctgtggttcttg
cagactcttagatgtatcaccttggtggtctcagacctgaagccagcacagtacttggtc
atgcccaaggcctccagtgaccactgcccggctaccacctatattcactcaaggcccaag
ggctatacaatcagcagatga

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_6|282_aa
MHVYFHPVMFPPQTRQGLQLQEPVDPALEPHRAGTASASRGHAPARHPAPCWGGRVFTGL
GVGRARIVLKRRHRRWTTGTVRQCNFHENRGRRPHPPQRGPGRAPAPAVRPRPRPPRRPR
RLLPGPAHLPGAPRPPAPAAPLSRRAARQSPAAAAAPPPGCEVAGCGPGLRCWSSCPGPR
SPTRRQTSSEPSTRAAALASRRRSRRRAQPRPGPAPLSRMRAEGPARALRPPPHYQPELG
PPSETVGPVTELLAPIPTARFPQDPTLSCDCFTCKEFLNFWT

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_6|849_bp
atgcatgtttactttcacccggttatgtttccaccccaaacacggcaaggcctccagctt
caggagcccgtagacccagccctggagccgcatcgggcaggcaccgcgtctgcgtcccgc
ggccacgcgcccgctcggcacccggctccctgctggggcggccgcgtcttcacggggctg
ggggtcggccgggctcggattgttctcaaaagacgccaccgcaggtggaccactggcacc
gttaggcagtgcaatttccatgaaaaccgcgggcgccgtccacatccgccccagcgggga
cctggacgcgcccccgcgcccgctgtccgtccccggccccggcccccgcggcggccgcgt
cgcctcctgcccggcccggcccacctacctggcgctccccggcccccggcgccagccgcc
ccgctgtcccgccgggctgcgcgccagagccccgcagcagcagcagcgccgccgcctggc
tgtgaagtcgcgggctgcgggcccggcctcaggtgctggtcctcctgccccggcccacgg
agcccgacccggcggcaaaccagcagcgaaccctccacgcgcgccgcagccctagccagc
cgccgccgctctagacgccgggcgcagccccgccctggccccgcccccttgtcgcgcatg
cgcgctgagggcccggcccgcgccctgcgacccccgccccactaccagcccgagctgggt
ccgccctctgagacagtagggccggtgaccgagctgctcgcgccaattcctactgctagg
ttcccacaagatcccaccctatcctgtgactgcttcacctgcaaggagttcttgaacttc
tggacttga

>gi568815596f:98269976_98497252|GENSCAN_predicted_peptide_7|436_aa
MTAKGLIKLKLLFGRRQAEDLHPHQLQVTVGDSGLIYSEAQASHRDSHVICSMLRKRKPD
ETPSPSSDAQFSGMLAVAVSSLLCPRSSPSQQWHCHSPSTTILGSSDVSDTSFQPSTKLG
PVHGFPPQSRDCILDFELIRKVTGAWSSLTAGSWETKIQEFSEEDILAPHRITPGPHSSK
SPVHFLASLWGRKTKLYHHTIQDKEPRSVPEAGSSGRRDGGGAAHSMLCMAPEPDHPGSN
PNCRSPATLHADADGNLQREAAKCQGDGTQVSTHVDHTSAPRQAETGPATGENEKMESHG
RSKRGLVGADQCPAETLDSYTLLVGHKKRDKDSAWLRRRKAPPGLLAGAEAQRAMPPCGG
KGQTTKEQGLRAQKQRQEKKLHIKNTEVKSMEAEKIISTQFQGQRTLMMLGSLDLLSPHV
DGVANIADSFTVLTAT

>gi568815596f:98269976_98497252|GENSCAN_predicted_CDS_7|1311_bp
atgacagccaaaggactgattaaactcaagctgttgtttggccggaggcaggcagaggat
ctccatccccaccagctgcaggtgactgttggagattctggcctcatctactcggaggcc
caggcgtcacacagagattcccatgtgatatgcagcatgctcagaaaaaggaaaccggat
gagacacccagcccttcaagtgatgcccaattttctgggatgctcgctgttgctgtgtcc
agcctgctctgcccccggtcttctccatctcagcaatggcactgccattctccaagtacc
accattcttgggagctcagatgtatctgacacctctttccaaccatcaaccaaactgggc
cctgtgcatggcttcccaccccagtcacgtgactgcattctggactttgagctcatcagg
aaagtaactggggcttggtcttccctgactgcaggatcatgggaaacaaagatacaggaa
ttttcagaggaagacatccttgctccacacaggatcacaccaggtccacattcctccaag
tctcctgttcacttcctggcctctctgtggggcaggaagaccaagctgtaccaccacacc
attcaggacaaagagccaagaagcgtgcctgaagcaggcagctcagggagacgggacggc
gggggagctgcacacagcatgctgtgcatggccccagagccagaccatccgggctccaac
cccaactgccgctcacccgccacgctacacgctgatgctgatggcaatctgcaaagagag
gctgccaaatgtcagggtgatggcacccaggtctccacccatgtggaccatacttctgct
ccacggcaggcagaaacaggcccagccacaggagagaacgagaaaatggagtcacatggg
agatcaaaaagaggccttgttggagcagaccaatgcccagctgaaactcttgattcctac
accctccttgtggggcacaaaaagagggacaaggacagtgcctggctgagaaggcgaaag
gccccaccaggcctccttgcaggggccgaagcccagcgggccatgcctccctgcggaggg
aagggacagaccactaaggagcaaggacttagagctcagaagcagcgccaagaaaagaag
cttcacatcaagaacacagaagttaaatccatggaggctgagaagataatttccactcag
ttccaagggcaaaggacactgatgatgctaggttcactggacctgttgtccccacatgtg
gacggtgtcgctaacattgctgacagcttcacagtcttaacagcaacataa