GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:40:39 Sequence gi568815596f:218034802_218235881 : 201080 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 44 234 191 0 2 44 80 150 0.560 9.10 1.02 Intr + 280 383 104 0 2 69 92 25 0.313 -0.03 1.03 Intr + 8839 8954 116 1 2 70 81 92 0.854 6.79 1.04 Term + 23794 23912 119 2 2 100 45 78 0.068 3.40 1.05 PlyA + 24338 24343 6 1.05 2.02 PlyA - 24381 24376 6 -1.75 2.01 Sngl - 25942 25538 405 1 0 85 47 514 0.679 43.28 2.00 Prom - 27009 26970 40 -8.26 3.00 Prom + 27091 27130 40 -6.66 3.01 Init + 28564 28621 58 0 1 67 75 79 0.900 5.97 3.02 Intr + 30719 30838 120 0 0 95 46 57 0.758 2.67 3.03 Intr + 33035 33085 51 0 0 119 59 59 0.822 4.98 3.04 Intr + 35952 36058 107 1 2 96 75 135 0.502 12.93 3.05 Intr + 37573 37698 126 0 0 73 63 93 0.985 6.18 3.06 Intr + 37978 38084 107 0 2 78 75 101 0.955 6.81 3.07 Intr + 38442 38585 144 0 0 37 70 158 0.406 8.30 3.08 Intr + 39015 39084 70 0 1 114 46 -42 0.494 -6.62 3.09 Intr + 40292 40939 648 1 0 92 100 205 0.386 14.24 3.10 Intr + 41626 41732 107 0 2 113 91 142 0.994 16.01 3.11 Intr + 48309 48455 147 0 0 106 72 180 0.840 17.55 3.12 Intr + 54451 54561 111 1 0 77 103 44 0.760 4.29 3.13 Term + 55151 55253 103 2 1 103 47 44 0.623 -0.45 3.14 PlyA + 58945 58950 6 1.05 4.02 PlyA - 59887 59882 6 1.05 4.01 Sngl - 62128 61919 210 1 0 60 42 187 0.321 6.49 4.00 Prom - 68266 68227 40 -2.46 5.02 PlyA - 68550 68545 6 -0.45 5.01 Sngl - 69382 68642 741 1 0 86 43 265 0.861 18.01 5.00 Prom - 71462 71423 40 -4.96 6.03 PlyA - 71631 71626 6 1.05 6.02 Term - 72819 72656 164 1 2 82 53 129 0.960 6.90 6.01 Init - 73134 72903 232 0 1 60 71 94 0.955 3.32 6.00 Prom - 74676 74637 40 -9.55 7.00 Prom + 76694 76733 40 -2.26 7.01 Init + 78116 78220 105 1 0 72 72 97 0.606 6.72 7.02 Intr + 78422 78593 172 1 1 44 74 101 0.922 3.82 7.03 Intr + 86881 87006 126 2 0 29 79 63 0.157 0.15 7.04 Intr + 94513 94564 52 2 1 107 100 -8 0.030 0.27 7.05 Term + 99976 101083 1108 1 1 103 47 1190 0.060 107.94 7.06 PlyA + 102424 102429 6 1.05 8.02 PlyA - 105317 105312 6 1.05 8.01 Sngl - 130410 129358 1053 0 0 61 48 1233 0.730 113.76 8.00 Prom - 143040 143001 40 -0.16 9.00 Prom + 146626 146665 40 -2.96 9.01 Init + 148658 148663 6 1 0 84 99 0 0.294 1.67 9.02 Term + 166267 166479 213 0 0 9 48 237 0.229 9.03 9.03 PlyA + 166978 166983 6 1.05 10.00 Prom + 178726 178765 40 -2.46 10.01 Init + 182291 182333 43 1 1 97 37 106 0.460 4.98 10.02 Intr + 182662 182743 82 2 1 55 88 234 0.405 18.80 10.03 Intr + 191119 191153 35 1 2 113 115 14 0.990 4.47 10.04 Intr + 193937 194049 113 0 2 86 94 72 0.954 7.70 10.05 Intr + 199551 199596 46 2 1 76 91 31 0.013 0.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_1|176_aa XQPSDTEETTTGIAAAELHGLPVPPGGEKEEKQEQVLKTKRKLAAVPATTMDQPPFGQQV TLPQRPGSCLYHFFPAQSLGTDSGSRAALSCPSQPGSPGWGKCMLIGPSAATGGHKKKHH KLHLWSTGLAAQPPDFEGQWGARRDLMCEIWKRELEQQFHSASAQEGQCKVPDTTI >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_1|531_bp ncccaaccaagcgacacggaagagacaaccacaggaatagcagctgcggagctccatggc ctccctgtgccccctggtggcgagaaggaagaaaagcaagagcaggtgctgaaaacgaaa aggaaactagcagcagtcccagctaccacaatggatcagcccccatttggccaacaggtg actctaccccagcgccctggatcctgcctctaccacttcttccctgcgcagtcactaggg acggactctggcagcagagccgcgctgtcgtgccccagccagccaggctctcctggatgg gggaagtgcatgctgattggtccatcagcagccacaggtgggcacaagaaaaagcaccac aagctccacctctggtccacaggactggcagcccagcccccagactttgaaggtcagtgg ggggccagaagggacctcatgtgtgagatctggaagcgggaattggagcagcagtttcat tctgcttcagctcaggaaggtcagtgcaaggtgccagacaccaccatatga >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_2|134_aa MLFCYRFTLHTLFKAHMGQKHWTMWVIFAVVLIFLLCWLPYNLVLLADTLMGTQMTNETC ERRNDINQALDATEILGILHSYLNPLIYAFIGQKFCHGLLKIIAIHGLISKDSLPKDSRP SFVGSSSGHTSTTL >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_2|405_bp atgctgttctgctacagattcaccctgcatacgctgtttaaggcccatatggggcagaag cactggaccatgtgggtcatctttgctgttgtcctcattttcctgctctgctggctgccc tacaacctggtcctgctggcagacaccctcatgggaacccagatgaccaatgagacctgt gagcgccgcaacgacatcaaccaggccctggatgccactgagattctgggcatccttcac agctacctcaatcccctcatctacgccttcattggccagaagttttgccatggacttctc aagattatagccatacacggcttgatcagcaaggactccctgcccaaagacagcaggcct tcctttgttggctcttcttcagggcacacttccactactctctaa >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_3|632_aa MPKNNVGEEVEKSELSHIAEEPLNEVGVWDKGFPGQEWWLVVGVAAVAGKKLVSQKKAAT WPNRHPLIRVKAGCVGAAVSAILQGYGDGQGPVTDTSAELHRLCGCLELLLQFDQKEQKS FLGPRKDYWDFLCTALRRQRGNMEPIHFVRSQDKLKTPLGKGRAFIRFCLARGQLAEALQ LCLLNSELTREWYGPRSPLLCPERQEDILDSLYALNGVAFELDLQQPDLDGAWPMFSESR CSSSTQTQGRRPRKNKDAPKKIPAAYGGPENVQIEDSHTSQAICLQDAPSGQQLAGLPRS QQQRHLPFFLEKKGESSRKHRYPQSMWEPEGKELQLDQEERAPWIEIFLGNSTPSTQGQG KGAMGTQKEVIGMEAEVTGVLLVAEGQRTTEGTHKKEAEWSHVQRLLMPSPRGAVEGAVS GSRQGSGGSSILGEPWVLQGHATKEDSTVENPQVQTEVTLVARREEQAEVSLQDEIKSLR LGLRKAEEQAQRQEQLLREQEGELQALREQLSRCQEERAELQAQLEQKQQEAERRDAMYQ EELGGQRDLVQAMKRRVLELIQEKDRLWQRLQHLSSMAPECCVACSKIFGRFSRRYPCRL CGGLLCHACSMDYKKRDRCCPPCAQGREAQVT >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_3|1899_bp atgcccaagaacaatgtgggtgaggaagtggaaaagtcagaactctcccacatcgcagag gagccactgaatgaggtgggtgtctgggacaagggattcccaggacaggagtggtggctg gtggtgggagtagcagctgtggctggcaagaagctggtttcccagaagaaggcagcgacc tggcccaacaggcaccccctcatccgtgtcaaagctggctgtgtgggagctgccgtctct gccatcctccagggctatggggatgggcaggggccagtgacggacaccagtgccgagctg cacagactctgtggctgcctggagctgctgctgcagtttgaccagaaagagcagaagagc ttcctggggcctcggaaggattactgggactttctctgcactgccctacgacggcagcgg ggaaacatggagccaatccactttgtccgttcccaggacaagttgaagacccctctgggg aaaggccgtgccttcatccgcttctgcctggcccgtgggcagctggctgaggccctgcag ctttgcctcctgaactcagagctcaccagggaatggtatggaccccggagccctctgctc tgcccagaacgccaagaagacatcctggactctctctatgctctcaatggggtggccttc gagttggacctccagcagccagacctggatggagcctggcccatgttctcagagtcacgc tgctccagttccacccaaacccagggaaggagacccagaaaaaacaaagatgccccaaag aagatcccagccgcatatggagggcctgaaaatgtccagattgaggactcacacaccagt caagccatctgtctgcaagatgcacccagtggacagcagctggcagggcttcccaggtcc cagcaacaaaggcatcttcctttctttttggaaaagaagggggaaagttccaggaaacat aggtacccccagagcatgtgggagccagaagggaaggagcttcagctagaccaggaggaa agagccccatggattgagatcttcctggggaactcaacacccagcacccagggacagggg aagggggctatgggcactcagaaggaggtgatagggatggaggctgaggtcacaggggtt ctgctggttgcagagggtcagagaacaacagaggggactcacaaaaaggaagcagagtgg agtcacgtccagaggctgctgatgcccagccccagaggggctgtagagggagcagtatca gggagcaggcaggggtcggggggctctagcatcctgggggagccctgggtccttcaggga cacgcaacaaaggaagactctaccgtggagaatccacaagtgcaaacagaagtgaccctt gtggccagaagggaggagcaagccgaggtgtccctgcaggacgagatcaagagcctcaga cttgggctccggaaggctgaggagcaggcccagcgccaggagcagctgctgagggagcag gagggggagctgcaggcacttcgggagcagctcagcaggtgtcaggaagagagagccgag ctgcaggcacagctggagcagaagcaacaggaggctgagaggagggatgccatgtaccag gaggagcttggagggcagcgggacttggtccaggccatgaagaggcgggtgttggaactg atccaagagaaggaccgcctgtggcagaggctccagcatctctcttccatggctcccgag tgctgtgtggcctgtagcaagatctttggccgattttctcggcggtatccatgcaggctc tgtggaggcctgctctgccatgcttgctccatggattacaagaagagagaccgctgctgc ccaccctgcgcccagggaagagaagcccaggtcacctga >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_4|69_aa MLNITNHQGNANENWNEMPPYSCKNGHNSKVKKTIDVGMDVVKRDHFYTAGENAKYNHYG KQCGDFLKN >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_4|210_bp atgctcaacatcactaatcatcagggaaatgcaaatgaaaactggaatgagatgccacct tactcctgcaagaatggccataattcaaaagtcaaaaaaacaatagatgttggcatggat gtggtgaaaagagaccacttttacactgctggtgagaatgcaaagtacaaccactatgga aaacagtgtggggatttcttaaagaactaa >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_5|246_aa MAMLPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKIILSQKNKAGGITLPD FKLYYKAIVTKSAWYWYQNRDIDQWNRIEPPEIIPHIYNYLIFDKPDKNKKWGKDSLFNK WCWENWLAICRKLELDPFLTSYTKINSRWIKDLNITPKIIKTLEENLGNTVQDIGMGKDF MSKTPKAMATKAKIDKWDLIKPKSFCTAKETTIRVNRQPTEWEKIFAFYPSDKGLISRIY KELKQM >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_5|741_bp atggccatgctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaactggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcattgccaagataatcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaagctatactacaaggctatagtaaccaaatcagcatggtactggtaccaaaacaga gatatagaccaatggaacagaatagagcccccggaaataataccacacatctacaactat ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctggaactggatcccttccttaca tcttatacaaaaattaattcaagatggattaaagacttaaatattacacctaaaatcata aaaaccctagaagaaaacctaggtaataccgttcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaaccaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gagtgggagaaaatttttgctttctacccatctgacaaagggctaatatccagaatctac aaagaacttaaacaaatgtag >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_6|131_aa MKLPEEGSADSNICRSAIFAVLHPPLVIPRQTGSGVDLQQTPTDLKLRVLTVRRKTNKRK GHPHQNPICTSPSSKTKEECSSLPGMEQSWMENDFDELREEGFRQSVITNFSELKEDVGT HHKEAKNLEKR >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_6|396_bp atgaagcttccagaagaaggatcagcagacagcaacatttgccgttctgcaatatttgct gttttgcaccctccactggtgatacccaggcaaacagggtctggagtggacctccagcaa actccaacagacctgaagctgagggtcctgactgttagaaggaaaactaacaaacggaaa ggacatccacaccaaaaccccatctgtacatcaccatcatcaaagaccaaagaggaatgc agctccttgccaggaatggaacaaagttggatggagaatgactttgacgagttgagagaa gaaggcttcagacaatcggtaataacaaatttctctgagctaaaggaagatgttggaacc catcacaaagaagctaaaaaccttgaaaaaagatga >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_7|520_aa MGQVWALVCSTLEHFHTDDEEEGEYNEVTEEVTEQAGIRQARQEGDIEAWQFPVRIHPPD QQGNIIATFEPFPFKFGKAHLVDYIKACDAIGGVKLQAFTVSVTALKGGAYRVVCYSWQV RGLTDFRSEAADLHSHSCSSGGVLQVKSPATQSGFKFTSKMEDFNMESDSFEDFWKGEDL SNYSYSSTLPPFLLDAAPCEPESLEINKYFVVIIYALVFLLSLLGNSLVMLVILYSRVGR SVTDVYLLNLALADLLFALTLPIWAASKVNGWIFGTFLCKVVSLLKEVNFYSGILLLACI SVDRYLAIVHATRTLTQKRYLVKFICLSIWGLSLLLALPVLLFRRTVYSSNVSPACYEDM GNNTANWRMLLRILPQSFGFIVPLLIMLFCYGFTLRTLFKAHMGQKHRAMRVIFAVVLIF LLCWLPYNLVLLADTLMRTQVIQETCERRNHIDRALDATEILGILHSCLNPLIYAFIGQK FRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTSTTL >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_7|1563_bp atgggacaagtgtgggctctggtttgttccaccttggaacattttcacactgatgatgag gaggaaggcgagtataacgaagtaacagaagaggttacagagcaggcaggaattcggcaa gctagacaagagggtgatatagaggcttggcagttccctgttagaatacaccccccagat caacaaggaaatattatagctacatttgagccttttccttttaaatttgggaaagcacat ttagttgattatatcaaggcctgtgatgctatcggaggagtgaagctgcaggccttcaca gtgagtgttacagctcttaaaggtggtgcatacagagttgtttgttactcctggcaggtt cgtggtctcactgacttcaggagtgaagccgcagaccttcacagtcacagctgctcttct ggaggtgtcctacaggtgaaaagcccagcgacccagtcaggatttaagtttacctcaaaa atggaagattttaacatggagagtgacagctttgaagatttctggaaaggtgaagatctt agtaattacagttacagctctaccctgcccccttttctactagatgccgccccatgtgaa ccagaatccctggaaatcaacaagtattttgtggtcattatctatgccctggtattcctg ctgagcctgctgggaaactccctcgtgatgctggtcatcttatacagcagggtcggccgc tccgtcactgatgtctacctgctgaacctagccttggccgacctactctttgccctgacc ttgcccatctgggccgcctccaaggtgaatggctggatttttggcacattcctgtgcaag gtggtctcactcctgaaggaagtcaacttctatagtggcatcctgctactggcctgcatc agtgtggaccgttacctggccattgtccatgccacacgcacactgacccagaagcgctac ttggtcaaattcatatgtctcagcatctggggtctgtccttgctcctggccctgcctgtc ttacttttccgaaggaccgtctactcatccaatgttagcccagcctgctatgaggacatg ggcaacaatacagcaaactggcggatgctgttacggatcctgccccagtcctttggcttc atcgtgccactgctgatcatgctgttctgctacggattcaccctgcgtacgctgtttaag gcccacatggggcagaagcaccgggccatgcgggtcatctttgctgtcgtcctcatcttc ctgctctgctggctgccctacaacctggtcctgctggcagacaccctcatgaggacccag gtgatccaggagacctgtgagcgccgcaatcacatcgaccgggctctggatgccaccgag attctgggcatccttcacagctgcctcaaccccctcatctacgccttcattggccagaag tttcgccatggactcctcaagattctagctatacatggcttgatcagcaaggactccctg cccaaagacagcaggccttcctttgttggctcttcttcagggcacacttccactactctc taa >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_8|350_aa MSNITDPQMWDFDDLNFTGMPPADEDYSPCMLETETLNKYVVIIAYALVFLLSLLGNSLV MLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNGWIFGTFLCKVVSLLKEVN FYSGILLLACISVDRYLAIVHATRTLTQKRHLVKFVCLGCWGLSMNLSLPFFLFRQAYHP NNSSPVCYEVLGNDTAKWRMVLRILPHTFGFIVPLFVMLFCYGFTLRTLFKAHMGQKHRA MRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQESCERRNNIGRALDATEILGFLHSCL NPIIYAFIGQNFRHGFLKILAMHGLVSKEFLARHRVTSYTSSSVNVSSNL >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_8|1053_bp atgtcaaatattacagatccacagatgtgggattttgatgatctaaatttcactggcatg ccacctgcagatgaagattacagcccctgtatgctagaaactgagacactcaacaagtat gttgtgatcatcgcctatgccctagtgttcctgctgagcctgctgggaaactccctggtg atgctggtcatcttatacagcagggtcggccgctccgtcactgatgtctacctgctgaac ctggccttggccgacctactctttgccctgaccttgcccatctgggccgcctccaaggtg aatggctggatttttggcacattcctgtgcaaggtggtctcactcctgaaggaagtcaac ttctacagtggcatcctgctgttggcctgcatcagtgtggaccgttacctggccattgtc catgccacacgcacactgacccagaagcgtcacttggtcaagtttgtttgtcttggctgc tggggactgtctatgaatctgtccctgcccttcttccttttccgccaggcttaccatcca aacaattccagtccagtttgctatgaggtcctgggaaatgacacagcaaaatggcggatg gtgttgcggatcctgcctcacacctttggcttcatcgtgccgctgtttgtcatgctgttc tgctatggattcaccctgcgtacactgtttaaggcccacatggggcagaagcaccgagcc atgagggtcatctttgctgtcgtcctcatcttcctgctttgctggctgccctacaacctg gtcctgctggcagacaccctcatgaggacccaggtgatccaggagagctgtgagcgccgc aacaacatcggccgggccctggatgccactgagattctgggatttctccatagctgcctc aaccccatcatctacgccttcatcggccaaaattttcgccatggattcctcaagatcctg gctatgcatggcctggtcagcaaggagttcttggcacgtcatcgtgttacctcctacact tcttcgtctgtcaatgtctcttccaacctctga >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_9|72_aa MQYCLKIKGEHPGLSIGDVGKKLGEMWNDTAADDKHLYEKKTAKLKEKYEKDIAAYRAKA KPDAAKKRSCPG >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_9|219_bp atgcagtattgcctaaaaatcaaaggagaacatcctggcctgtccattggtgatgttgga aagaaactgggagagatgtggaatgacactgctgcagacgacaagcacctttatgaaaag aagactgccaagctgaaggaaaaatatgaaaaggatattgctgcatatcgagctaaagca aagcctgatgcagcaaaaaaaaggagttgtccaggctga >gi568815596f:218034802_218235881|GENSCAN_predicted_peptide_10|107_aa MAPRGRASRRRAEVAAAMILLEVNNRIIEETLALKFENAAAGNKPEAVEVTFADFDGVLY HISNPNGDKTKVMVSISLKFYKELQAHGADELLKRVYGSFLVNPESX >gi568815596f:218034802_218235881|GENSCAN_predicted_CDS_10|321_bp atggcgccgcggggccgcgcgtccaggcggcgagcggaagtggccgccgccatgatcctg ctggaggtgaacaaccgcatcatcgaggagacgctcgcgctcaagttcgagaacgcggcc gccggaaacaaaccggaagcagtagaagtaacatttgcagatttcgatggggtcctctat catatttcaaatcctaatggagacaaaacaaaagtgatggtcagtatttctttgaaattc tacaaggaacttcaggcacatggtgctgatgagttattaaagagggtgtacgggagtttc ttggtaaatccagaatcagnn