GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:54:33

Sequence gi568815587f:10361514_10605881 : 244368 bp : 43.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    660    655    6                               1.05
 1.03 Term -   8000   7862  139  1  1  103   47   103 0.852   5.14
 1.02 Intr -  17011  16844  168  0  0  109   72    58 0.259   5.36
 1.01 Init -  26259  26081  179  0  2   58   49   100 0.190   1.84
 1.00 Prom -  38352  38313   40                              -2.66

 2.04 PlyA -  38815  38810    6                               1.05
 2.03 Term -  41241  41079  163  2  1   74   37   128 0.018   3.71
 2.02 Intr -  49023  48975   49  1  1   87   94    55 0.088   3.74
 2.01 Init -  59675  59519  157  2  1   71    6   107 0.008   0.87
 2.00 Prom -  59729  59690   40                              -4.46

 3.02 PlyA -  59846  59841    6                               1.05
 3.01 Sngl -  61356  60985  372  0  0   70   43   153 0.674   5.13
 3.00 Prom -  86086  86047   40                              -2.16

 4.00 Prom +  87809  87848   40                              -6.46
 4.01 Init +  89509  89530   22  0  1  103  117    26 0.174   6.64
 4.02 Intr +  89650  89751  102  2  0   90   99    38 0.853   5.25
 4.03 Intr +  93873  93935   63  1  0   65   61    67 0.252   0.39
 4.04 Intr + 100002 100227  226  1  1   86   76   403 0.828  35.94
 4.05 Intr + 117013 117217  205  1  1   84   74   180 0.523  15.40
 4.06 Intr + 120550 120712  163  0  1  115   80   197 0.979  21.15
 4.07 Intr + 123307 123526  220  2  1   93   87   387 0.958  36.46
 4.08 Intr + 125632 125851  220  1  1   84  100   148 0.800  13.90
 4.09 Intr + 131836 132125  290  0  2   87   33   418 0.571  32.24
 4.10 Intr + 133155 133317  163  0  1   80   25   125 0.647   5.38
 4.11 Intr + 133386 133517  132  2  0  107  113   114 0.999  16.64
 4.12 Intr + 134057 134220  164  1  2   67   83   279 0.996  24.07
 4.13 Intr + 135299 135425  127  2  1   69   92   109 0.737  10.08
 4.14 Intr + 138573 138736  164  2  2   60   96   294 0.998  26.17
 4.15 Intr + 139957 140077  121  1  1  121   86   158 0.998  19.40
 4.16 Intr + 141208 141381  174  0  0   73  116   100 0.950  11.44
 4.17 Intr + 143036 143146  111  1  0  -52   80   181 0.655   3.88
 4.18 Term + 144195 144371  177  2  0   94   42   181 0.999  11.79
 4.19 PlyA + 145590 145595    6                               1.05

 5.12 PlyA - 147610 147605    6                               1.05
 5.11 Term - 149677 149611   67  0  1   91   38    48 0.242  -2.49
 5.10 Intr - 157628 157521  108  1  0   48  115    66 0.427   4.70
 5.09 Intr - 163860 163679  182  0  2   88   71    63 0.693   3.27
 5.08 Intr - 169238 169130  109  1  1  131  100    32 0.733   8.99
 5.07 Intr - 193339 193254   86  1  2    8  116    61 0.007  -0.48
 5.06 Intr - 197784 197625  160  2  1   95    9   185 0.019  11.29
 5.05 Intr - 198381 198303   79  1  1  116   97    11 0.971   3.41
 5.04 Intr - 199287 198982  306  1  0   90   84    65 0.550   2.72
 5.03 Intr - 202566 202427  140  2  2   84  105    20 0.975   3.41
 5.02 Intr - 202861 202690  172  2  1   66   94   122 0.985   9.60
 5.01 Init - 207019 206935   85  1  1  110  100    84 0.998  10.78
 5.00 Prom - 207594 207555   40                              -4.96

 6.11 PlyA - 209744 209739    6                               1.05
 6.10 Term - 215062 214819  244  0  1   74   49    47 0.324  -5.23
 6.09 Intr - 219076 218942  135  0  0  125   75   205 0.894  22.58
 6.08 Intr - 220473 220354  120  2  0   94   84   124 0.972  12.21
 6.07 Intr - 230099 230035   65  2  2   79  106     7 0.563  -0.88
 6.06 Intr - 232086 231979  108  0  0  103  111    -7 0.778   3.58
 6.05 Intr - 232682 232633   50  0  2   92   91    -2 0.755  -1.10
 6.04 Intr - 239546 239405  142  2  1  126   95   188 0.952  23.23
 6.03 Intr - 241738 241607  132  1  0   60  109   123 0.987  12.44
 6.02 Intr - 243032 242874  159  2  0  102   48   127 0.593  10.28
 6.01 Intr - 244173 244027  147  0  0   75   75    31 0.473   0.93

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  42606  42763  158  1  2   21   44   197 0.929   6.70
S.002 Term + 179549 179837  289  2  1   74   53   152 0.873   5.15
S.003 Term - 197784 197598  187  2  1   95   37   214 0.980  13.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_1|161_aa
MALHEPRIQPLMQPAHNFPSFIFINPTGKLDFTQDCAENQMLELGSGFLDKHIAIIRIYP
CMFPCQGIHSLLAKNFHHSSSPALRTPELWKQNPLEQSWVLGYQKNLKALVLEYIGPLIQ
REDAQTPGHRFQSKAQVMGSVGAALNSLPRDGYTSSPNLTT

>gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_1|486_bp
atggccctgcatgagcccagaattcagcccctgatgcagccagcccacaacttcccaagt
tttattttcattaacccaactggaaaattggacttcactcaggattgtgctgaaaaccag
atgctggagctgggctctggtttcctagataaacacattgcaataatacgcatctaccct
tgtatgtttccatgccagggtatccacagcctccttgcaaagaatttccaccactcaagt
tccccagcactaaggacccccgagctctggaagcaaaatcctctggaacaatcatgggta
ttagggtatcagaagaacctgaaagccttggtcctggagtatataggcccattgatccag
cgggaggatgcccagacaccaggtcacaggtttcaaagtaaggcccaagtcatgggatcc
gtgggggcagccctcaactcacttcccagagatgggtatacctctagccctaatctgacc
acctga

>gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_2|122_aa
MGNDFMSKTPKAMATKVKIDKWDLIKLKSFCTAKETIIRVNRQPSEKTTPSKNLIDLKHG
AVPKDDLDRLYIYGKMATSNSKLHLPTYMSNGKRTSLFPNAQDQFLEFTQKDHSWAMYPS
LN

>gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_2|369_bp
atgggcaacgacttcatgtctaaaacaccaaaagcaatggcaacaaaagtcaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagta
aacaggcaaccctcagaaaaaacaaccccatcaaaaaacttgattgatttgaaacatggg
gctgtgcccaaagacgacttggacagactttacatctatggaaaaatggctaccagcaat
tccaagcttcatcttcccacatacatgtccaatgggaaaagaacaagtctcttccccaat
gctcaggatcaattcctggaattcactcagaaagaccactcttgggccatgtacccatcc
ctgaactaa

>gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_3|123_aa
MDKFLDTYTLSRLNQEEVKSLNRPITSSEIEAVMNSLPTKKSSEPDRFTAKFYQRDKEEL
VPFLLKLFQTIEDEGLLPNSFYEASIILIPKPGRGTTEKENFRPISLMNIDRKILNKIPA
NQI

>gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_3|372_bp
atggataaattcctggacacatacaccctctcaagactaaaccaggaagaagtgaaatcc ctgaatagaccaataacaagttctgaaattgaggcagtaatgaatagcctaccaaccaaa aaaagctcagaaccagacagattcacagccaaattctaccagagggacaaagaggagctg gtaccattccttctgaaactattccaaacaatagaagatgagggactcctccctaactca ttttatgaggccagcatcatcctgataccaaaacctggcagaggcacaacagaaaaagaa aatttcaggccaatatccctgatgaacattgataggaaaatcctcaataaaataccagca aaccaaatctag >gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_4|947_aa MALSSEPEAVALTPVRSLALSPVPVPVPTPASAVARAPAASGGVAESSQRSELEAHVGAV SGSEMPRQFPKLNISEVDEQVRLLAEKVFAKVLREEDSKDALSLFTVPEDCPIGQKEAKE RELQKELAEQKSVETAKRKKSFKMIRSQSLSLQMPPQQDWKGPPAASPAMSPTTPVVTGA TSLPTPAPYAMPEFQRVTISGDYCAGITLEDYEQAAKSLAKALMIREKYARLAYHRFPRI TSQYLGHPRADTAPPEEGLPDFHPPPLPQEDPYCLDDAPPNLDYLVHMQGGILFVYDNKK MLEHQEPHSLPYPDLETYTVDMSHILALITDGPTQGWPLQMLEASKLESSMRLNYGLSPS PTCRKTYCHRRLNFLESKFSLHEMLNEMSEFKELKSNPHRDFYNVRKVDTHIHAAACMNQ KHLLRFIKHTYQTEPDRTVAEKRGRKITLRQVFDGLHMDPYDLTVDSLDVHAVSELLLQC RQQTAALAGDSAPWKATMIVLAGRWGLCCSFTGGLAKGSQADRVRIHGPGAQEFLRFHAR IDIEDVFRCGHTEGRLVKGRQTFHRFDKFNSKYNPVGASELRDLYLKTENYLGGEYFARM VKEVARELEESKYQYSEPRLSIYGRSPEEWPNLAYWFIQHKVYSPNMRWIIQVPRIYDIF RSKKLLPNFGKMLENIFLPLFKATINPQDHRELHLFLKYVTGFDSVDDESKHSDHMFSDK SPNPDVWTSEQNPPYSYYLYYMYANIMVLNNLRRERGLSTFLFRPHCGEAGSITHLVSAF LTADNISHGLLLKKSPVLQYLYYLAQIPIAMSPLSNNSLFLEYSKNPLREFLHKGLHVSL STDDPMQFHYTKEALMEEYAIAAQVWKLSTCDLCEIARNSVLQSGLSHQEKQKFLGQNYY KEGPEGNDIRKTNVAQIRMAFRYETLCNELSFLSDAMKSEEITALTN >gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_4|2844_bp atggccctgtcgtccgaacccgaggcggtggcgctgactccggtccgatcccttgccctg tcccctgttcctgtccctgtccctacccctgcctctgcggtggcccgagccccagcggcc tcaggaggagtggcagagtccagccagcgctcggagctggaggcccacgtgggagcagtg agcggctctgagatgccgcggcagtttcccaagctgaacatctctgaagtggatgagcaa gtccggctcctggcggagaaggtgtttgctaaagtgctccgagaagaggacagcaaagat gccctgtccctgttcactgtcccagaggactgccccatcgggcaaaaggaagccaaggag agggagctgcagaaggagctggcagagcagaagtctgtggagaccgcaaaaagaaagaaa agtttcaagatgattcggtcccagtccctgtctctgcaaatgccgccacagcaagattgg 
aagggccccccggcagccagtccggccatgtctcccacaacccctgtggtcactggagcc acttccctgcccacgccagcaccctatgccatgcctgagttccagcgggtcaccatcagc ggagattactgtgccgggatcactttggaggactatgagcaggcagccaagagtctggcc aaggccctaatgatccgggagaagtatgcgcggctcgcctaccaccgcttcccgcggatc acatcccagtacctgggtcatccgcgggcggatactgcacctccggaagagggccttcca gacttccaccctcctccactgccccaggaagacccctactgcctggatgatgcacccccc aacctggattacttggtccacatgcaggggggcatcctctttgtgtatgataacaagaag atgctggagcaccaggagccgcacagcctaccctaccccgacctggagacctacacggtg gacatgagccacatcctggctctcatcaccgatggccccacccagggttggcccctgcag atgctggaggcctccaaactggagagctcgatgcgactcaactatggtctctccccatct ccaacttgcaggaaaacctattgtcaccggcgactgaactttctggaatccaagttcagc cttcatgagatgttaaacgaaatgtccgagttcaaagagttgaagagtaacccccaccgg gacttctataacgtgagaaaggtggacacacacatccatgcggccgcctgcatgaaccaa aagcatctgctgcgcttcatcaagcacacataccagacggagcctgacaggactgtggca gagaagcggggccggaagatcaccctgcggcaggtgtttgacggcctgcacatggacccc tacgacctcactgtggactcactggatgtccacgcggtgagtgagcttctgctccagtgc cgccagcagacagcagccctggctggggactcagccccctggaaagccaccatgattgtg cttgccgggaggtggggcctctgctgttccttcacagggggccttgccaaaggatcccaa gctgaccgagtgaggatccatggtcctggtgctcaggagtttctaaggtttcatgcaaga attgacatagaagatgtctttcggtgtggccatacagaaggccgcctcgtaaagggccgg cagacattccaccgctttgacaagttcaactccaaatacaaccctgtgggggccagtgag ctgcgtgacctgtatttgaaaactgaaaactatctgggaggagagtactttgctcggatg gtcaaggaggttgcccgggagctggaggagagcaagtaccagtactcagagccacggctc tccatctacggccgcagtcctgaggagtggcccaacctggcctactggttcatccagcac aaggtctactctcccaacatgcgctggatcatccaggtgccccggatttatgacatattt aggtcaaagaagctgctgccaaactttgggaagatgctggagaacatcttcctgcccctt ttcaaggccactatcaacccccaagatcatcgagagcttcacctcttccttaaatatgtg acggggtttgacagcgtggatgatgagtccaagcacagcgaccacatgttttccgacaag agcccaaacccggacgtctggaccagtgagcagaacccaccctacagctactacctgtac tacatgtatgccaacatcatggtgctcaacaacctccgcagggagcgcggcctgagcacg ttcctgttccggccgcactgtggggaagccggctccatcacccacctggtgtctgccttc ctcactgctgacaacatttcccacgggctgctcctcaagaagagtccggtattgcagtat 
ctctactaccttgctcagatccccattgccatgtctcctcttagcaacaacagtttgttc ctcgaatattccaagaaccctctgagggaattcctacacaagggactgcatgtttctctt tccaccgatgaccccatgcagttccactacacgaaggaagcacttatggaagaatatgcc attgcagctcaagtgtggaagctgagcacctgcgacctgtgtgagatcgccaggaacagc gtgctgcagagcggcctctcgcatcaggaaaagcaaaagtttctgggacaaaattattat aaagaaggacctgaaggaaatgatattcgaaagacaaatgtggctcagatccggatggca ttccgatatgagaccttatgcaatgagctcagcttcctgtctgatgctatgaaatcagaa gagatcaccgccttgaccaactag >gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_5|497_aa MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEA CRLLGLSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPV SRQFAAYCYNSSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPA PTTTPPAPASTSIPRRKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTAL LVLALLFFGAAAGLGFCYVKRYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKT DKNPEESKSPSKTTKNSRALFYLCSHTTTTVINSITKELNDKRTAKVASGQEKHLLFEVQ PGSDSSAFWKVVVRVVCTKINKSSGIVEASRIMNLYQFIQLYKDITSQAAGVLAQSSTSE EPDENSSSVTSCQASLWMGRVKQLTDEEECCICMDGRADLILPCAHSFCQKCIDKWQQTS TVHSPGVVDPYPNRQQK >gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_5|1494_bp atggccaggtgcttcagcctggtgttgcttctcacttccatctggaccacgaggctcctg gtccaaggctctttgcgtgcagaagagctttccatccaggtgtcatgcagaattatgggg atcacccttgtgagcaaaaaggcgaaccagcagctgaatttcacagaagctaaggaggcc tgtaggctgctgggactaagtttggccggcaaggaccaagttgaaacagccttgaaagct agctttgaaacttgcagctatggctgggttggagatggattcgtggtcatctctaggatt agcccaaaccccaagtgtgggaaaaatggggtgggtgtcctgatttggaaggttccagtg agccgacagtttgcagcctattgttacaactcatctgatacttggactaactcgtgcatt ccagaaattatcaccaccaaagatcccatattcaacactcaaactgcaacacaaacaaca gaatttattgtcagtgacagtacctactcggtggcatccccttactctacaatacctgcc cctactactactcctcctgctccagcttccacttctattccacggagaaaaaaattgatt tgtgtcacagaagtttttatggaaactagcaccatgtctacagaaactgaaccatttgtt gaaaataaagcagcattcaagaatgaagctgctgggtttggaggtgtccccacggctctg ctagtgcttgctctcctcttctttggtgctgcagctggtcttggattttgctatgtcaaa aggtatgtgaaggccttcccttttacaaacaagaatcagcagaaggaaatgatcgaaacc 
aaagtagtaaaggaggagaaggccaatgatagcaaccctaatgaggaatcaaagaaaact gataaaaacccagaagagtccaagagtccaagcaaaactaccaaaaactcccgagccttg ttttacctctgctctcacaccaccacaacagtcatcaactcaataacaaaagaactcaat gacaaaagaacggctaaagtggcttctggccaggaaaaacatcttctctttgaggtacaa cctgggtctgattcctctgctttttggaaagtggttgtacgggtggtctgtaccaagatt aacaaaagcagtggcattgtggaggcatcacggatcatgaatttataccagtttattcaa ctttataaagatatcacaagtcaagcagcaggagtattggcacagagctccacctctgaa gaacctgatgaaaactcatcctctgtaacatcttgtcaggctagtctttggatgggaagg gtgaagcagctgaccgatgaggaggagtgttgtatctgtatggatgggcgggctgacctc atcctgccttgtgctcacagcttttgtcagaagtgtattgataaatggcagcagaccagt actgtccacagcccaggggttgtggacccctaccccaacagacagcaaaaataa >gi568815587f:10361514_10605881|GENSCAN_predicted_peptide_6|433_aa MYMDTALGMVEIKQAFKSDSLRLNPGSVTPCHLEQVNDRSGLGLFTHKQNVFVQLSLAFR NDSYTLESRINQAERERNLTEENTEKELENFKASITVIVEPDSSASLWHHCEHRETYQKL LEDIAVLHRLAARLSSRAEVVGAVRQEKRMSKATEVMMQYVENLKRTYEKDHAELMEFKK LANQNSSRSCGPSEDGVPRTARSMSLTLGKNMPRRRVSVAVVPKFNALNLPGQTPSSSSI PSLPALSESPNGKGSLPVTSALPALLENGKTNGDPDCEASAPALTLSCLEELSQETKARM EEEAYSKGFQEGLKKTKELQDLKEEEEEQKSESPEEPEEVEETEEEEKGPRSSKLEELVH FLQVMYPKLCQHWQVIWMMAAVMLVLTVVLGLYNSYNSCAEQADGPLGRSTCSAAQRDSW WSSGLQHEQPTEQ >gi568815587f:10361514_10605881|GENSCAN_predicted_CDS_6|1302_bp atgtatatggacacagctctcggcatggtggaaataaaacaggctttcaaatcagacagt ctgaggttaaatcctggctccgttactccgtgtcaccttgaacaagttaatgaccgttct ggtcttggcctctttacccataaacagaacgtgtttgtgcaactgtccttggcctttaga aatgacagctacactctggaatctagaattaaccaggctgaaagggaacgcaacctgaca gaggagaacactgagaaagaactggaaaacttcaaagcttccattacggtaattgtggag cctgattcctcagcttcactctggcaccactgtgagcaccgggaaacctaccagaagttg ctggaggacatcgctgtcctgcaccgcctggctgcccgcctctccagccgagctgaggtg gtaggcgccgtccgccaggaaaagcgcatgtcgaaagcaacggaagtgatgatgcagtat gtggagaatctaaagaggacgtatgagaaggaccatgcggagctcatggagtttaaaaag cttgcaaatcagaattcaagccgcagctgtggcccctctgaagatggggtccctcgcacg gcacggtccatgtccctcacgctgggaaagaatatgcctcgccggagggtcagcgttgct gtggttcctaagtttaatgccctgaatctgcctggccaaactcccagctcatcatccatt 
ccctccttaccagccttgtcggaatcacccaatgggaaaggcagcctacctgtcacttca gcactgcctgcacttttggaaaatggaaagacaaatggggacccagattgtgaagcctct gctcctgcgctgaccctgagctgcctggaggagcttagtcaggagaccaaggccaggatg gaggaagaagcctacagcaagggattccaagaaggtctaaagaagaccaaagaacttcaa gacctgaaggaggaggaggaagaacagaagagtgagagtcctgaggaacctgaagaggta gaagaaactgaggaagaggaaaagggcccaagaagcagcaaacttgaagaattggtccat ttcttacaagtcatgtatcccaaactgtgtcagcactggcaagtgatctggatgatggct gcagtgatgctggtcttgactgttgtgctggggctctacaattcctataactcttgtgca gagcaggctgatgggccccttggaagatccacttgctcggcagcccagagggactcctgg tggagctcaggactccagcatgagcagcctacagagcagtag