GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:17:34 Sequence gi568815597r:74606299_74824821 : 218523 bp : 37.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 604 303 302 0 2 70 44 272 0.195 16.45 1.08 Intr - 3353 3308 46 0 1 100 95 -14 0.078 -2.95 1.07 Intr - 14616 14436 181 0 1 119 81 102 0.768 11.12 1.06 Intr - 30140 30045 96 2 0 45 88 73 0.574 2.29 1.05 Intr - 35161 35033 129 1 0 64 105 91 0.967 8.37 1.04 Intr - 36800 36729 72 2 0 62 116 50 0.932 3.88 1.03 Intr - 40494 40369 126 0 0 48 83 83 0.903 3.76 1.02 Intr - 43017 42924 94 2 1 93 119 14 0.961 4.05 1.01 Init - 48480 48368 113 0 2 71 74 63 0.387 2.93 1.00 Prom - 51204 51165 40 -6.25 2.04 PlyA - 51221 51216 6 1.05 2.03 Term - 51976 51876 101 2 2 38 42 138 0.781 1.51 2.02 Intr - 52559 52460 100 2 1 94 98 51 0.936 5.46 2.01 Init - 52952 52890 63 2 0 97 89 -12 0.882 1.00 2.00 Prom - 56544 56505 40 -5.25 3.00 Prom + 63172 63211 40 -6.05 3.01 Init + 65337 65380 44 2 2 74 63 30 0.642 -0.96 3.02 Intr + 67124 67266 143 2 2 140 65 88 0.453 10.78 3.03 Term + 67431 67555 125 1 2 90 44 89 0.618 2.27 3.04 PlyA + 68734 68739 6 1.05 4.10 PlyA - 68831 68826 6 1.05 4.09 Term - 96371 96151 221 0 2 22 42 212 0.670 6.22 4.08 Intr - 100159 100036 124 1 1 50 27 144 0.562 3.74 4.07 Intr - 100696 100601 96 1 0 55 97 68 0.886 3.69 4.06 Intr - 100906 100805 102 1 0 49 92 64 0.831 2.35 4.05 Intr - 103949 103800 150 2 0 106 93 104 0.999 12.14 4.04 Intr - 108332 108281 52 1 1 99 83 65 0.998 5.09 4.03 Intr - 113074 112911 164 1 2 97 90 223 0.993 21.25 4.02 Intr - 116972 116820 153 2 0 63 67 134 0.989 8.15 4.01 Init - 118523 118413 111 2 0 108 75 98 0.965 10.76 4.00 Prom - 119001 118962 40 -8.95 5.00 Prom + 119126 119165 40 -6.05 5.01 Init + 122854 122898 45 0 0 90 59 93 0.311 7.33 5.02 Term + 126805 127026 222 0 0 85 49 166 0.701 8.33 5.03 PlyA + 127873 127878 6 1.05 6.00 Prom + 127988 128027 40 -14.16 6.01 Init + 128098 128148 51 0 0 63 98 74 0.675 7.11 6.02 Intr + 130244 130324 81 1 0 80 103 34 0.661 3.02 6.03 Intr + 132392 132490 99 1 0 63 58 110 0.577 4.89 6.04 Intr + 133760 133808 49 1 1 52 73 78 0.335 0.03 6.05 Intr + 133859 134044 186 0 0 35 79 147 0.285 7.24 6.06 Term + 134293 134750 458 2 2 8 41 228 0.244 4.40 6.07 PlyA + 134895 134900 6 -1.95 7.07 PlyA - 134946 134941 6 -4.04 7.06 Term - 135226 134949 278 2 2 84 42 269 0.274 16.54 7.05 Intr - 136332 135953 380 2 2 -38 48 252 0.212 2.28 7.04 Intr - 136819 136542 278 1 2 54 68 140 0.246 3.99 7.03 Intr - 137796 137653 144 0 0 103 83 53 0.372 5.86 7.02 Intr - 137979 137975 5 1 2 87 94 0 0.392 -7.57 7.01 Init - 138436 138277 160 1 1 84 89 125 0.780 12.23 7.00 Prom - 149473 149434 40 -2.25 8.00 Prom + 149690 149729 40 -5.45 8.01 Init + 149935 150278 344 0 2 60 47 286 0.846 18.45 8.02 Term + 150516 151179 664 0 1 79 38 303 0.884 16.75 8.03 PlyA + 152745 152750 6 1.05 9.00 Prom + 156948 156987 40 -1.95 9.01 Sngl + 157636 157815 180 0 0 80 43 201 0.702 9.55 9.02 PlyA + 158082 158087 6 1.05 10.00 Prom + 159849 159888 40 -5.85 10.01 Sngl + 159968 160138 171 1 0 18 44 248 0.820 7.98 10.02 PlyA + 160359 160364 6 1.05 11.04 PlyA - 160793 160788 6 1.05 11.03 Term - 167523 167408 116 1 2 73 33 64 0.015 -2.85 11.02 Intr - 185032 184943 90 2 0 102 93 26 0.036 3.55 11.01 Init - 195150 195066 85 0 1 66 65 89 0.556 5.43 11.00 Prom - 218256 218217 40 -3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_1|387_aa MGMIISSIMDCMSVSPQNSYVETLIPNVIVFGDEAFGRLLAAYNSLMDKHLAGYFNNTRI RRHLLRSGLITRSGRILSEKEYKLNMMKRDHQKYIRECLAQAIFHKVLDMERYHQLEIKK KLETLARKERIQRFKGEHTRRSVENNMPILSPHPPVGPKSNRGHSVLVDEGHSSPLALTA PRPYTAPGNMQPPIRLQPLPSNPAVETVPKDSRRIHKTSLHSNAAITMIYLGKNVHLSSD NPDFRDEIKVYQQHCGGENLCVYKGKLLEKGQVKVESDWFKWYRVRCIIAMGLDKKPSLP KSRKEKSTEKGEELKKAEGKVRKEREYVIPKRNEIKENKTSVSAKFSAQEIKTGLKEVVT AVEEMTSKGKPGQEVLEDDQENTLKYX >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_1|1161_bp atggggatgatcatatcttctattatggactgcatgtctgtgtctccccaaaattcatat gttgaaaccctaatccccaatgtgatagtatttggagatgaggcctttgggaggttactt gctgcatataatagccttatggataaacacctggctgggtattttaacaatacaaggata aggcgtcatctcttaagatcaggactgatcacaagaagtggaagaatactttctgaaaaa gaatataaactaaatatgatgaagcgggatcatcaaaaatatatccgggaatgcttagcc caggcaatttttcataaagttcttgatatggagcgttaccatcagcttgaaataaaaaag aaattggagaccttagctaggaaggagcgaatccagaggtttaagggagagcacacaaga aggtctgttgaaaataacatgccaatcctgtctccccacccaccagttggcccaaagagt aatcgtggccatagtgttctggttgatgaaggacattccagtccgttagcactgacagcc cctcgaccatatactgctccaggaaatatgcagcctccaattcgattacagcctcttccc agtaatcctgcagtagaaactgttccaaaggattcaagaaggattcataaaacatcctta catagtaatgcagctattacaatgatctatttggggaaaaatgtgcacctatcttctgat aatcctgacttccgggatgaaattaaagtttatcagcagcactgtggtggggaaaacctt tgtgtctacaaaggcaaactacttgaaaaaggtcaggtaaaagttgagtcagactggttt aaatggtacagggttaggtgcattattgcaatgggccttgacaaaaaaccgtctttgccg aaatctaggaaagaaaagagcactgagaaaggagaggaactgaagaaggctgaggggaaa gtgaggaaagagagagagtatgtgataccaaaaagaaatgagatcaaggagaacaaaacc tctgtttcagccaaattttcagctcaagaaataaaaacagggctcaaagaagtggtaact gctgtggaggaaatgacaagtaaaggaaaaccaggacaagaagtcttggaagacgaccag gaaaatactttaaaatatgnn >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_2|87_aa MVLSYPWTFACGNPSSWSNPLMRKLSLIVSKQLVGTHRVQVSIEDLKLGISEGGAKMGKE DKELGSGGEKEEQKEEEYLIENIQFYG >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_2|264_bp atggtactctcttatccctggacctttgcatgtggtaatccctcttcctggagcaatcct cttatgaggaaattgagcttaatagtgagtaaacaacttgttggaactcatagagtacaa gtgtcaatagaagatttaaaactgggtatttcagagggtggggctaagatgggaaaggag gacaaggaattgggaagtggaggggaaaaggaagagcagaaggaggaggaatacctcata gagaacattcagttttatggttaa >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_3|103_aa MRVTKNQHEWEDLISRREEEGQLAEAATCHSVEKLQLSLTQRGENGSFLQDPFWTPRQVR RVASLLLREERGKRPGSGEAEALLEPSLRKLEVKPASTPFSGP >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_3|312_bp atgagggtaactaaaaaccagcatgagtgggaagatttaataagcaggagggaggaggag gggcagctggctgaagccgccacctgccacagcgtggaaaagctccagttgtcacttacc cagcggggtgagaatggctcatttttgcaggatcccttttggaccccgcggcaggtgcgg agggtggcttccctgttgctaagggaggagagaggcaagcgcccagggtctggagaagcg gaggcgctactggaaccgagcctgcggaagctggaagtgaagccagcttctacccctttt tctgggccctga >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_4|390_aa MATGQKLMRAVRVFEFGGPEVLKLRSDIAVPIPKDHQVLIKVHACGVNPVETYIRSGTYS RKPLLPYTPGSDVAGVIEAVGDNASAFKKGDRVFTSSTISGGYAEYALAADHTVYKLPEK LDFKQGAAIGIPYFTAYRALIHSACVKAGESVLVHGASGGVGLAACQIARAYGLKILGTA GTEEGQKIVLQNGAHEVFNHREVNYIDKIKKYVGEKGIDIIIEMLANVNLSKDLSLLSHG GRVIVVGSRGTIEINPRDTMAKESSIIGVTLFSSTKEEFQQYAAALQAGMEIGWLKPVIG SQYPLEKVAEAHENIIHAKPDNVKIGAEDARTLVQQALELTQNIEEAISLQILIWKKLTS LMGREEGHKYISGIVVKSMGSEGKETGFKA >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_4|1173_bp atggcgactggacagaagttgatgagagctgttagagtttttgaatttggtgggccagaa gtcctgaaattgcgatcagatattgcagtaccgattccaaaagaccatcaggttctaatc aaggtccatgcatgtggtgtcaaccccgtggagacatacattcgctctggtacttatagt agaaaaccactcttaccctatactcctggctcagatgtggctggggtgatagaagctgtt ggagataatgcatctgctttcaagaaaggtgacagagttttcactagcagcacgatctct gggggttatgcagagtatgctcttgcagcagaccacactgtttacaaactacctgaaaaa ctggactttaaacaaggagctgccatcggcattccatattttactgcttatcgagctctg atccacagtgcctgtgtgaaagctggagagagtgttctggttcatggggcaagtggagga gttggattagcagcatgccaaattgctagagcttatggcttaaagattttgggcactgct ggtactgaggaaggacaaaagattgttttgcaaaatggagcccatgaagtgttcaatcac agagaagtgaattacattgataaaattaagaagtatgttggtgagaaaggaattgatata attattgaaatgttagctaatgtaaatcttagtaaagacttgagtcttctgtcacatgga ggacgagtgatagttgttggcagcagaggtactattgaaataaacccacgagacaccatg gcaaaggagtcgagtataattggagttactctcttttcctcaaccaaggaggaatttcag caatatgcagcagcccttcaagctggaatggaaattggctggttgaaacctgtgataggt tctcaatatccattggagaaggtggccgaggctcatgaaaatatcattcatgcaaagcca gataatgtaaaaattggagcagaagatgcccgtaccttagtccagcaggcactggaattg acgcaaaatattgaagaggctatctcactccagatattgatctggaagaaactgacctct cttatgggcagggaggaaggccacaagtacatctctggtatagttgtcaagagcatgggt tctgaaggcaaagaaactggatttaaagcctag >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_5|88_aa MTLRKEQVLEYAARGAKALEFPKLRTKRKFGPFRPPLASIWLPPGRDEATMKEPSADPES VTHGSQRGVQEMEGAMFEQSGPQPEGQC >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_5|267_bp atgacgttacggaaagagcaggttttggaatatgcagctcgaggtgcgaaggcactagaa tttcccaaattaagaacgaagaggaagtttggaccttttcggccaccgctcgcttcaata tggctgcccccagggagagacgaggctaccatgaaggagccgagcgcagaccctgagtcc gtcacccatggatcgcagcgcggagttcaggaaatggaaggcgcaatgtttgagcaaagc ggacctcagccggaagggcagtgttga >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_6|307_aa MAPKYGSNDAGDLDLPTGINGFEVQKQNCCWLLVTHKLCVKDDVIVALKKANGDATLKFE PFVLHVQCRQLQDAQILWVIGLADFKNEAVDPRNVQMCPEFLPSSGFVVLLTSGVKLQTF AVSVTALKGGMSRVVCSSQWVRGLADFRNEAADFHTRHRALIGAFLQSADWCVYNPLARH RALIGAFLQSADWCVYNPLARHRALIGAFLQSADWCIYSPLATHRALIGAFLQSADWCIC SPLARQKSSPSPHLTQKVQLASSLNPPSKQDTSTAAGNLAMTALATSCWIGVKKGPCSCS VLQRGTL >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_6|924_bp atggccccaaagtacgggagtaatgatgctggcgatttggatctgccaacaggtataaat ggttttgaggttcagaaacaaaactgttgctggctactggttacacacaaactttgtgta aaagatgatgtgattgtagctctgaagaaagcaaatggtgatgccactttgaaatttgaa ccatttgttcttcatgtgcagtgtcgacaattgcaggatgcacagattctgtgggttatt ggtctcgctgacttcaagaatgaagccgtggaccctcgcaatgttcagatgtgtccggag tttcttccttccagtgggttcgtggtcttgctgacttcaggagtgaagctgcagaccttc gcagtgagtgttacagctcttaaaggtggcatgtccagagttgtttgttcctcccagtgg gttcgtggtcttgctgatttcagaaatgaagccgcagacttccacactagacatagagcg ctgattggtgcgtttttacagagtgctgattggtgcgtttacaatcctttagctagacac agagcgctgattggtgcatttttacagagtgctgattggtgcgtttacaatcctttagct agacacagagcgctgattggtgcatttttacagagtgctgattggtgcatttacagtcct ttagctacacacagagcgctaattggtgcgtttttacagagtgctgattggtgcatttgc agtcctctagctagacagaaaagttctccaagtccccacttgacccagaaagtccagctg gcttcatctctcaatccaccctctaaacaggacacctcaactgctgctgggaatttggcg atgacggctctagctacttcctgctggataggggtgaagaaggggccctgcagttgtagt gtcctccagaggggaactctttag >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_7|414_aa MSVCNFILEVSETKNPSEGTNSRHILATQMGHIWRPRWDTFWRPRWDNRLLPSGKKETSK EISKGPQKRLGYQLCPLQAVAGGEFGPTWVHVSFSLSDLKQIKGNHWKVHCPREQTFSEP EAPNQLTQQQDSGCRGQVPAHVITLTEPWACLTIEGQEIDFLLDTGVAFSVLISCLRELS SRSVTIQGILGQPVTSHRKGEGEKAEGNRWADAEAKIAARQNIPLEIPTEGPLVWNNPLQ EVKLQYSPTETEWGLSRGHSFLPSAWLTTEEGKVLIPKASQWKILKTLHETVHMGIENTH QMAKSLFTGPNLLQAIRQVVKAYSLWEGPYSVILSTPTAVKMAGVESWIHHTPVKFWTPP EEPAGPSAQESQDQPDQPRYTCEPLEDLHLLFWKETSQTKKALTSDPEEKPIPP >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_7|1245_bp atgagtgtctgcaacttcattcttgaagtcagtgagaccaagaacccatcggaaggaacc aattctagacacattttggcaacccagatgggacacatttggcgacccagatgggacaca ttttggcgacccagatgggacaatcgcctattgccaagtgggaagaaggaaacaagcaaa gaaatctccaaaggaccacaaaaacgcctgggctatcagttatgtccccttcaagctgta gcgggaggggaatttggcccaacttgggtacatgtctccttctccctctctgatttaaag cagatcaagggcaatcactggaaggtgcactgccccagagaacaaacgttctctgagcca gaagcccccaaccagctgacccaacaacaggactcagggtgccgggggcaagtgccagct catgtcatcaccctcactgagccctgggcatgtttaaccattgagggccaggaaattgac ttcctcctggacactggtgtggctttctcagtgttaatctcctgtctcagggagttgtcc tcaaggtccgttaccatccaaggaatcctgggacagcctgtaaccagccatcggaaaggt gaaggagaaaaggcagaaggaaaccgttgggcagatgctgaggctaaaattgctgccagg cagaacatcccattagaaatacctacggaaggacccttggtatggaacaaccccctccaa gaggttaagctccagtattccccaaccgaaacagagtggggactttcacgggggcatagt tttctcccctcggcgtggttaacgacagaagaaggaaaggtacttatacccaaagccagc cagtggaaaatacttaaaaccctccacgaaactgttcacatgggtattgaaaacacgcat caaatggccaaatccctatttacagggccaaatctcctccaggccatccgacaggtagtc aaagcctattctttgtgggaaggaccatactcggtaatcctctctacccccactgcagtt aagatggcaggagtggaatcttggattcaccacaccccagttaaattttggacacccccg gaggaacctgcgggaccatcagctcaggagtcccaagatcagccagaccagcctcgatac acctgcgaaccattggaggacttgcatctcctattttggaaggaaacatcccagactaaa aaggctcttacctctgatcctgaggaaaaacccattcctccttaa >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_8|335_aa MWKRLWNWVTGRGWNSLEDSEEDRKMWESLELPRDLSNGFDQNADNDMDKEIQADVVSDG DEELVGNWSKGDSCYVLAKRLAAFYPCPRDLWNFDVERDDLGYPAEFLSSKAFKRPRGLG GKSGFVGWAQGPCAVCSLGTWCPVSQPLKPWLKGTNVELGAVASEGASPKPWQLPCGVEP TSGQKSRIGIWGPPSRFQKIYGNTWMPRWKFSAEAGPSWRTSARAVPKGSVGSEPPHRVP TRAMPSGAVRSGLPSSRPQNDRSTDSLYCVPGKATDTQCQLVKVARREAVSCKATEAELP KTMGTHLLHQHDLDVRHGVKGDHFGALMFDCPAGF >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_8|1008_bp atgtggaagcgactttggaactgggtaacaggtagaggttggaacagtttggaggattca gaagaagacaggaaaatgtgggaaagtttggaacttcctagagacttgtctaatggcttt gaccaaaatgctgataatgatatggacaaggaaatccaggctgatgtagtctcagatgga gatgaggaacttgttgggaactggagcaaaggtgactcttgttatgttttagcaaagaga ctggcagcattttacccctgccctcgagacttgtggaactttgatgttgagagagatgat ttagggtatccggcagaatttctaagcagcaaagcattcaagaggcccagaggcctagga ggaaaaagtggttttgtgggctgggcccaaggtccctgtgctgtgtgcagcctaggaact tggtgccctgtgtcccagccactcaagccatggctgaaagggaccaatgtagagcttggg gctgtggcttcagagggtgcaagccccaaaccttggcagcttccatgtggtgttgagcct acgagtggacaaaagtcaagaattggcatctggggacctccatctagatttcagaagata tatggcaacacctggatgcccagatggaagtttagtgcagaggcggggccctcatggaga acctctgctagggcagtgccaaagggaagtgtggggtcagagcccccacacagagtccct actcgggcaatgcctagtggagctgtgagaagcgggctaccatcctccagaccccagaat gatagatccaccgacagcttgtactgtgtacctggaaaagccacagacactcagtgccag cttgtgaaggtagccagaagggaagctgtatcctgcaaagccacagaggcagagctgccc aagactatgggaacccacctcttgcatcagcatgatctggatgtgagacatggagtcaaa ggagatcattttggagctttaatgtttgactgccctgctggattttag >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_9|59_aa MTNLHPKIKEKNNSSYIHKKKRNPEKTRAQCITKESDEELENDDDDDLGINVTIFPEDY >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_9|180_bp atgactaacttacatcccaagatcaaagagaaaaataactcatcatatattcataagaaa aaaagaaacccagaaaaaacacgtgcccagtgtattactaaagaaagtgatgaagaactt gaaaatgatgatgatgatgatctaggaatcaatgttaccatcttccctgaagattactaa >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_10|56_aa MLTVKQLLTGPSGSTPEEGIVIIEGDSFMCVIAPEDLPVVQDVEEEDSDTEDSDPV >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_10|171_bp atgttaactgtaaaacagcttctgacaggtccttcaggaagtactccagaagaaggcatt gttatcatagaaggtgacagcttcatgtgtgttattgcccctgaagaccttccagtggta caagatgtggaggaggaagacagtgatactgaggattctgaccctgtgtag >gi568815597r:74606299_74824821|GENSCAN_predicted_peptide_11|96_aa MTFDEVERHHKDSKRHLLKEKEKFLSARGCLEIQYMSVIMLNDSFKLFPLGLFDPWVQER RHDTGTDASYMLISIFRRSTSTGQPTDQPTKLRDHM >gi568815597r:74606299_74824821|GENSCAN_predicted_CDS_11|291_bp atgacctttgatgaagttgagaggcatcacaaagattcgaagaggcacctgttgaaggag aaagagaaatttctttcagcaagaggctgcttggaaatacagtacatgtcggtaataatg ttaaatgacagcttcaaactatttcccttaggcctgtttgacccgtgggttcaagaaaga aggcatgatactggaactgatgcaagctatatgctcatttctatatttagacgctcaaca tcaacaggtcagcccactgaccaaccaaccaagttaagggatcatatgtag