GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:16:01 Sequence gi568815597f:74633245_74864110 : 230866 bp : 37.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 3194 3099 96 2 0 45 88 73 0.148 2.29 1.05 Intr - 8215 8087 129 1 0 64 105 91 0.889 8.37 1.04 Intr - 9854 9783 72 2 0 62 116 50 0.930 3.88 1.03 Intr - 13548 13423 126 0 0 48 83 83 0.902 3.76 1.02 Intr - 16071 15978 94 2 1 93 119 14 0.960 4.05 1.01 Init - 21534 21422 113 0 2 71 74 63 0.387 2.93 1.00 Prom - 24258 24219 40 -6.25 2.04 PlyA - 24275 24270 6 1.05 2.03 Term - 25030 24930 101 2 2 38 42 138 0.781 1.51 2.02 Intr - 25613 25514 100 2 1 94 98 51 0.936 5.46 2.01 Init - 26006 25944 63 2 0 97 89 -12 0.882 1.00 2.00 Prom - 29598 29559 40 -5.25 3.00 Prom + 36226 36265 40 -6.05 3.01 Init + 38391 38434 44 2 2 74 63 30 0.642 -0.96 3.02 Intr + 40178 40320 143 2 2 140 65 88 0.453 10.78 3.03 Term + 40485 40609 125 1 2 90 44 89 0.618 2.27 3.04 PlyA + 41788 41793 6 1.05 4.10 PlyA - 41885 41880 6 1.05 4.09 Term - 69425 69205 221 0 2 22 42 212 0.670 6.22 4.08 Intr - 73213 73090 124 1 1 50 27 144 0.562 3.74 4.07 Intr - 73750 73655 96 1 0 55 97 68 0.886 3.69 4.06 Intr - 73960 73859 102 1 0 49 92 64 0.831 2.35 4.05 Intr - 77003 76854 150 2 0 106 93 104 0.999 12.14 4.04 Intr - 81386 81335 52 1 1 99 83 65 0.998 5.09 4.03 Intr - 86128 85965 164 1 2 97 90 223 0.993 21.25 4.02 Intr - 90026 89874 153 2 0 63 67 134 0.989 8.15 4.01 Init - 91577 91467 111 2 0 108 75 98 0.965 10.76 4.00 Prom - 92055 92016 40 -8.95 5.00 Prom + 92180 92219 40 -6.05 5.01 Init + 95908 95952 45 0 0 90 59 93 0.311 7.33 5.02 Term + 99859 100080 222 0 0 85 49 166 0.701 8.33 5.03 PlyA + 100927 100932 6 1.05 6.00 Prom + 101042 101081 40 -14.16 6.01 Init + 101152 101202 51 0 0 63 98 74 0.675 7.11 6.02 Intr + 103298 103378 81 1 0 80 103 34 0.661 3.02 6.03 Intr + 105446 105544 99 1 0 63 58 110 0.577 4.89 6.04 Intr + 106814 106862 49 1 1 52 73 78 0.335 0.03 6.05 Intr + 106913 107098 186 0 0 35 79 147 0.285 7.24 6.06 Term + 107347 107804 458 2 2 8 41 228 0.244 4.40 6.07 PlyA + 107949 107954 6 -1.95 7.07 PlyA - 108000 107995 6 -4.04 7.06 Term - 108280 108003 278 2 2 84 42 269 0.274 16.54 7.05 Intr - 109386 109007 380 2 2 -38 48 252 0.212 2.28 7.04 Intr - 109873 109596 278 1 2 54 68 140 0.246 3.99 7.03 Intr - 110850 110707 144 0 0 103 83 53 0.372 5.86 7.02 Intr - 111033 111029 5 1 2 87 94 0 0.392 -7.57 7.01 Init - 111490 111331 160 1 1 84 89 125 0.780 12.23 7.00 Prom - 122527 122488 40 -2.25 8.00 Prom + 122744 122783 40 -5.45 8.01 Init + 122989 123332 344 0 2 60 47 286 0.846 18.45 8.02 Term + 123570 124233 664 0 1 79 38 303 0.884 16.75 8.03 PlyA + 125799 125804 6 1.05 9.00 Prom + 130002 130041 40 -1.95 9.01 Sngl + 130690 130869 180 0 0 80 43 201 0.702 9.55 9.02 PlyA + 131136 131141 6 1.05 10.00 Prom + 132903 132942 40 -5.85 10.01 Init + 133022 133188 167 1 2 18 -1 245 0.089 7.75 10.02 Intr + 149383 149474 92 1 2 69 58 64 0.036 0.22 10.03 Intr + 159354 159393 40 1 1 113 86 33 0.186 1.96 10.04 Intr + 160315 160429 115 1 1 59 38 113 0.090 2.93 10.05 Intr + 211504 211620 117 0 0 91 92 45 0.300 4.94 10.06 Term + 220512 220610 99 2 0 98 48 15 0.005 -4.35 10.07 PlyA + 220615 220620 6 1.05 11.03 PlyA - 221079 221074 6 1.05 11.02 Term - 222599 222527 73 1 1 105 34 96 0.479 2.30 11.01 Init - 227064 226904 161 0 2 98 68 42 0.330 2.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 133022 133192 171 1 0 18 44 248 0.820 7.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_1|210_aa MGMIISSIMDCMSVSPQNSYVETLIPNVIVFGDEAFGRLLAAYNSLMDKHLAGYFNNTRI RRHLLRSGLITRSGRILSEKEYKLNMMKRDHQKYIRECLAQAIFHKVLDMERYHQLEIKK KLETLARKERIQRFKGEHTRRSVENNMPILSPHPPVGPKSNRGHSVLVDEGHSSPLALTA PRPYTAPGNMQPPIRLQPLPSNPAVETVPK >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_1|630_bp atggggatgatcatatcttctattatggactgcatgtctgtgtctccccaaaattcatat gttgaaaccctaatccccaatgtgatagtatttggagatgaggcctttgggaggttactt gctgcatataatagccttatggataaacacctggctgggtattttaacaatacaaggata aggcgtcatctcttaagatcaggactgatcacaagaagtggaagaatactttctgaaaaa gaatataaactaaatatgatgaagcgggatcatcaaaaatatatccgggaatgcttagcc caggcaatttttcataaagttcttgatatggagcgttaccatcagcttgaaataaaaaag aaattggagaccttagctaggaaggagcgaatccagaggtttaagggagagcacacaaga aggtctgttgaaaataacatgccaatcctgtctccccacccaccagttggcccaaagagt aatcgtggccatagtgttctggttgatgaaggacattccagtccgttagcactgacagcc cctcgaccatatactgctccaggaaatatgcagcctccaattcgattacagcctcttccc agtaatcctgcagtagaaactgttccaaag >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_2|87_aa MVLSYPWTFACGNPSSWSNPLMRKLSLIVSKQLVGTHRVQVSIEDLKLGISEGGAKMGKE DKELGSGGEKEEQKEEEYLIENIQFYG >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_2|264_bp atggtactctcttatccctggacctttgcatgtggtaatccctcttcctggagcaatcct cttatgaggaaattgagcttaatagtgagtaaacaacttgttggaactcatagagtacaa gtgtcaatagaagatttaaaactgggtatttcagagggtggggctaagatgggaaaggag gacaaggaattgggaagtggaggggaaaaggaagagcagaaggaggaggaatacctcata gagaacattcagttttatggttaa >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_3|103_aa MRVTKNQHEWEDLISRREEEGQLAEAATCHSVEKLQLSLTQRGENGSFLQDPFWTPRQVR RVASLLLREERGKRPGSGEAEALLEPSLRKLEVKPASTPFSGP >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_3|312_bp atgagggtaactaaaaaccagcatgagtgggaagatttaataagcaggagggaggaggag gggcagctggctgaagccgccacctgccacagcgtggaaaagctccagttgtcacttacc cagcggggtgagaatggctcatttttgcaggatcccttttggaccccgcggcaggtgcgg agggtggcttccctgttgctaagggaggagagaggcaagcgcccagggtctggagaagcg gaggcgctactggaaccgagcctgcggaagctggaagtgaagccagcttctacccctttt tctgggccctga >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_4|390_aa MATGQKLMRAVRVFEFGGPEVLKLRSDIAVPIPKDHQVLIKVHACGVNPVETYIRSGTYS RKPLLPYTPGSDVAGVIEAVGDNASAFKKGDRVFTSSTISGGYAEYALAADHTVYKLPEK LDFKQGAAIGIPYFTAYRALIHSACVKAGESVLVHGASGGVGLAACQIARAYGLKILGTA GTEEGQKIVLQNGAHEVFNHREVNYIDKIKKYVGEKGIDIIIEMLANVNLSKDLSLLSHG GRVIVVGSRGTIEINPRDTMAKESSIIGVTLFSSTKEEFQQYAAALQAGMEIGWLKPVIG SQYPLEKVAEAHENIIHAKPDNVKIGAEDARTLVQQALELTQNIEEAISLQILIWKKLTS LMGREEGHKYISGIVVKSMGSEGKETGFKA >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_4|1173_bp atggcgactggacagaagttgatgagagctgttagagtttttgaatttggtgggccagaa gtcctgaaattgcgatcagatattgcagtaccgattccaaaagaccatcaggttctaatc aaggtccatgcatgtggtgtcaaccccgtggagacatacattcgctctggtacttatagt agaaaaccactcttaccctatactcctggctcagatgtggctggggtgatagaagctgtt ggagataatgcatctgctttcaagaaaggtgacagagttttcactagcagcacgatctct gggggttatgcagagtatgctcttgcagcagaccacactgtttacaaactacctgaaaaa ctggactttaaacaaggagctgccatcggcattccatattttactgcttatcgagctctg atccacagtgcctgtgtgaaagctggagagagtgttctggttcatggggcaagtggagga gttggattagcagcatgccaaattgctagagcttatggcttaaagattttgggcactgct ggtactgaggaaggacaaaagattgttttgcaaaatggagcccatgaagtgttcaatcac agagaagtgaattacattgataaaattaagaagtatgttggtgagaaaggaattgatata attattgaaatgttagctaatgtaaatcttagtaaagacttgagtcttctgtcacatgga ggacgagtgatagttgttggcagcagaggtactattgaaataaacccacgagacaccatg gcaaaggagtcgagtataattggagttactctcttttcctcaaccaaggaggaatttcag caatatgcagcagcccttcaagctggaatggaaattggctggttgaaacctgtgataggt tctcaatatccattggagaaggtggccgaggctcatgaaaatatcattcatgcaaagcca gataatgtaaaaattggagcagaagatgcccgtaccttagtccagcaggcactggaattg acgcaaaatattgaagaggctatctcactccagatattgatctggaagaaactgacctct cttatgggcagggaggaaggccacaagtacatctctggtatagttgtcaagagcatgggt tctgaaggcaaagaaactggatttaaagcctag >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_5|88_aa MTLRKEQVLEYAARGAKALEFPKLRTKRKFGPFRPPLASIWLPPGRDEATMKEPSADPES VTHGSQRGVQEMEGAMFEQSGPQPEGQC >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_5|267_bp atgacgttacggaaagagcaggttttggaatatgcagctcgaggtgcgaaggcactagaa tttcccaaattaagaacgaagaggaagtttggaccttttcggccaccgctcgcttcaata tggctgcccccagggagagacgaggctaccatgaaggagccgagcgcagaccctgagtcc gtcacccatggatcgcagcgcggagttcaggaaatggaaggcgcaatgtttgagcaaagc ggacctcagccggaagggcagtgttga >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_6|307_aa MAPKYGSNDAGDLDLPTGINGFEVQKQNCCWLLVTHKLCVKDDVIVALKKANGDATLKFE PFVLHVQCRQLQDAQILWVIGLADFKNEAVDPRNVQMCPEFLPSSGFVVLLTSGVKLQTF AVSVTALKGGMSRVVCSSQWVRGLADFRNEAADFHTRHRALIGAFLQSADWCVYNPLARH RALIGAFLQSADWCVYNPLARHRALIGAFLQSADWCIYSPLATHRALIGAFLQSADWCIC SPLARQKSSPSPHLTQKVQLASSLNPPSKQDTSTAAGNLAMTALATSCWIGVKKGPCSCS VLQRGTL >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_6|924_bp atggccccaaagtacgggagtaatgatgctggcgatttggatctgccaacaggtataaat ggttttgaggttcagaaacaaaactgttgctggctactggttacacacaaactttgtgta aaagatgatgtgattgtagctctgaagaaagcaaatggtgatgccactttgaaatttgaa ccatttgttcttcatgtgcagtgtcgacaattgcaggatgcacagattctgtgggttatt ggtctcgctgacttcaagaatgaagccgtggaccctcgcaatgttcagatgtgtccggag tttcttccttccagtgggttcgtggtcttgctgacttcaggagtgaagctgcagaccttc gcagtgagtgttacagctcttaaaggtggcatgtccagagttgtttgttcctcccagtgg gttcgtggtcttgctgatttcagaaatgaagccgcagacttccacactagacatagagcg ctgattggtgcgtttttacagagtgctgattggtgcgtttacaatcctttagctagacac agagcgctgattggtgcatttttacagagtgctgattggtgcgtttacaatcctttagct agacacagagcgctgattggtgcatttttacagagtgctgattggtgcatttacagtcct ttagctacacacagagcgctaattggtgcgtttttacagagtgctgattggtgcatttgc agtcctctagctagacagaaaagttctccaagtccccacttgacccagaaagtccagctg gcttcatctctcaatccaccctctaaacaggacacctcaactgctgctgggaatttggcg atgacggctctagctacttcctgctggataggggtgaagaaggggccctgcagttgtagt gtcctccagaggggaactctttag >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_7|414_aa MSVCNFILEVSETKNPSEGTNSRHILATQMGHIWRPRWDTFWRPRWDNRLLPSGKKETSK EISKGPQKRLGYQLCPLQAVAGGEFGPTWVHVSFSLSDLKQIKGNHWKVHCPREQTFSEP EAPNQLTQQQDSGCRGQVPAHVITLTEPWACLTIEGQEIDFLLDTGVAFSVLISCLRELS SRSVTIQGILGQPVTSHRKGEGEKAEGNRWADAEAKIAARQNIPLEIPTEGPLVWNNPLQ EVKLQYSPTETEWGLSRGHSFLPSAWLTTEEGKVLIPKASQWKILKTLHETVHMGIENTH QMAKSLFTGPNLLQAIRQVVKAYSLWEGPYSVILSTPTAVKMAGVESWIHHTPVKFWTPP EEPAGPSAQESQDQPDQPRYTCEPLEDLHLLFWKETSQTKKALTSDPEEKPIPP >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_7|1245_bp atgagtgtctgcaacttcattcttgaagtcagtgagaccaagaacccatcggaaggaacc aattctagacacattttggcaacccagatgggacacatttggcgacccagatgggacaca ttttggcgacccagatgggacaatcgcctattgccaagtgggaagaaggaaacaagcaaa gaaatctccaaaggaccacaaaaacgcctgggctatcagttatgtccccttcaagctgta gcgggaggggaatttggcccaacttgggtacatgtctccttctccctctctgatttaaag cagatcaagggcaatcactggaaggtgcactgccccagagaacaaacgttctctgagcca gaagcccccaaccagctgacccaacaacaggactcagggtgccgggggcaagtgccagct catgtcatcaccctcactgagccctgggcatgtttaaccattgagggccaggaaattgac ttcctcctggacactggtgtggctttctcagtgttaatctcctgtctcagggagttgtcc tcaaggtccgttaccatccaaggaatcctgggacagcctgtaaccagccatcggaaaggt gaaggagaaaaggcagaaggaaaccgttgggcagatgctgaggctaaaattgctgccagg cagaacatcccattagaaatacctacggaaggacccttggtatggaacaaccccctccaa gaggttaagctccagtattccccaaccgaaacagagtggggactttcacgggggcatagt tttctcccctcggcgtggttaacgacagaagaaggaaaggtacttatacccaaagccagc cagtggaaaatacttaaaaccctccacgaaactgttcacatgggtattgaaaacacgcat caaatggccaaatccctatttacagggccaaatctcctccaggccatccgacaggtagtc aaagcctattctttgtgggaaggaccatactcggtaatcctctctacccccactgcagtt aagatggcaggagtggaatcttggattcaccacaccccagttaaattttggacacccccg gaggaacctgcgggaccatcagctcaggagtcccaagatcagccagaccagcctcgatac acctgcgaaccattggaggacttgcatctcctattttggaaggaaacatcccagactaaa aaggctcttacctctgatcctgaggaaaaacccattcctccttaa >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_8|335_aa MWKRLWNWVTGRGWNSLEDSEEDRKMWESLELPRDLSNGFDQNADNDMDKEIQADVVSDG DEELVGNWSKGDSCYVLAKRLAAFYPCPRDLWNFDVERDDLGYPAEFLSSKAFKRPRGLG GKSGFVGWAQGPCAVCSLGTWCPVSQPLKPWLKGTNVELGAVASEGASPKPWQLPCGVEP TSGQKSRIGIWGPPSRFQKIYGNTWMPRWKFSAEAGPSWRTSARAVPKGSVGSEPPHRVP TRAMPSGAVRSGLPSSRPQNDRSTDSLYCVPGKATDTQCQLVKVARREAVSCKATEAELP KTMGTHLLHQHDLDVRHGVKGDHFGALMFDCPAGF >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_8|1008_bp atgtggaagcgactttggaactgggtaacaggtagaggttggaacagtttggaggattca gaagaagacaggaaaatgtgggaaagtttggaacttcctagagacttgtctaatggcttt gaccaaaatgctgataatgatatggacaaggaaatccaggctgatgtagtctcagatgga gatgaggaacttgttgggaactggagcaaaggtgactcttgttatgttttagcaaagaga ctggcagcattttacccctgccctcgagacttgtggaactttgatgttgagagagatgat ttagggtatccggcagaatttctaagcagcaaagcattcaagaggcccagaggcctagga ggaaaaagtggttttgtgggctgggcccaaggtccctgtgctgtgtgcagcctaggaact tggtgccctgtgtcccagccactcaagccatggctgaaagggaccaatgtagagcttggg gctgtggcttcagagggtgcaagccccaaaccttggcagcttccatgtggtgttgagcct acgagtggacaaaagtcaagaattggcatctggggacctccatctagatttcagaagata tatggcaacacctggatgcccagatggaagtttagtgcagaggcggggccctcatggaga acctctgctagggcagtgccaaagggaagtgtggggtcagagcccccacacagagtccct actcgggcaatgcctagtggagctgtgagaagcgggctaccatcctccagaccccagaat gatagatccaccgacagcttgtactgtgtacctggaaaagccacagacactcagtgccag cttgtgaaggtagccagaagggaagctgtatcctgcaaagccacagaggcagagctgccc aagactatgggaacccacctcttgcatcagcatgatctggatgtgagacatggagtcaaa ggagatcattttggagctttaatgtttgactgccctgctggattttag >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_9|59_aa MTNLHPKIKEKNNSSYIHKKKRNPEKTRAQCITKESDEELENDDDDDLGINVTIFPEDY >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_9|180_bp atgactaacttacatcccaagatcaaagagaaaaataactcatcatatattcataagaaa aaaagaaacccagaaaaaacacgtgcccagtgtattactaaagaaagtgatgaagaactt gaaaatgatgatgatgatgatctaggaatcaatgttaccatcttccctgaagattactaa >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_10|209_aa MLTVKQLLTGPSGSTPEEGIVIIEGDSFMCVIAPEDLPVVQDVEEEDSDTEDSDPVNLKF PPCPITALPGAGYTQRTEFLSAPNKHGRNVTQKEAIKSRSPIQFTNQLYQGWSWRPPTSY KWKSAENLSATADRESVRGEASMEPRRCVTQPPLLDNVPGGTLEIPVKFSKPTSRIYSLQ KSIKITGGDEIRKREMWCCLGQGARHDKR >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_10|630_bp atgttaactgtaaaacagcttctgacaggtccttcaggaagtactccagaagaaggcatt gttatcatagaaggtgacagcttcatgtgtgttattgcccctgaagaccttccagtggta caagatgtggaggaggaagacagtgatactgaggattctgaccctgtgaacctgaagttc cctccatgccctatcactgctctgccaggtgctggctacacccaacgtactgaattcctc agtgcgccaaataagcatgggagaaatgtcacccaaaaggaagctattaagtcaagaagt cccatccaattcaccaaccagctctaccagggctggtcctggagaccacccacatcctac aagtggaaaagtgctgaaaacctgtcagccacagctgacagagagtccgtcaggggtgag gcttccatggaaccaagaagatgtgtcactcagcctccacttctagataatgttccaggt gggaccctggaaatccctgtcaaattctctaaacccacttccaggatctattctttacag aaatcgataaaaatcactggaggtgacgagattaggaaaagagaaatgtggtgttgcctt ggtcagggagccagacatgacaaaagatga >gi568815597f:74633245_74864110|GENSCAN_predicted_peptide_11|77_aa MPLVLSCDTILMRSGCLKVSSHPCEDVPVSPSTSAMIVSFLRPPAMLTAQPVELPAAMDM VIERANELEARKPGFKI >gi568815597f:74633245_74864110|GENSCAN_predicted_CDS_11|234_bp atgcccctagtgctgtcttgtgataccattctcatgagatctggttgtttgaaagtgtcc tcccacccatgtgaagatgtgcctgtttctccttcaacttcagccatgattgtaagtttc ctgaggcccccagccatgcttactgcacagcctgtggaattgccagcggccatggatatg gttattgaaagggcaaatgaactagaagccaggaaacctggattcaagatctga