GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:20:11 Sequence gi568815597r:227632395_227835195 : 202801 bp : 45.30% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 442 540 99 0 0 93 89 82 0.520 7.07 1.02 Intr + 14150 14276 127 1 1 117 68 57 0.655 6.85 1.03 Term + 21966 23434 1469 1 2 57 42 1233 0.498 106.42 1.04 PlyA + 23475 23480 6 1.05 2.06 PlyA - 23593 23588 6 1.05 2.05 Term - 27183 26953 231 0 0 13 44 202 0.215 4.77 2.04 Intr - 31619 31548 72 2 0 67 74 86 0.198 4.70 2.03 Intr - 48781 48767 15 1 0 142 81 10 0.021 1.44 2.02 Intr - 73723 73641 83 2 2 74 92 70 0.976 5.36 2.01 Init - 74305 74167 139 1 1 48 92 160 0.890 12.70 2.00 Prom - 77044 77005 40 -7.36 3.00 Prom + 77477 77516 40 -6.76 3.01 Init + 77804 77924 121 1 1 68 74 7 0.255 -2.35 3.02 Intr + 81583 81843 261 2 0 101 64 108 0.726 7.16 3.03 Term + 82132 82514 383 2 2 56 42 206 0.971 7.80 3.04 PlyA + 83290 83295 6 1.05 4.07 PlyA - 85578 85573 6 1.05 4.06 Term - 100282 99998 285 1 0 98 42 309 0.891 22.70 4.05 Intr - 100633 100487 147 1 0 99 75 216 0.858 21.83 4.04 Intr - 101287 101020 268 0 1 93 55 287 0.857 23.43 4.03 Intr - 101689 101513 177 0 0 56 81 280 0.785 23.13 4.02 Intr - 102422 102257 166 0 1 62 95 95 0.983 6.62 4.01 Init - 103017 102618 400 0 1 50 52 534 0.999 40.83 4.00 Prom - 106951 106912 40 -8.06 5.00 Prom + 108156 108195 40 -8.46 5.01 Init + 110535 110576 42 2 0 96 44 80 0.256 3.05 5.02 Intr + 115298 115839 542 1 2 89 80 656 0.519 56.60 5.03 Intr + 121353 121514 162 0 0 69 57 114 0.942 5.49 5.04 Intr + 126850 127091 242 1 2 77 92 298 0.451 26.29 5.05 Intr + 134565 134689 125 1 2 108 110 20 0.013 6.50 5.06 Term + 148133 148279 147 1 0 112 50 172 0.988 13.70 5.07 PlyA + 148810 148815 6 1.05 6.00 Prom + 149626 149665 40 -11.63 6.01 Init + 149966 150198 233 1 2 111 94 93 0.784 7.95 6.02 Intr + 153869 153939 71 2 2 104 77 32 0.686 2.53 6.03 Term + 153989 154209 221 0 2 26 48 171 0.233 4.10 6.04 PlyA + 158096 158101 6 1.05 7.00 Prom + 159765 159804 40 -5.96 7.01 Init + 162175 162240 66 0 0 60 90 115 0.731 7.98 7.02 Intr + 164896 165066 171 0 0 126 111 15 0.614 7.74 7.03 Term + 165825 166028 204 2 0 52 39 101 0.263 -1.03 7.04 PlyA + 166519 166524 6 1.05 8.00 Prom + 172138 172177 40 -4.16 8.01 Init + 183344 183470 127 1 1 86 57 230 0.739 18.02 8.02 Intr + 183696 183858 163 1 1 106 115 196 0.989 23.13 8.03 Intr + 184815 185086 272 0 2 93 97 252 0.791 23.79 8.04 Term + 189288 189337 50 1 2 66 42 71 0.339 -2.23 8.05 PlyA + 191413 191418 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 7148 7015 134 2 2 72 93 90 0.860 7.52 S.002 Init + 141152 141299 148 1 1 71 68 108 0.805 5.35 S.003 Term + 143395 143540 146 2 2 118 55 76 0.919 5.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_1|564_aa MVAGRSQMAASLLFSDLGFLASWIPWNGTLGYVGLLAFSDVVIEFSPEEWACLDPAQRNL YRDVMFENYRNLVSLDLLPEQDMKDLCQKVTLTRHRSWGLDNLHLVKDWRTVNEGKGQKE YCNRLTQCSSTKSKIFQCIECGRNFSWRSILTEHKRIHTGEKPYKCEECGKVFNRCSNLT KHKRIHTGEKPYKCDECGKVFNWWSQLTNHKKIHTGEKPYKCDECDKVFNWWSQLTSHKK IHSGEKPYPCEECGKAFTQFSNLTQHKRIHTGEKPYKCKECCKAFNKFSNLTQHKRIHTG EKPYKCEECGNVFNECSHLTRHRRIHTGEKPYKCEECGKAFTQFASLTRHKRIHTGEKPY QCEECGKTFNRCSHLSSHKRIHTGEKPYKCEECGRTFTQFSNLTQHKRIHTGEKPYKCKE CGKAFNKFSSLTQHRRIHTGVKPYKCEECGKVFKQCSHLTSHKRIHTGEKPYKCKECGKA FYQSSILSKHKRIHTEEKPYKCEECGKAFNQFSSLTRHKRIHTGEKRYKCKECGKGFYQS SIHSKYKRIYTGEEPDKCKKCGSL >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_1|1695_bp atggtggcgggccgctcccaaatggcagcaagccttttgttctctgacctggggttcttg gcctcatggattccttggaatggaaccttgggctatgtgggactactggcattcagtgat gtggtcatagaattctctccagaggagtgggcatgcctggaccctgcccagcgaaatttg tatagggatgtgatgttcgagaactacagaaacctggtctccctggaccttttgccagag caggatatgaaagatttatgccaaaaagtgacactgacaagacatagaagctggggcctt gacaatttgcacttagtgaaagactggagaactgtgaatgaaggtaaggggcagaaagaa tattgcaatagacttactcaatgttcatcaactaaaagcaaaatctttcaatgtattgaa tgtggcagaaattttagctggaggtcaatccttactgaacataagagaattcatactgga gagaagccatacaaatgtgaagaatgtggcaaagttttcaatcgatgttcaaacctaaca aaacataaaagaattcatactggagagaaaccctacaaatgtgacgaatgtggcaaagtt tttaattggtggtcacaactaactaaccataagaaaattcatactggagagaaaccctac aaatgtgatgaatgtgacaaagtttttaattggtggtcacaactaactagccataagaaa attcatagtggagagaaaccatacccatgtgaagaatgtggcaaagcctttacccagttc tcaaaccttacacaacataagagaattcatactggagagaaaccctacaaatgcaaagaa tgttgcaaagcctttaacaagttctcaaaccttactcaacataagagaattcatactgga gagaaaccttacaagtgtgaagaatgtggcaacgtttttaatgagtgctcacacctaact agacataggagaattcatactggagagaaaccctacaaatgtgaagaatgtggcaaagcc tttacacagtttgcaagccttactcgtcataaaagaattcatactggagaaaaaccctac caatgtgaagaatgtggcaaaacttttaatcggtgttcacacctaagtagccataagaga attcatactggagagaaaccctacaaatgtgaagaatgtggcagaacctttactcaattc tcaaacctcactcagcataaaagaattcatactggagagaaaccctacaaatgcaaagaa tgtggcaaagcgtttaacaagttctcaagccttactcaacataggagaattcatactgga gtgaaaccctacaaatgtgaagaatgtgggaaagtttttaaacagtgctctcacctaact agccataagagaattcatactggagagaaaccctacaaatgtaaagaatgtggcaaagct ttttaccaatcctcaatccttagtaagcataagagaattcatactgaagagaaaccctac aaatgtgaagaatgtggcaaagcctttaaccagttctcaagccttactcgtcataaaaga attcatactggagagaaacgctacaaatgtaaagaatgtggaaaaggtttttaccaatcc tcaatccatagtaagtataagagaatttatactggagaggaacctgacaaatgtaaaaaa tgtggcagtctttaa >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_2|179_aa MPLQELVSFEDVAVGFTWEEWQDLDDAQRTLYRDMMLETYSSLVSLGYHITKPEVIFNLE QGEPWMVEDTRASQDISELSSMQEINKKLFDGSNYSISLDEGQHVLEPAQNMGVTKGLYG FHFKFGTPGVSGADADHMDDIKEAGGANSPISGHGVNTEIRFASKITKEIFQSKSSLDL >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_2|540_bp atgccattacaggagttggtgtcctttgaggatgtggctgtgggcttcacctgggaagag tggcaggacttggatgatgctcaaaggactctgtacagggacatgatgctggagacctac agcagcttggtgtccttggggtaccacattaccaaacctgaggtgatcttcaacctggag caaggagaaccttggatggtagaggacaccagagcctcccaggatatttctgaactgagc tccatgcaggagatcaataaaaagctgtttgatgggtccaactactccatctctttggat gaagggcagcatgttctagaaccagcacagaacatgggagtcaccaaaggactctatggc tttcactttaagtttgggacacctggagtttctggtgctgatgctgatcatatggacgac attaaagaagcaggtggtgctaacagtcccatctctggccatggtgtgaacacagaaatc aggtttgcatccaaaatcaccaaggaaatatttcaatccaaaagctctctagatctctga >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_3|254_aa MTLNEHAAFKHLFNKGHLAPPLIHSTLSGHSTCFREHRVGALRNPGSPDEWVSPKCSKPP SPRDSQSAMLNGSCFLCHPTGSDPPAGAVRYPIQERSYWHQVGVPQGQRFQRKEQALAVL QARVTSPARAQNWTEDEMDEFTEVGFRRRVITNFTEVKEHVLTQCKEVTNLKKRLEELLP RITSLEMYINDRMELKNTARELRGAYTSINSPINQVEEKISEMEDYLAEIRQADKIREKR IKRNEQNLQEMWNY >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_3|765_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaaggacatcttgcacca cccttaatccattcaaccctgagtggacacagcacatgtttcagagagcacagggttggg gctctgaggaatccgggcagcccagacgagtgggtttcccccaaatgcagcaaacctcct tcaccaagggacagccaaagtgctatgttaaatgggtcctgcttcctgtgccacccaact gggtcagaccccccagcaggagctgtcagatatcctatacaggagcgttcctactggcat caggttggtgtccctcaaggtcagagattccagagaaaggagcaggccttggctgttctc caggctcgagtgacatctccagcaagggcacagaactggacagaggatgagatggacgaa ttcacagaagtaggcttcagaaggagggtaataacaaacttcactgaggtaaaggagcat gttctaactcaatgcaaagaagttacaaaccttaaaaaaaggttagaggagctgctacct agaataaccagtttagagatgtacataaatgaccggatggagctgaaaaacacagcaaga gaacttcgtggagcatatacaagtatcaatagcccaatcaatcaagtagaagaaaaaata tcagagatggaagactatcttgctgaaataaggcaggcagacaagattagagaaaaaaga ataaaaaggaatgaacaaaacctccaagaaatgtggaactattaa >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_4|480_aa MRAGPEPQALAGQKRGALRLLVPRLVLTVSAPAEVRRRVLRPVLSWMDRETRALADSHFR GLGVDVPGVGQAPGRVAFVSEPGAFSYADFVRGFLLPNLPCVFSSAFTQGWGSRRRWVTP AGRPDFDHLLRTYGDVVVPVANCGVQEYNSNPKEHMTLRDYITYWKEYIQAGYSSPRGCL YLKDWHLCSLLVPHVLSVVPLRPPHRDFPVEDVFTLPVYFSSDWLNEFWDALDVDDYRFV YAGPAGSWSPFHADIFRSFSWSVNVCGRKKWLLFPPGQEEALRDRHGNLPYDVTSPALCD THLHPRNQLAGPPLEITQEAGEMVFVPSGWHHQVHNLDDTISINHNWVNGFNLANMWRFL QQELCAVQEEVSEWRDSMPDWHHHCQVIMRSCSGINFEEFYHFLKVIAEKRLLVLREAAA EDGAGLGFEQAAFDVGRITEVLASLVAHPDFQRVDTSAFSPQPKELLQQLREAVDAAAAP >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_4|1443_bp atgcgtgcaggcccggagccccaggcgctggcggggcagaaacgcggcgccctgcgtctt ctggttccgaggctggtcctcaccgtttccgctccggcggaagtgaggaggagggtcctt cgacccgtgctgagctggatggaccgcgagacgcgcgccctcgccgacagccacttccga ggcctgggggtcgatgtccccggcgtcggccaggctccgggccgggtagccttcgtctcg gagccgggcgccttctcctacgccgactttgtgcggggcttcttgctgcccaacctgccc tgcgtgttttccagcgccttcacgcagggctggggcagccggcggcgctgggtgacgccc gcggggaggcccgacttcgaccacctgctacggacctacggagacgtggttgtaccagtt gcaaactgtggggtccaggaatacaactcgaaccccaaagagcacatgactctcagagac tacatcacctactggaaagagtacatacaggcgggctactcctctcccaggggctgtctc tacctcaaagactggcacttgtgcagtctcctcgtgcctcacgtgctgtcggtggtgccc ttgcgccctcctcacagggactttccggtggaggacgttttcaccctgcctgtgtacttc tcgtccgactggctgaatgagttctgggatgcactggatgtggatgactaccgctttgtc tacgcggggcctgcgggcagctggtccccgttccatgctgacatcttccgctccttcagc tggtctgtcaatgtctgtgggaggaagaagtggctcctcttccccccagggcaggaagag gccctgcgggaccgccacggcaacctgccctacgacgtgacctccccagcactctgcgac acacacctgcacccacggaaccagcttgctggcccacccttggagatcacgcaggaagcg ggcgagatggtgtttgtgcccagtggctggcaccaccaggtgcacaacctggatgacacc atctccatcaaccacaactgggtcaatggcttcaacctggccaacatgtggcgcttcttg cagcaggagctatgcgccgtgcaggaggaggtcagcgagtggagggactccatgcccgac tggcaccaccactgccaggtcatcatgaggtcctgctcgggcatcaactttgaagagttt taccacttcctcaaggtcatcgctgagaagaggctcctggtcctgagggaggcagccgct gaggacggtgctgggttgggtttcgaacaggcagcctttgatgttgggcgcatcacagag gtgctggcctccttggttgcgcaccccgacttccagagagtggacaccagcgcgttctca ccacagcccaaagagctgctgcagcagctgagagaggctgttgatgctgctgcggcccca tag >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_5|419_aa MGPVCGLPTLAMALRQKRPGPWRTQTQEQMSRDVCIHTWPCTYYLEPKRRWVTGQLSLTS LSLRFMTDSTGEILVSFPLSSIVEIKKEASHFIFSSITILEKGHAKHWFSSLRPSRNVVF SIIEHFWRELLLSQPGAVADASVPRTRGEELTGLMAGSQKRLEDTARVLHHQGQQLDSVM RGLDKMESDLEVADRAFNRALKPATEATAGFSALLPKPPRGAYRQAGYGSPTPRQHLGVD VYCSWSPSGFEREDVDDIKVHSPYEISIRQRFIGKPDMAYRLISAKMPEVIPILEVQFSK KMELLEDALVLRSARTSSPAEKSCSVWHAASGLMGRTLHREPPAGDQEGTALHLQTSLPA LSEADTQELTQILRRMKGLALEAESELERQDEALDGVAAAVDRATLTIDKHNRRMKRLT >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_5|1260_bp atgggtcctgtgtgtggcctgccgaccctcgccatggccctgaggcagaagaggcctgga ccttggcgcacacagacccaggaacagatgagcagggatgtctgcatccacacctggccg tgcacctactacctggagcccaagaggcgatgggttactggacagctgtccttaacatcg ctgtcgctcaggttcatgactgacagcactggagagattctggtcagcttccccctctcc agcatagttgagatcaagaaggaggcttcacattttatcttcagctccatcaccatcctg gagaagggccatgccaagcactggttcagctccctgcggccaagtcgaaatgtggtcttc agcatcatcgagcatttctggagggagctgctgctgtctcagcctggagccgtggcagac gcatctgtcccaaggacccggggcgaggagctgacgggactcatggctggatcccagaaa cgcctggaggacacggcgagggtcctgcaccaccagggccagcagctggacagcgtcatg agaggcctggacaagatggagtcagacctggaggtggcggacagggcattcaacagggca ctgaagccagccactgaagccacggctggcttcagtgccctgctgcccaaacctcctagg ggagcatacagacaggcaggctatgggtctccaaccccacgacagcatctaggggtggat gtttactgctcctggagccccagtgggtttgaaagagaagacgtggacgacatcaaggtc cactcaccttacgaaattagcatccgccagcggtttattggaaagccagacatggcctat cgtttgatatctgccaagatgccagaggttatccccattttagaagtgcagttcagcaag aagatggagctgttagaagatgcattggtgctcagaagcgcaagaacctcttcccccgca gagaagagctgctcagtctggcatgcagcatctgggctgatgggccgtaccctgcaccgt gagccacccgcaggagaccaggagggcacagcactgcacctgcagacaagcctgccagcc ctttctgaggcagatacccaggaactaacccagatcctgaggaggatgaaggggctggcc ctggaggccgagagtgagctggagagacaagacgaagccctggatggcgttgcagcagct gtggacagggcaaccttgaccatcgacaagcacaacaggcggatgaagaggctgacctag >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_6|174_aa MAPPPVSAGSRQQGLTMPAATVQTPIKPGRGCTLSSGPHSVSGVSSVPDPSLFSGLHSVG GVSPVPDLSLSASPCNISSGADGDVLMPLLMVTCVQTSHRGGRWKARSALWRDLAAEKHV APVIGPSGCYKFPGMAAAGWQPARKARMGQGGVAAHSNCKPQEKKGCFCSVDLL >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_6|525_bp atggccccccctcctgtcagtgctgggagccggcagcaagggctcaccatgcctgcagcc accgtgcagacccccatcaagcctggaagaggctgcaccctgtcctcaggacctcacagt gtgagtggggtctcctctgtccctgacccctcgctgttctcaggacttcacagtgtgggt ggggtctcccctgtccctgacctctcactgtcagcaagtccttgcaatatcagctcaggg gctgatggagatgtactgatgcctctcctgatggtcacttgtgttcagacgtcccacaga ggtgggaggtggaaggcacgcagcgcactgtggagagaccttgctgctgagaagcacgtg gcacctgtcataggcccctctggctgctacaagttcccgggcatggcggctgcagggtgg cagccagcccggaaagccaggatgggccagggaggtgtggcggcccacagcaactgcaag ccccaggaaaagaagggctgcttctgctctgttgacttgctgtga >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_7|146_aa MPWAATALQELEVRLASSQGAEVGAVRASRERGHRRKVGAVMAQGPPAAPAGLDNGRVCG AMALLQDWGRFAPECSPAQRDLGPGSSGFSGAMRDALGLSRFEAWEASKMPQELRASEWG LAGQLCTWTAGCLPFLAVGSVDPIHF >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_7|441_bp atgccgtgggcggccacggctctgcaggagctggaagtcaggctggcctccagtcaggga gccgaggtgggggctgtgagagcctcgagggaaagaggacacaggaggaaggtgggtgcg gtgatggctcagggaccacctgcagctccagctggcctggataatggcagggtctgtggg gcaatggccctgctacaggactggggccgctttgctcctgaatgctcacctgcccagagg gacctgggtcctgggagctctggcttcagtggagccatgcgtgatgctctaggcctgtcc cgctttgaagcttgggaggcatctaagatgccacaggagcttcgtgcctctgagtggggc cttgctgggcagctgtgcacctggacagcagggtgtcttccgtttctggcagtaggctct gtggatcccattcacttctaa >gi568815597r:227632395_227835195|GENSCAN_predicted_peptide_8|203_aa MGPLGPSALGLLLLLLVVAPPRVAALVHRQPENQGISLTGSVACGRPSMEGKILGGVPAP ERKWPWQVSVHYAGLHVCGGSILNEYWVLSAAHCFHRDKNIKIYDMYVGLVNLRVAGNHT QWYEVNRVILHPTYEMYHPIGGDVALVQLKTRIVFSESVLPVCLATPEVNLTSANCWATG WGLVSKQVFYEKLAINLIEDPCT >gi568815597r:227632395_227835195|GENSCAN_predicted_CDS_8|612_bp atgggcccactcgggccctctgccctgggccttctgctgctgctcctggtggtggcccct ccccgggtcgcagcattggtccacagacagccagagaaccagggaatctccctaactggc agcgtggcctgtggtcggcccagcatggaggggaaaatcctgggcggcgtccctgcgccc gagaggaagtggccgtggcaggtcagcgtgcactacgcaggcctccacgtctgcggcggc tccatcctcaatgagtactgggtgctgtcagctgcgcactgctttcacagggacaagaat atcaaaatctatgacatgtacgtaggcctcgtaaacctcagggtggccggcaaccacacc cagtggtatgaggtgaacagggtgatcctgcaccccacatatgagatgtaccaccccatc ggaggtgacgtggccctggtgcagctgaagacccgcattgtgttttctgagtccgtgctc ccggtttgccttgcaactccagaagtgaaccttaccagtgccaattgctgggctacggga tggggactagtctcaaaacaagttttttatgagaaattggctattaatctgattgaggat ccttgtacatga