GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:40:19 Sequence gi568815590f:60417208_60720766 : 303559 bp : 40.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 619 614 6 1.05 1.01 Sngl - 5123 3966 1158 2 0 51 55 820 0.737 70.95 1.00 Prom - 8561 8522 40 -6.25 2.00 Prom + 8828 8867 40 -7.65 2.01 Init + 16744 16833 90 0 0 60 85 133 0.994 10.74 2.02 Term + 20158 20319 162 0 0 125 45 59 0.963 2.25 2.03 PlyA + 20346 20351 6 1.05 3.02 PlyA - 20399 20394 6 1.05 3.01 Sngl - 26876 26574 303 2 0 78 54 222 0.807 13.38 3.00 Prom - 55312 55273 40 -4.15 4.00 Prom + 55826 55865 40 -5.65 4.01 Init + 62595 62696 102 2 0 51 43 123 0.257 3.84 4.02 Term + 73605 73856 252 2 0 60 48 198 0.324 7.65 4.03 PlyA + 75141 75146 6 1.05 5.00 Prom + 76459 76498 40 -4.85 5.01 Init + 77164 77326 163 0 1 93 62 127 0.798 10.54 5.02 Term + 77352 77491 140 1 2 -15 54 175 0.777 0.94 5.03 PlyA + 78025 78030 6 1.05 6.00 Prom + 79813 79852 40 -8.45 6.01 Sngl + 80134 80526 393 0 0 59 55 282 0.944 17.99 6.02 PlyA + 80559 80564 6 1.05 7.00 Prom + 81252 81291 40 -2.85 7.01 Init + 82174 82237 64 0 1 58 29 85 0.081 0.96 7.02 Intr + 99854 100046 193 0 1 77 94 255 0.785 22.63 7.03 Term + 100184 100334 151 2 1 96 43 81 0.841 0.80 7.04 PlyA + 100997 101002 6 1.05 8.00 Prom + 104164 104203 40 -6.15 8.01 Init + 129885 130005 121 2 1 68 74 91 0.618 6.10 8.02 Intr + 141645 141716 72 1 0 114 111 16 0.696 4.96 8.03 Intr + 154839 154906 68 1 2 76 127 72 0.860 7.61 8.04 Intr + 167001 167083 83 2 2 100 93 -1 0.532 -0.88 8.05 Intr + 174651 174762 112 0 1 88 84 151 0.989 14.16 8.06 Intr + 201373 201441 69 0 0 93 61 53 0.016 1.56 8.07 Term + 203467 203562 96 0 0 101 47 66 0.033 0.79 8.08 PlyA + 203709 203714 6 1.05 9.03 PlyA - 204499 204494 6 1.05 9.02 Term - 214372 213947 426 1 0 -32 39 269 0.447 4.32 9.01 Init - 215934 215854 81 0 0 69 97 40 0.240 4.03 9.00 Prom - 217054 217015 40 -5.15 10.00 Prom + 218209 218248 40 -3.45 10.01 Init + 225136 225169 34 0 1 68 58 59 0.010 -1.02 10.02 Intr + 228163 228307 145 2 1 37 42 121 0.265 0.82 10.03 Intr + 231305 231436 132 2 0 117 75 152 0.981 15.64 10.04 Intr + 234825 235046 222 0 0 39 14 273 0.726 11.32 10.05 Term + 239641 239770 130 1 1 78 35 200 0.895 10.37 10.06 PlyA + 240428 240433 6 1.05 11.02 PlyA - 241711 241706 6 1.05 11.01 Sngl - 243762 243562 201 0 0 62 48 195 0.153 7.83 11.00 Prom - 243860 243821 40 -4.75 12.00 Prom + 245095 245134 40 -6.85 12.01 Init + 255251 255360 110 1 2 72 75 69 0.515 3.74 12.02 Intr + 256076 256247 172 2 1 87 69 79 0.939 4.92 12.03 Intr + 258851 258894 44 1 2 101 62 55 0.757 0.32 12.04 Intr + 261539 261875 337 2 1 -5 105 366 0.995 23.60 12.05 Term + 279234 279326 93 2 0 58 38 104 0.015 -0.75 12.06 PlyA + 280148 280153 6 1.05 13.04 PlyA - 280361 280356 6 1.05 13.03 Term - 297627 297570 58 2 1 84 51 90 0.094 1.08 13.02 Intr - 302514 302442 73 1 1 106 61 67 0.254 3.35 13.01 Init - 303357 303246 112 0 1 85 70 91 0.660 7.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 36346 36421 76 0 1 83 89 34 0.827 4.30 S.002 Sngl + 296950 297177 228 0 0 82 48 216 0.832 9.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_1|385_aa MDSQVWRNSQPLYTLIKKTQKANTHLVEWTPEAEAAFQALKKSLTQAPVLSLPVGQDFSL YVTEKTGIALGVLTQVLGTSLQPVAYLSKEIDVVAKSWPHCLWVVAAVAVLVSEAIKIIQ GRDLIVWTSHDVNGILTAKGDLWLSDNHLLKYQALLLEGPVLRLHTCATLNPVTFLPDNE EKTERNCQQVIAQTYATRGDLLEVPLTDPYLNLYTDGSSFVEKGLRKAEYAVVSDNGILE SNPLTPGTSAQLAELIALTQALELGEGKRVNIYTDSKYAYLVLHAQAAIWREREFLTSEG TPIKHQEAIRRLLLAVQKPKEVAVLHCQGHEKGKEREIGGNRQADIEAKRAARRDPPLEM LIEGPLVWGNPLQETKSQYSAEEIE >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_1|1158_bp atggattcccaggtatggcgaaatagccagccattatacacactaattaagaaaactcag aaagccaatactcatttagtagaatggacacctgaagcagaagcggctttccaggcccta aagaagagcctaacccaagccccagtgttaagcttgccagtggggcaagacttttcttta tatgtcacagaaaaaacaggaatagctctaggagtccttacacaggtcctagggaccagc ttgcaacctgtggcatatctgagtaaggaaattgatgtagtggcaaagagttggcctcat tgtttatgggtagtggcagcagtagcagtcttagtatctgaagccattaaaataatacag ggaagagatcttatcgtgtggacatctcatgatgtgaacggcatactcactgctaaagga gacttgtggctgtcagacaaccatttacttaaatatcaggctctattacttgaaggacca gtgctgcgactgcacacctgtgcaactcttaacccagtcacatttcttccagacaatgaa gaaaagacagaacgtaactgtcaacaagtaattgctcaaacctacgctactcgaggggac cttctagaggttcccttgactgatccctacctcaacttgtatactgatggaagttccttt gtagaaaaaggacttcgaaaggcagagtatgcagtggtgagtgacaatggaatacttgaa agtaatcccctcactccaggaaccagtgctcagctggcagaactaatagccctcactcag gcactagaattaggagaaggaaaaagggtaaatatatatacagactctaagtatgcttac ctagtcctccatgcccaagcagcaatatggagagaaagggaattcctaacttccgaggga acacctatcaaacatcaggaagccattaggagattactattagctgtacagaaacctaaa gaggtggcagtcttacactgccagggtcatgagaaaggaaaggaaagggaaataggaggg aaccgccaagcagatattgaagccaaaagagccgcaaggcgggaccctccattagaaatg cttatagaaggacccctagtatggggtaaccccctccaggaaaccaagtcccagtattca gcagaagaaatagaatga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_2|83_aa MGDWGLFHPYSFDHKRRLPPEAAILGAYPQEYLIDDIVYFLTNLSKGHIISDCPIIKAVK VDQSDSSLIKCPIDCMHESVSTH >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_2|252_bp atgggagactggggcttatttcatccctacagcttcgaccataaaagacggctgccccct gaagcggccattttaggggcctaccctcaggagtacttaatagatgatattgtgtacttc ctaactaatctcagcaaagggcatataatatctgattgtcccattattaaagcagtaaag gttgatcagtctgattcatcccttataaagtgtccaatcgactgtatgcatgagagtgtt agtacccattga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_3|100_aa MGRNQSRKAENSKNQSASSPPKDLSSSPATIQSWMENDFGELTEVSFRRSVITNFSELKK DVRTHCKEAKNLEKRLDKWQTRINHVEKTLNDLMELKTMA >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_3|303_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaccagagtgcctcttctcct ccaaaggatctcagctcctcgccagcaacgatacaaagctggatggagaatgactttggc gagttgacagaagtaagcttcagaaggtcagtaataacaaacttctccgagctaaagaag gatgttcgaacccattgcaaggaagccaaaaaccttgaaaaaagattagacaaatggcaa actagaataaaccatgtagagaagaccttaaatgacctgatggagctgaaaaccatggca tga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_4|117_aa MGAPSRATQLSSAEVALKGLTAEGYRRQHFWGVGVAMNVGTVGWKQVTPRTIQGRSCHLQ VVSLKDTKPAQMLSTDIVKVLHYVSQKTRETIFRDLGDHQPKGTDWCHKSGYGIERK >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_4|354_bp atgggagcaccgtcccgggcaacacagctttcctcagctgaggttgctctgaagggactg acagcagagggctaccgaagacagcacttctggggtgtgggggttgccatgaatgtgggc acagttggatggaagcaggtcactcctaggacaatacaaggaagatcctgccaccttcaa gttgtgtctctaaaggacacaaagcctgctcagatgctgtcaacagatattgtaaaggtc cttcactatgtgagtcagaagaccagagaaactatctttagagatctgggagatcatcag cctaaaggtactgattggtgtcataagagtggttatggtattgagcgtaaatga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_5|100_aa MELDEWERIGRDFKKVHKEGAKIPFSVWSAWVLIKAALEPFETDDEADSYEEEADSECEG QDPEEIKEKKGKLKKVYFTSQSVPPAELSEWPPPPSPPNG >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_5|303_bp atggagttggatgaatgggaaagaattggcagagattttaaaaaggtgcataaagaagga gccaaaattccattttctgtttggtcagcgtgggtgttgataaaggcagctcttgagcca tttgaaacagatgatgaggcagattcatatgaggaagaggcagattctgaatgtgaggga caggacccggaggaaattaaagaaaagaaagggaagctgaaaaaggtgtattttactagc cagtcggttccacctgctgaattaagtgaatggccacctcctccctctcctcctaatggg tga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_6|130_aa MSYSRKRRLWEEPKLLIRVIPQTNISETKEKTENSRQANSPTWGQIKKLVQMAKDNLKAQ NKLKTTSNLMVAMLAVLTMAASLLTIGATQNFTYWAYVSFPPLIRSVSWMDPVIKVYTNV STWMPAPIDN >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_6|393_bp atgagctacagcaggaagagaagactctgggaagagccaaaactcctgatacgagtgatt ccgcaaacaaacatctcagagacaaaggagaaaaccgagaatagtcgtcaggcaaattct ccaacatggggacaaatcaagaagctggtgcagatggcaaaggacaacttgaaagcacag aacaaactaaaaacaactagtaacctgatggtggccatgctggcagtactcaccatggcg gcaagcctccttactataggagcaactcagaatttcacttattgggcatatgtctcattt cctcctttaattaggtctgtgagttggatggaccctgttattaaggtgtacaccaacgtc agtacctggatgccagcacccatagataactga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_7|135_aa MSGVQGLKPLVAFGTPSSVLKAPHSRRLTAAAAAAGGAWRFEAERHRGWGAEEEQQREEE PCALALSGRGHGVRLSLQVHHNRRHSAGAGDVGTASGRGLARPGLRPPTRPVLQLRAVAA VCDARFRKCVTGGEA >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_7|408_bp atgtccggggtccagggtctaaaacccctcgtggcctttggaacaccaagctctgtgcta aaggcccctcactcccggcggctgacagcagcagcggcggcggcgggcggcgcctggcgt ttcgaggctgagcggcaccggggttggggcgcggaggaggagcagcagcgggaggaggag ccgtgtgccctggcactgagcggccgcggccatggcgtacgcctatctcttcaagtacat cataatcggcgacacagtgctggagctggtgacgtgggcactgcgagcggccgcggcctg gccaggcccggtctgcggcccccgacccggccagtcttgcagctgagggccgtggcagcc gtttgcgacgcccgattcaggaagtgtgtgacaggaggggaggcctag >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_8|206_aa MTLKEHAAFKHLFNKAHLAPPLIHSTLSGYSTCFREHRVGGVGKSCLLLQFTDKRFQPVH DLTIGVEFGARMITIDGKQIKLQIWDTAGQESFRSITRSYYRGAAGALLVYDITRDLESR REVKKEEGEAFAREHGLIFMETSAKTASNVEEAFINTAKEIYEKIQEGVFDINNEANGIK IGPQHAATNATHAGNQGGQQAGGGCC >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_8|621_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccattcaaccctgagtggatacagcacatgtttcagagaacacagggttggg ggtgttggtaaatcatgcttattgctacagtttacagacaagaggtttcagccagtgcat gaccttactattggtgtagagttcggtgctcgaatgataactattgatgggaaacagata aaacttcagatatgggatacggcagggcaagaatcctttcgttccatcacaaggtcgtat tacagaggtgcagcaggagctttactagtttacgatattacacgtgatttagaatctaga agagaagtaaaaaaagaagaaggtgaagcttttgcacgagaacatggactcatcttcatg gaaacgtctgctaagactgcttccaatgtagaagaggcatttattaatacagcaaaagaa atttatgaaaaaattcaagaaggagtctttgacattaataatgaggcaaatggcattaaa attggccctcagcatgctgctaccaatgcaacacatgcaggcaatcagggaggacagcag gctgggggcggctgctgttga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_9|168_aa MLPGITSQQTYWQPNHVSVGFAENPDLRKERKRKKERKKRKEKKRKEKKEKEKEKERKKK GKKERKKKKERKKEKERKKERKKERKKERERKKERKKRKKEKRLSGKARRIRNAGHYCTC HVFKNVTVQTSDSASQDDIWPSDQATQWRCACQPREIMEVQGAKCSHA >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_9|507_bp atgcttcctgggatcacctcccagcaaacgtattggcaaccaaatcatgtcagcgttggc tttgcggagaatccagacttgaggaaagaaagaaagagaaagaaagaaagaaagaaaaga aaagaaaagaaaagaaaagaaaagaaagaaaaagagaaagagaaagaaagaaagaaaaaa ggaaagaaagaaagaaagaaaaagaaagaaagaaagaaagaaaaagaaagaaagaaagaa agaaagaaagaaagaaagaaagaaagagaaagaaagaaagaaagaaagaaaagaaagaaa gaaaaacgactaagtgggaaggcaagaagaattaggaatgctggacactactgcacttgc catgttttcaagaatgtcaccgtgcagaccagtgattccgcctcgcaggatgacatatgg ccctcggaccaggcgacacagtggagatgcgcatgccagcctcgtgaaatcatggaggtt caaggggcaaagtgttcacatgcctaa >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_10|220_aa MLARLVSNLTSARLLLWWGACESPEDVFRLWVRRSRYSHEDAAGVQTTLVTRLSVHNPSR ATDERSNGHKLQNQCLKRETTGTVRGSDPLALIAISDVPALSQRGQRHPGYRTGDGGGPS GSYQSPRDARLTAAGEEEPSRPKKSPQAQGRTAAAAACAATRLRLADSGAGTPGPARGEQ EKEQDGNVRKGKVSGKGRDAERINDHVKRAQRSFDGFALE >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_10|663_bp atgttggccaggctggtctcgaacctgacctcagcccggcttctcctgtggtggggtgca tgtgagtcccctgaagatgtctttagactctgggtcagacgatccaggtacagccatgag gatgctgctggcgttcaaaccacacttgtgacaaggctttctgttcacaacccatccaga gctacagatgagcgctctaatggccacaaactacagaaccagtgcctaaagagagagacc acaggaactgtaaggggtagtgacccgttagccttgattgccattagcgatgtgccagct ctcagccaaagggggcagcgacatccgggctaccggactggcgacggcggcggcccctcg ggcagctaccagtccccgcgggatgcccggctcaccgccgccggagaagaggagccctcc aggccaaaaaaatccccccaagcccagggtcgcaccgcggcggccgcggcatgtgccgcc acccgcctccgcctcgcagactccggggccgggacgccggggccagcacgcggagagcaa gaaaaggaacaagatggaaatgtcagaaaagggaaagtaagcggcaaaggcagagatgca gaaaggatcaatgaccacgtgaagcgagctcaaagaagctttgatggctttgccttggaa taa >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_11|66_aa MIGWGREKRRDVWEWARRALRFGLAAEGGKGGRAGSKPELTLWGAGLAPKQRQPALCPAV EIVCGL >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_11|201_bp atgatcggatggggaagggagaagcgcagagatgtttgggaatgggcgagaagagcgttg agatttgggctagcggctgaaggtggcaagggtggaagggctggctctaaaccagagctg actctatggggtgcaggcctggctcccaagcagcggcagccagcactctgtcctgccgtg gagattgtttgtggcctttga >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_12|251_aa METMSGPILLRLRWVTASHGTLELVTEERRAKGAQSRSTCSGLDSLGYETTVVFLGIPPP LLVLMNLPILMNYCAYSRHSARLSSSRKASLISQVLSTFPLSSGYTELRRWRAGAAAAAA AAAAAAAAAAARGLSRGGADALVLGNYRIKLESSEITQRSAAEEAARGPGHPETHQRPVR PRPTPCPWIQPPGRSCRANSGARSAFCFPLGTRQRRSPTDPANYLIDLRRSFFSCDMGAV TVLTSSQAYVD >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_12|756_bp atggagacaatgagtggcccaatattacttagattaaggtgggtaacagcaagtcatgga acattggagctggtaactgaagagagaagagccaaaggggcacagtccagatccacttgc agtggcctggactcactgggctatgaaaccactgttgtcttcttgggaatccctcctccc ctgcttgttctgatgaatctccctattctgatgaattactgtgcctactcaagacacagt gcaaggctctcatcttctaggaaggcttccctgatatcccaggtccttagtacctttcca ctgtcctctggctacacggaactcaggcgctggcgtgctggggccgcggcggcggcggcg gcggcggcggcagcggcggcggcggcggcggcgcgggggttgagtcgtggtggtgcggac gcgctcgtgctcgggaactatcggattaaacttgaatcgagtgaaattacacaaaggagc gccgcggaggaggcggcccggggacccggacaccctgaaactcaccagagacccgttcgc ccccggccaactccgtgcccgtggattcagccccctggccgcagctgccgagccaactcc ggagcccgctctgcgttttgttttcccctcggcactaggcagcggaggagcccgaccgac ccggccaattacttaattgatctaagacgcagtttcttcagctgtgacatgggagcagta acagtacttactagttcacaggcttatgtggattaa >gi568815590f:60417208_60720766|GENSCAN_predicted_peptide_13|80_aa MVQRTGFKETKKAKNTHGMILDFTGNKRKAPIKLAKDTLTGAELHCLPNEEGPPVLELPE GRPGTQNYKWAVGYEGKTGC >gi568815590f:60417208_60720766|GENSCAN_predicted_CDS_13|243_bp atggtgcagaggacaggctttaaggaaacgaagaaggccaaaaacacacatggaatgata cttgacttcactggtaataaaagaaaggcacctatcaaattggcaaaagacacgctcact ggagctgagcttcactgcctgccgaatgaggagggtcccccagtgctggagctccctgag ggcagacctggaacacagaattacaagtgggctgtgggctacgaaggcaaaacaggctgc tga