GENSCAN 1.0    Date run: 7-Nov-116    Time: 18:13:49

Sequence gi568815594r:81332297_81559508 : 227212 bp : 39.38% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -     68     63    6                               1.05
 1.03 Term -  13534  13179  356  2  2  130   44   242 0.044  17.67
 1.02 Intr -  26960  26836  125  1  2   90   59    94 0.044   6.01
 1.01 Init -  42120  41969  152  0  2   90   38   118 0.056   6.56
 1.00 Prom -  51703  51664   40                              -6.85

 2.00 Prom +  55832  55871   40                              -3.65
 2.01 Init +  60189  60223   35  2  2   50   94    37 0.113  -0.10
 2.02 Intr +  60555  60748  194  0  2   82   53    95 0.093   3.61
 2.03 Intr +  85526  85718  193  0  1   84   12   156 0.478   5.23
 2.04 Intr +  91924  92024  101  1  2   64   72    92 0.409   4.03
 2.05 Term +  93426  93607  182  1  2  107   38   101 0.346   3.79
 2.06 PlyA +  93957  93962    6                               1.05

 3.06 PlyA -  94462  94457    6                               1.05
 3.05 Term -  95428  95238  191  2  2  107   54    57 0.323   0.83
 3.04 Intr -  97597  97280  318  2  0   73   40   199 0.147   8.61
 3.03 Intr - 101667 101544  124  0  1   86   91   108 0.918  10.14
 3.02 Intr - 102438 102343   96  0  0   90  105    11 0.825   2.29
 3.01 Init - 106580 106509   72  2  0   46   99    39 0.417   1.92
 3.00 Prom - 111590 111551   40                              -4.45

 4.10 PlyA - 112164 112159    6                               1.05
 4.09 Term - 113332 113180  153  1  0   74   38   157 0.986   6.24
 4.08 Intr - 113542 113447   96  1  0   88   77    66 0.964   4.79
 4.07 Intr - 115282 115208   75  1  0  110   84   103 0.999  10.89
 4.06 Intr - 115985 115773  213  2  0   67   95   241 0.641  20.69
 4.05 Intr - 124492 124304  189  1  0   96   35   186 0.839  12.96
 4.04 Intr - 125325 125203  123  0  0  107   90    57 0.964   7.66
 4.03 Intr - 127218 127036  183  0  0  125   45   149 0.858  13.36
 4.02 Intr - 140994 140821  174  0  0   76   58    92 0.613   4.21
 4.01 Init - 141714 141598  117  0  0   77   27    92 0.295   2.26
 4.00 Prom - 155301 155262   40                              -3.65

 5.00 Prom + 155672 155711   40                              -7.05
 5.01 Init + 155727 155851  125  2  2   82   91   101 0.830   9.49
 5.02 Intr + 167587 167639   53  1  2   94  101   -11 0.119  -1.57
 5.03 Intr + 169016 169132  117  0  0   57   86    88 0.530   5.02
 5.04 Intr + 173425 173540  116  2  2   49   89    38 0.172  -0.75
 5.05 Intr + 180817 180868   52  0  1  110   84    47 0.212   4.06
 5.06 Term + 186973 187049   77  2  2  103   54    67 0.626   1.82
 5.07 PlyA + 187471 187476    6                               1.05

 6.00 Prom + 189899 189938   40                              -5.15
 6.01 Init + 195120 195358  239  2  2   55   89   105 0.709   4.89
 6.02 Intr + 195398 195486   89  2  2   81   67    33 0.329  -0.80
 6.03 Term + 195892 196187  296  2  2   89   54    99 0.284   1.08
 6.04 PlyA + 196479 196484    6                               1.05

 7.00 Prom + 202302 202341   40                              -6.15
 7.01 Sngl + 202395 204503 2109  2  0   50   39   684 0.622  53.68
 7.02 PlyA + 204764 204769    6                               1.05

 8.04 PlyA - 205561 205556    6                               1.05
 8.03 Term - 211905 211797  109  2  1   34   43   141 0.257   1.20
 8.02 Intr - 219383 219152  232  0  1   69   71   122 0.158   4.51
 8.01 Init - 223417 223396   22  1  1   84   92    17 0.299   1.61

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_1|210_aa
MTSPNELNKAQGINTGETEICELSGREFETPALRKLKEIQDNREGIQNSIKCSTGTMAQI
QQSSMGAYVVHCQALPPCLIFLILFNSTQIATDTAVALLTGGECVDMRRKHFLCFSCQLH
PPLKLTLCHLDAKGLDPATPPFAERQCLGNRGQKNYGAACTGLREGAKPLQLILAVAIRG
AYPCPAVTLWSGIKGHSLYELKFISPLTGA

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_1|633_bp
atgacctcaccaaatgaactaaataaggcacaagggatcaatactggagaaacagagata
tgtgagctttcaggcagagaattcgaaacacctgctttgaggaaactcaaggaaattcaa
gataacagagaaggaattcagaattctatcaagtgcagtactggtaccatggctcaaatc
cagcagtcttccatgggagcctatgttgttcactgtcaagccttgccaccttgccttatc
ttcctgattctcttcaactcaactcagatagccacagacacagcagtagctcttctcact
gggggtgagtgtgtggacatgcggagaaagcattttttgtgcttttcatgccagctccac
cccccactgaaattgaccctatgccacttggatgccaaaggactggacccagctactcct
ccctttgcagagcggcaatgtcttggcaacagaggacagaaaaattatggagctgcctgc
actggactgcgggaaggggcaaagcctcttcaactcattttggcggtagccatcagaggg
gcatatccgtgtcctgcagtcacactgtggtcaggaatcaaaggtcacagtctttatgaa
ttgaagttcataagtcctctgacaggagcatga
>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_2|234_aa
MAKTAITFAPTYWNNFPHLPTFPKSTHPSKPVSGPLYPKSLCKPSPSHRRGTEETSRTTS
SGLEVSGSFTHWLWLGADRKDIQTLISERSYPIPERKKPCTERREEPKQTGFAGFPASLL
DPSLSSNHISTPLSLLHRTEAWHVDSSWKEKVQKEVKSMLEKTVNEEMTEVPKPETSVKA
YAFFLLLLLPPTDTGASPVALHGMACALSLGKLNNIKLSVALASVCHHSATPIN

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_2|705_bp
atggcaaaaacagcaattacgtttgcaccaacctactggaataactttccccaccttcct
acttttccaaaatccacccatccatcaaaacctgtgtcaggtcctctttatccaaagagc
ctgtgcaaaccttctccatctcaccgcagagggacagaggagacatccaggactacaagc
tctggactagaggtcagcggttcttttactcactggctgtggcttggagctgaccgtaaa
gacattcagaccctcatttcagagaggtcctaccccatacctgagaggaagaaaccttgc
acagagaggcgagaagaacctaaacagacaggctttgctggcttcccggccagtctatta
gatccttccctttcatccaatcacatttctacaccactctccctgcttcatcgaactgaa
gcgtggcatgtggacagcagttggaaagagaaagtacagaaagaagttaaaagtatgcta
gaaaaaacagtaaatgaagaaatgacagaggtgccaaagccagaaacctcagtcaaagcc
tatgctttcttcttgctcctgcttctgcccccgaccgacaccggtgcttcccctgtggcc
ctgcatggcatggcatgtgccctttctctggggaaattgaataatataaagctttcagtg
gcactggcctctgtgtgtcatcactcagctacccccataaattaa

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_3|266_aa
MKLEAIILSKLTQEQKTKHCMFSLIVIPFFSLLIKDIYFLNEGCANRLPNGHVNFEKFWE
LAKQVSEFMTWKQVECPFERDRKILQYLLTVPVFSEDVSGLPVCWRMSVCPSASVLPSVS
SRRPAAPQTAPAKLRLIPPVCPACWHLSVCPAGVPFYQCAPLNVLSTTRHLRLLQQMCSS
PRPATCVSVCRRSRVFIGPRWGRDHGGAGLCFLASRTTNTHEDPAVIGAPGSAKHTSSLK
TRWRTYFLQLTDRLRFCETEMFTEDT

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_3|801_bp
atgaagctagaagccatcatcctcagcaaactaacacaggaacagaaaaccaaacactgc
atgttctcactcattgtgataccattcttcagtctcttaatcaaagatatttatttcctc
aatgagggttgtgccaaccgccttcccaatggccatgtcaattttgagaaattttgggaa
ctggccaaacaagtgagtgaatttatgacatggaaacaagtggagtgtccatttgagagg
gaccggaagatcttgcagtatctgctcacagtaccagtcttcagtgaagatgtaagtggc
ctgccagtgtgctggcgtatgtcggtgtgcccatctgccagcgtgctcccctcagtgtcc
tctcgacgaccagccgctcctcagactgcccctgccaaactccgcctcatcccgccagtc
tgtcctgcgtgctggcatctttcggtgtgccctgccggcgtgcccttctaccagtgtgct
cccctcaatgtcctctcgacgactcgccacttgcgtcttcttcagcagatgtgttcctca
ccacgtccagctacttgtgtgtctgtctgccggcggtctcgggtttttataggcccaaga
tgggggcgtgatcatggaggggctggcctttgttttctggcatctcgtaccacgaacact
catgaagaccctgcagtcattggagcacccgggtcagcaaagcacacaagctcactcaag
accagatggagaacttatttcctgcagctgacagatagactcagattttgtgagactgaa
atgttcactgaagacacttga

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_4|440_aa
MVDAPPPTKLECPGLISDCCCAGSENFKPVDLSLLGSIGNRDAKTELEKSTNQYNHNSRA
GPPSLCVSLCMAGPPSSLGLMASHLNCREAMVKSKAQESMPQTPPFSAMFDSSGYNRNLY
QSAEDSCGGLYYHDNNLLSGSLEALIQHLVPNVDYYPDRTYIFTFLLSSRLFMHPYELMA
KVCHLCVEHQRLSDPDSDKNQMRKIAPKILQLLTEWTETFPYDFRDERMMRNLKDLAHRI
ASGEEVGNLNLARLLEFPGRAWTYRKNVQQMMQCLIRKLAALSQYEEVLAKISSTSTDRL
TVLKTKPQSIQRDIITVCNDPYTLAQQLTHIELERLNYIGPEEFVQAFVQKDPLDNDKSC
YSERKKTRNLEAYVEWFNRLSYLVATEICMPVKKKHRARMIEYFIDVARECFNIGNFNSL
MAIICEYFVEDTCRLSLRIL

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_4|1323_bp
atggtggatgcccctcctcccaccaagctcgagtgtcctgggttgatctcagactgctgc
tgtgctggcagtgagaatttcaagccagtggatcttagtttgctgggctccattgggaat
cgggatgcaaaaactgagcttgaaaaatcaaccaaccagtacaaccacaacagcagggca
gggccacccagcctctgtgtttcactctgcatggctggcccaccaagttcccttggcctt
atggcttctcatttaaactgcagagaagctatggttaagtcaaaggctcaagaaagtatg
cctcagactcctcccttttcagcaatgtttgacagcagtggttacaatcgaaacctctat
cagtctgcagaggacagctgtggagggttgtattaccatgacaacaacctcctctctgga
tccctggaagcactcatccagcacttagtacctaatgtggattactatccagatagaaca
tacatatttaccttcctactcagttctcggttatttatgcatccgtatgagctaatggcc
aaagtttgccacttatgtgttgagcaccagagactaagtgatcctgatagtgataagaac
cagatgagaaaaattgcacccaaaatccttcaactcctcacggaatggacggaaacattt
ccctatgattttcgggatgaaagaatgatgagaaacttaaaagatctggctcaccgaata
gccagtggcgaagaggttggtaacctgaatttagcgcggctgctggaattcccaggcaga
gcctggacatacagaaagaatgtccagcaaatgatgcagtgtctgatccgcaagcttgct
gcgctcagccagtacgaagaagtcctggcaaaaatcagctccacatccacagatcggctc
acagttctcaagaccaagccacagtctatacaaagggatatcattactgtctgcaacgac
ccttacacgttggcccagcagctgactcatatagagctggagaggctcaattatattggg
ccagaagaatttgttcaggcgttcgtgcagaaggaccctttggataatgacaagagttgc
tacagtgaacggaagaaaacacgaaacttagaagcttacgtggaatggtttaatcgcctc
agctacttggttgctacagaaatctgtatgcctgttaagaaaaaacaccgagcaagaatg
attgagtatttcattgacgtagctcgggagtgttttaacattggcaacttcaactccttg
atggcgataatctgtgagtattttgtggaggatacatgtcgtttatctttacgaatcctg
taa

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_5|179_aa
MGKVFMTKTPKAMATKAKIDEWDLIKLKSFCTAKEIIIRMNSMTSSQTLPDRLLQSPCEY
CLHFLVDWFEPGVIISALQMKKRKKLKKDKKLSPNRTGRVMKKSLANQVFSSIKVPKPTK
QPNKKPGTIREESKAAMPPNDVCQLEELHVYDLALPPNDQLYLESGEPGSRAEKTGGCI

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_5|540_bp
atgggcaaagtcttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgat
gaatgggatctaattaaactaaagagcttctgcacagcaaaagaaattatcatcagaatg
aacagcatgacatcctctcaaacgctacctgacagactgctgcaatcaccctgtgaatat
tgtcttcatttccttgttgattggtttgaacctggggtcattatctctgctttgcaaatg
aagaagagaaagaagctcaaaaaggacaagaaactttccccaaatcgaacaggtagggtt
atgaaaaaatctttggctaaccaagtattctctagcataaaagtaccaaaaccaaccaag
caaccaaacaaaaagccgggtacaatcagagaagaaagcaaggctgccatgcctccgaat
gatgtctgtcagctagaggaattacatgtgtatgacttagcattgcctcccaatgaccaa
ctctacctagaatccggagagcctggaagcagggcagagaagactggaggatgcatctga

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_6|207_aa
MPAGVLAPCEAVAGPVIWQASSMAGTGECSVLQKLGGARKHRASKKVPQLWLRELLDLGS
LKGHSNSLLLSSLLSSPATCSFSPTILQFLSSCSVSRKNEEHRQLECGQVPIASRLFVCR
GACSSVQNHPLSFPPVLISTQCPEGTKAAGCWHVSAAPSVYAPNQIVTTPRLGHNFALKS
ELVPGAGRGQTAGAGTSEPTVEGGPGP

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_6|624_bp
atgccggcaggggtgttggctccctgtgaggctgtggctggaccagtcatatggcaagca
tcttccatggctggcactggggaatgcagtgtcctccagaagcttggaggtgccaggaaa
cacagagcctcaaagaaggtgccacagctctggctcagggagcttttagatctgggctct
ctgaagggccacagcaattctctccttctctcttctctcctctcatcacctgcaacatgc
tctttcagccccaccattctgcagttcctgagttcttgttctgtgtccaggaagaatgag
gaacacagacaactggagtgtgggcaagtgcccatagcatccaggctgtttgtgtgcaga
ggtgcctgcagttccgttcaaaaccatcccctcagctttcctcctgtactcatcagcacc
caatgtccggaggggaccaaggcagcagggtgctggcatgtcagtgctgccccaagtgtg
tatgcacccaaccagattgtgacaacacctaggcttggccacaactttgctctgaaatca
gagctggtgccaggagcagggagaggccagacagcaggagcaggcacttctgagcctaca
gtggaaggggggcctggcccttga

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_7|702_aa
MGDLNTPLSTLHRSTRQKVNKDTQELNSALHQADLTDIYRTLHPKSTEYTFFSAPHHTYS
KIDHIVGSKALLSKYQRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETSKNKDTTYQNLWDTFKAVCRGKFIALNTHKRKQERSKVDTLTSQLK
ELEKQEQTHSKASRRQEITKIREELKEIETQKTLQKINKSRSWFFERINKIDRPLARLIK
KKRQKNQIDTIKNDKGNITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN
QEEVDFLNRPITGSEIVAIINSLPTKKSPGPEGFTAEFYQRYKEELVPFLLKLLQSIEKE
GILPNSFYEASITLIPKLGRDTTKKENFRPISLMNIDAKILNEILANQIQQHIKKLIHHD
QVGFIPGMQGWFNIRKSINVIQHVNRTKNKNHMIISIDAEKAFDKIQQAFMLKTLNTLGI
DGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVWEVLARAIRQE
KEIKGIQLGKEEVKLSLFADDMIVYLENPTVSAQNLLKLISNFNKVSGYKINVQKSQAFL
YTNNRQTESQIMSELPFTIASKRIKYLGIHLTRDEKDLFKENYKPLLKEIKEDTNKWKNI
PCSWVGRINVMKMAILPKVIYRFNAIPIKLPMPFFTDWKKLL

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_7|2109_bp
atgggagacttgaacaccccactgtcaacattacacagatcaacgagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagcggacctaacagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatagttggaagtaaagctctcctcagcaaatatcaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccagcaagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aacacccacaagagaaagcaggaaagatccaaagttgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacactcaaaagctagcagaaggcaagaaataactaaa
atcagagaagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaataaatcc
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag
aaaaaaagacagaagaatcagatagacacaataaaaaatgataaagggaatatcaccacc
aatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac
caggaagaagttgactttctgaatagaccaataacaggctctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagaaggattcacagccgaattctaccag
aggtacaaggaggaactggtaccattccttctgaaactactccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcaccttgataccaaagctgggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaatgaaatactagcaaaccaaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatgtaaacagaaccaaaaacaaaaaccacatgattatctcaatagatgcagaa
aaggcctttgacaaaattcaacaagccttcatgctaaaaactctcaatacattaggtatt
gatgggacatatctcaaaataataagagctatctatgacaaacccacagccaatatcata
ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct
ctctcaccactcctattcaacatcgtgtgggaagttctggccagggcaatcaggcaggag
aaggaaataaagggtattcagttaggaaaagaggaagtcaaattgtccctgtttgcagat
gacatgattgtatatctagaaaaccccactgtctcagcccaaaatctccttaagctgata
agcaatttcaacaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta
tacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct
tcaaagagaataaaatacctaggaatccaccttacaagggacgagaaggacctcttcaag
gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt
ccatgctcatgggtaggaagaatcaatgtcatgaaaatggccatactgcccaaggtaatt
tatagattcaatgccatccccatcaagctaccaatgcctttcttcacagattggaaaaaa
ctactttaa

>gi568815594r:81332297_81559508|GENSCAN_predicted_peptide_8|120_aa
MKPSDLEVNQPSCICLLTKLYYDLCRVENDNLGLGQIHCLSVDVAGKVWLTLRTDPPVVF
VLSEEETPGEKMLMITIADIIDVYGDDADLIVVDKLFDMLLDLVCQYFTEDFRIDVHQGY

>gi568815594r:81332297_81559508|GENSCAN_predicted_CDS_8|363_bp
atgaagccaagtgatcttgaagttaaccagccctcatgtatatgcctgcttacaaaatta
tattatgacttatgcagagtggagaatgacaacctgggcctgggtcagattcactgctta
agtgtggacgtggctggcaaagtctggttaactctcaggacagacccaccagttgtcttt
gtgttgagtgaggaagagactcctggtgaaaaaatgctaatgataactattgctgacatt
atagacgtatatggggatgatgccgacttgatcgtggtggataagctttttgatatgcta
ctggatttggtttgccagtattttactgaggattttcgcatcgatgttcatcagggatat
tga