GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:20:31 Sequence gi568815597r:167985534_168204850 : 219317 bp : 39.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1962 2075 114 2 0 101 58 48 0.813 2.82 1.02 Intr + 5671 5806 136 0 1 65 110 155 0.999 14.62 1.03 Intr + 7693 7907 215 2 2 124 92 158 0.999 17.31 1.04 Intr + 16949 17042 94 1 1 50 48 155 0.786 6.32 1.05 Intr + 18337 18456 120 2 0 59 76 59 0.561 1.35 1.06 Intr + 19000 19260 261 2 0 32 81 171 0.804 7.34 1.07 Intr + 30248 30418 171 0 0 69 30 133 0.240 4.59 1.08 Intr + 37455 37514 60 1 0 87 69 55 0.029 1.29 1.09 Intr + 52838 52955 118 0 1 46 115 78 0.356 4.90 1.10 Intr + 57492 57607 116 0 2 29 80 74 0.397 -0.13 1.11 Intr + 59052 59138 87 1 0 43 78 100 0.916 3.52 1.12 Term + 59367 59710 344 1 2 55 32 400 0.364 24.79 1.13 PlyA + 59895 59900 6 1.05 2.02 PlyA - 62491 62486 6 1.05 2.01 Sngl - 70889 70368 522 2 0 116 48 561 0.999 48.70 2.00 Prom - 77376 77337 40 -5.95 3.12 PlyA - 78054 78049 6 1.05 3.11 Term - 80239 80121 119 2 2 54 48 133 0.016 3.62 3.10 Intr - 99587 99392 196 2 1 14 8 197 0.001 2.37 3.09 Intr - 100263 100064 200 1 2 127 44 222 0.137 19.85 3.08 Intr - 101167 101018 150 2 0 88 61 98 0.979 6.31 3.07 Intr - 102171 102052 120 1 0 52 58 76 0.490 0.55 3.06 Intr - 104368 104066 303 2 0 33 23 179 0.590 1.54 3.05 Intr - 105135 105031 105 1 0 90 91 99 0.365 9.67 3.04 Intr - 107605 107276 330 2 0 40 19 216 0.174 4.58 3.03 Intr - 110152 110070 83 0 2 136 51 65 0.153 5.96 3.02 Intr - 111699 110872 828 2 0 115 91 911 0.586 83.69 3.01 Init - 113720 113530 191 2 2 90 61 179 0.450 13.93 3.00 Prom - 115514 115475 40 -2.85 4.08 PlyA - 115605 115600 6 1.05 4.07 Term - 116158 115953 206 2 2 35 36 195 0.818 5.55 4.06 Intr - 117707 117624 84 0 0 109 61 37 0.773 1.97 4.05 Intr - 118343 118221 123 0 0 50 47 86 0.549 0.34 4.04 Intr - 118554 118391 164 2 2 74 55 219 0.993 15.90 4.03 Intr - 119294 118944 351 1 0 86 119 415 0.198 37.91 4.02 Intr - 127526 127342 185 2 2 78 43 115 0.005 3.66 4.01 Init - 144728 144573 156 2 0 66 53 161 0.708 10.36 4.00 Prom - 148326 148287 40 -6.65 5.04 PlyA - 148470 148465 6 1.05 5.03 Term - 149614 149480 135 1 0 83 54 84 0.913 1.54 5.02 Intr - 150606 150477 130 2 1 65 106 75 0.969 6.78 5.01 Init - 150863 150724 140 2 2 32 99 122 0.803 5.51 5.00 Prom - 169233 169194 40 -5.95 6.00 Prom + 174314 174353 40 -3.75 6.01 Init + 176314 176723 410 0 2 78 57 680 0.756 59.97 6.02 Intr + 193508 193648 141 2 0 41 53 189 0.007 9.25 6.03 Intr + 198369 198548 180 0 0 51 94 130 0.753 7.96 6.04 Intr + 199246 199345 100 1 1 2 94 15 0.551 -7.31 6.05 Intr + 205836 205967 132 2 0 110 68 117 0.917 11.82 6.06 Term + 214370 214513 144 1 0 62 38 126 0.837 1.93 6.07 PlyA + 214644 214649 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100263 99998 266 1 2 127 55 294 0.861 24.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_1|611_aa IMTVPNDPYTFLSCGEDGTVRWFDTRIKTSCTKEDCKDDILINCRRAATSVAICPPIPYY LAVGCSDSSVRIYDRRMLGTRATGNYAGRGTTGMVARFIPSHLNNKSCRVTSLCYSEDGQ EILVSYSSDYIYLFDPKDDTARELKTPSAEERREELRQPPVKRLRLRGDWSDTGPRARPE SERERDGEQSPNVSLMQRMSDMLSRWFEEASEVAQSNRGRGRSRPRGGTSQSDISTLPTV PSSPDLEVSETAMEVDTPAEQFLQPSTSSTMSAQAHSTSSPTESPHSTPLLSSPDSEQRQ SVEASGHHTHHQSEFLRGPEIALLRKRLQQLRLKKAEQQRQQELAAHTQQQPSTSDQSSH EGSSQDPHASDSPSSVVNKQLGSMSLDEQQDNNNEKLSPKPGTGEPVLSLHYSTEGTTTS TIKLNFTDEWSSIASSSRGIGSHCKSEGQEESFVPQSSVQPPEGDSETKAPEESSEDVTK YQEGVSAENPVENHINITQSDKFTAKPLDSNSGERNDLNLDRSCGVPEESASSEKAKEPE TSDQTSTESATNENNTNPEPQFQTEATGPSAHEETSTRDSALQDTDDSDDDPVLIPGARY RAGPGDRLVNF >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_1|1836_bp attatgactgtacccaatgacccttacacttttctctcttgtggtgaagatggaactgtt aggtggtttgatacacgcatcaaaactagctgcacaaaagaagattgtaaagatgatatt ttaattaactgtcgacgtgctgccacgtctgttgctatttgcccaccaataccatattac cttgctgttggttgttctgacagctcagtacgaatatatgatcggcgaatgctgggcaca agagctacagggaattatgcaggtcgagggactactggaatggttgcccgttttattcct tcccatcttaataataagtcctgcagagtgacatctctgtgttacagtgaagatggtcaa gagattctcgttagttactcttcagattacatatatctttttgacccgaaagatgataca gcacgagaacttaaaactccttctgcggaagagagaagagaagagttgcgacaaccacca gttaagcgtttgagacttcgtggtgattggtcagatactggacccagagcaaggccggag agtgaacgagaacgagatggagagcagagtcccaatgtgtcattgatgcagagaatgtct gatatgttatcaagatggtttgaagaagcaagtgaggttgcacaaagcaatagaggacga ggaagatctcgacccagaggtggaacaagtcaatcagatatttcaactcttcctacggtc ccatcaagtcctgatttggaagtgagtgaaactgcaatggaagtagatactccagctgaa caatttcttcagccttctacatcctctacaatgtcagctcaggctcattcgacatcatct cccacagaaagccctcattctactcctttgctatcttctccagacagtgaacaaaggcag tctgttgaggcatctggacaccacacacatcatcagtctgaatttttaaggggccctgag atagctttgcttcgtaagcgcctgcaacaactgaggcttaagaaggctgagcagcagagg cagcaagagctagctgcacatacccagcaacagccttccacttctgatcagtcttctcat gagggctcttcacaggaccctcatgcttcagattctccttcttctgtggttaacaaacag ctcggatccatgtcacttgacgagcaacaggataacaataatgaaaagctgagccccaaa ccagggacaggtgaaccagttttaagtttgcactacagcacagaaggaacaactacaagc acaataaaactgaactttacagatgaatggagcagtatagcatcaagttctagaggaatt gggagccattgcaaatctgagggtcaggaggaatctttcgtcccacagagctcagtgcaa ccaccagaaggagacagtgaaacaaaagctcctgaagaatcatcagaggatgtgacaaaa tatcaggaaggagtatctgcagaaaacccagttgagaaccatatcaatataacacaatca gataagttcacagccaagccattggattccaactcaggagaaagaaatgacctcaatctt gatcgctcttgtggggttccagaagaatctgcttcatctgaaaaagccaaggaaccagaa acttcagatcagactagcactgagagtgctaccaatgaaaataacaccaatcctgagcct cagttccaaacagaagccactgggccttcagctcatgaagaaacatccaccagggactct gctcttcaggacacagatgacagtgatgatgacccagtcctgatcccaggtgcaaggtat cgagcaggacctggtgataggttggtaaatttttaa >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_2|173_aa MALRVVRSVRALLCTLRAVPSPAAPCPPRPWQLGVGAVRTLRTGPALLSVRKFTEKHEWV TTENGIGTVGISNFAQEALGDVVYCSLPEVGTKLNKQDEFGALESVKAASELYSPLSGEV TEINEALAENPGLVNKSCYEDGWLIKMTLSNPSELDELMSEEAYEKYIKSIEE >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_2|522_bp atggcgctgcgagtggtgcggagcgtgcgggccctgctctgcaccctgcgcgcggtcccg tcacccgccgcgccctgcccgccgaggccctggcagctgggggtgggcgccgtccgtacg ctacgcactggacccgctctgctctcggtgcgtaaattcacagagaaacacgaatgggta acaacagaaaatggcattggaacagtgggaatcagcaattttgcacaggaagcgttggga gatgttgtttattgtagtctccctgaagttgggacaaaattgaacaaacaagatgagttt ggtgctttggaaagtgtgaaagctgctagtgaactctattctcctttatcaggagaagta actgaaattaatgaagctcttgcagaaaatccaggacttgtaaacaaatcttgttatgaa gatggttggctgatcaagatgacactgagtaacccttcagaactagatgaacttatgagt gaagaagcatatgagaaatacataaaatctattgaggagtga >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_3|874_aa MTSDRSESQQATHKGKKTTMGLVRPPGLKRASVPWGSSQKQEQCRAFVLKEEGGKVDGQS PIEGYYAVLYPMVYPMKITGNRAVMALVYIWLHSLIGCLPPLFGWSSVEFDEFKWMCVAA WHREPGYTAFWQIWCALFPFLVMLVCYGFIFRVARVKARKVHCGTVVIVEEDAQRTGRKN SSTSTSSSGSRRNAFQGVVYSANQCKALITILVVLGAFMVTWGPYMVVIASEALWGKSSV SPSLETWATWLSFASAVCHPLIYGLWNKTVRKELLGMCFGDRYYREPFVQRQRTSRLFSI SNRITGNLGIVRIAPPRQMKSVVLSATLACLWVSSQADWRSVISSDGPRQSGGVHAKPQP SIEASWPCGLRMVKHQYKADWWLLAMGAKKRLQEKVQAAPPPALPLADPEPSLQPGLESP STTPLPSQGGWPRQYQGAPPPNAALTRPLKLLGIEPFPLSPAKDTDSRLSRGKHLAPDLG LSPHLTALMAGGQPLGHSSSTGDTGFSCSQDSGSCKPRAAPCLSSRLPGIRVMAPGRWGA ADAVSCAGSTLLEPQEALGRGAWAEAGSHTHGSTLLRKSQGVWPHLLPIPRLAVGALAPG GSAAEAAQSRMPGRTDMMLLEDYTSDDNPPSHCTCPPKRRSSVTFEDEVEQIKGKNELNA PQPPSPRRATLAPSVLLCECGSSKGTLQVVQEQGTARGSSPAMKAAKNSILHVKAEVHKS LDSYAASLAKAIEAEAKINLFGEEALPGVLVTARTVPGGGFGGRRGSRTLMPQEENPPQC TVKDDTALLPNDTRPSWCLYMVDAGSVGPSVLGLVRRESTQEQSASERSSVAPPAALVLE KISYLGQTDVAAGSLPHDYQLPEASNAQQCADPR >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_3|2625_bp atgacgtcagacaggtccgagtctcagcaggccacccacaaggggaagaagaccaccatg gggcttgttcgccccccagggctcaaaagggccagtgtcccctgggggtcatcgcagaag caggagcagtgtagggctttcgtcctcaaggaggaagggggaaaggtcgatggccagtcc cccattgaaggctactatgctgtcctgtaccccatggtgtaccccatgaagatcacaggg aaccgggctgtgatggcacttgtctacatctggcttcactcgctcatcggctgcctgcca cccctgtttggttggtcatccgtggagtttgacgagttcaaatggatgtgtgtggctgct tggcaccgggagcctggctacacggccttctggcagatctggtgtgccctcttccccttt ctggtcatgctggtgtgctatggcttcatcttccgcgtggccagggtcaaggcacgcaag gtgcactgtggcacagtcgtcatcgtggaggaggatgctcagaggaccgggaggaagaac tccagcacctccacctcctcttcaggcagcaggaggaatgcctttcagggtgtggtctac tcggccaaccagtgcaaagccctcatcaccatcctggtggtcctcggtgccttcatggtc acctggggcccctacatggttgtcatcgcctctgaggccctctgggggaaaagctccgtc tccccgagcctggagacttgggccacatggctgtcctttgccagcgctgtctgccacccc ctgatctatggactctggaacaagacagttcgcaaagaactactgggcatgtgctttggg gaccggtattatcgggaaccatttgtgcaacgacagaggacttccaggctcttcagcatt tccaacaggatcacaggtaacttgggcattgttaggatagcccctccgaggcagatgaaa tctgtggtcctctcagccactctggcctgtttgtgggtgagttctcaggctgactggcga tcagtgatctcctcagatggtcccagacaatccgggggtgtccatgccaaacctcagcct tccatcgaagcatcatggccttgtggtcttaggatggtaaagcaccaatacaaagctgac tggtggctactggcaatgggagccaagaagaggcttcaggagaaagtgcaggcagcccct ccacctgcccttccgctggccgacccagaaccttcactccagccagggctggagagcccg agcaccacccctctccccagccagggagggtggcccaggcagtatcagggggcaccccct ccaaacgccgcccttacgagacccttgaagctcctgggcatagagcctttcccgttatct cctgcgaaggacactgactcacgtttgtccaggggaaaacacctcgctccagacctgggc ctgtccccacacctcactgcgctcatggcaggtggacagcccctggggcacagcagcagc acgggggacactggcttcagctgctcccaggactcaggatcttgcaagcccagggctgct ccttgtctgagcagtcggcttcctggcatcagagtcatggcaccggggaggtggggagct gcagatgctgtatcctgtgctggaagcacgctgttagagccacaggaggccctgggcaga ggcgcatgggcagaggcagggagtcacactcatggctccacattgctgaggaagagccag ggtgtctggccacacctgctgcccatccccaggctggctgttggtgcactggcgcctgga gggagtgccgctgaagctgcacaaagccgcatgcctggcaggacagatatgatgctgctt gaggactacacgtctgatgacaaccctccctctcactgcacttgcccacccaagagaagg agctcggtgacatttgaggatgaagtggaacaaatcaaaggaaaaaatgaactaaatgct ccccaaccccccagcccacgtagggccacacttgccccatcagtcctgctttgtgagtgt ggcagctccaaaggcaccctccaggtggtgcaagagcaaggcactgccagggggtcctcc cctgccatgaaagctgccaagaactcgattcttcatgtgaaagctgaagtacacaagtcc ttggacagttacgcagcaagcttggccaaagccattgaggccgaagccaaaatcaactta tttggggaggaggctttgccaggggtcttggttacagcacggactgtcccggggggcggc ttcgggggccgccgaggcagcagaactcttatgcctcaggaggagaaccctccccagtgt actgtgaaggatgacacagcacttcttcctaatgacacgcgaccgtcctggtgcctctac atggttgatgcgggcagtgtgggaccctcagttctaggactggtccgcagagaaagcacc caggagcagagcgcttcggagcggtcctcagtggcgccacctgctgccctcgtcctagaa aagatatcttacttgggtcaaacggatgtggctgcaggcagtttaccacatgattatcag cttccagaagcatcaaatgctcagcagtgtgccgatcccagatga >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_4|422_aa MDVVCEKGGSPEEQLLDKGKRYRGGRQLISSVDEKGRNSAPEGILENCSAYRIHPEGFAG ACAFATPDLLAETGILDFTGWAALRSTFLLCLLGNIGPEGLKDCKPTLGSLIRSCRKELS NLTEEEGGEGGVIITQFIAIIVITIFVCLGNLVIVVTLYKKSYLLTLSNKFVFSLTLSNF LLSVLVLPFVVTSSIRREWIFGVVWCNFSALLYLLISSASMLTLGVIAIDRAARTGNTLT AHSPGLKGEQGEGRQPENTGSDTGERPCAFGNSAAPALNVADARGGSRIMLSDLVVQVLW ALAASFPQMSAFIRILLMATAFITGTGDGEHFLLFSNILEEGDFSWLRGIPSCGVSSANP HGNLVSPDFADEQSEVFEALSNLPKVTWLGSNSPSSEMPEPGRFVIVHHQLSAASHSSSQ LA >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_4|1269_bp atggatgtggtgtgtgaaaagggtgggtctccagaggagcaactgctggacaagggtaag agatatcgaggaggccgacagttaatttcctctgtagatgagaaaggaagaaatagtgcc ccagagggcatcttggaaaactgcagcgcttacaggatacaccctgagggatttgctggg gcttgtgccttcgctactccagacctcctggcagagactgggattttggatttcactggg tgggcagctcttaggtccacattcctcctctgtcttctggggaatattggaccagaaggc cttaaagactgcaagccaactctgggctctctgattcgcagctgcaggaaggagctgagt aatctcactgaggaggagggtggcgaagggggcgtcatcatcacccagttcatcgccatc attgtcatcaccatttttgtctgcctgggaaacctggtcatcgtggtcaccttgtacaag aagtcctacctcctcaccctcagcaacaagttcgtcttcagcctgactctgtccaacttc ctgctgtccgtgttggtgctgccttttgtggtgacgagctccatccgcagggaatggatc tttggtgtagtgtggtgcaacttctctgccctcctctacctgctgatcagctctgccagc atgctaaccctcggggtcattgccatcgaccgtgcagcccgaacagggaacaccctcact gcccacagtccaggactgaaaggagagcagggggaaggaagacagccagagaatacaggc agtgacaccggggagaggccgtgtgcatttggaaactcggctgctcccgccctcaacgta gctgatgcacggggagggtcccgcattatgctgtcagatttagttgtacaagtcttgtgg gctttggcagcgagtttcccacaaatgagtgcatttattagaattttgctaatggccact gcgttcatcacaggcacaggtgatggtgagcattttctcttattttcaaatatccttgaa gaaggtgatttttcatggttacgaggtattccatcctgtggagttagctcagctaaccct catggtaaccttgttagccccgattttgcagatgagcaaagtgaggtttttgaggcctta agtaacttgcccaaggtcacgtggctgggaagtaactctcccagttctgagatgcccgag cctggacgctttgtcattgtacaccatcaactcagtgctgccagtcattccagcagccag ctagcgtag >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_5|134_aa MGGRRCVPGTLPMRAAPPGAKRLHVPLRAKGVGRSGHAPRLESVRTRSAGGRFGWFLERG ELDQPSAPSGPSLTPVQPLDKGIWDENVCWLHCHNIKNRDPSLRGPLDILELIQGRNDIR TVLFVLQPGALCVL >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_5|405_bp atgggagggcgtaggtgcgtgcccgggactctgccgatgcgggccgcgccccctggggcc aagagactccacgtcccgctcagggcaaagggtgtggggagaagcggtcacgcccctcgg ctggaatccgtgaggaccagatctgcgggagggagattcggctggttcttggaaagggga gaactcgaccagccctcggcgccttctgggccaagcttaacccccgtgcaacctttggac aaaggcatatgggatgagaacgtctgttggctccactgccataacatcaagaatagagac ccctcattaagaggcccgcttgacattttggaactaatccaaggccgcaacgatatccga accgttctctttgtgttgcagcctggtgcactgtgcgtcctctga >gi568815597r:167985534_168204850|GENSCAN_predicted_peptide_6|368_aa MGLQRPAASMLPYSTSNQPAASTLPYSTSNQPAASTLPYSTSNQPAASTLPYSSSNQPAA STLPYSTSNQPAASTLPYSTSNQPAASTLPYSTSNQPAASTLPYSTSNQPAASTLPYSTS NQPAASTLPCSTSSHHSLFLVASAGPASAMMIHGFQSSHRDFCFGPWKLTASKTHIMKSA DVEKLADELHMPSLPEMMFGDNVLRIQHGSGFGIEFNATDALRCVNNYQGMLKVACAEEW QESRTEGEHSKEVIKPYDWTYTTDYKGTLLGESLKLKVVPTTDHIDTEKLKAREQIKFFE EVLLFEDELHDHGVSSLSVKIHVPPSLFTEPNEISQYLPIKEAVCEKLIFPERIDPNPAD SQKSTQVE >gi568815597r:167985534_168204850|GENSCAN_predicted_CDS_6|1107_bp atgggactgcagaggccagctgcctctatgctgccctattcgacttccaatcagccagct gcctctacactgccctattcgacttccaatcagccagctgcgtctacactgccctattcg acttccaatcagccagctgcctctacactgccctattcgtcttccaatcagccagctgcg tctacactgccctattcgacttccaatcagccagctgcctctacactgccctattcgact tccaatcagccagctgcgtctacactgccctattcgacttccaatcagccagctgcctct acactgccctattcgacttccaatcagccagctgcgtctacactgccctattcgacttcc aatcagccagctgcgtctacactgccctgttcaacttccagtcaccacagcttattcctt gtggcctctgcgggtcctgcctcagccatgatgatccacggcttccagagcagccaccgg gatttctgcttcgggccctggaagctgacggcgtccaagacccacatcatgaagtcggcg gatgtggagaaattagccgatgaattacatatgccatctctccctgaaatgatgtttgga gacaacgttttaagaatccagcatgggtctggctttggaattgagttcaatgctacagat gcgttaagatgtgtaaacaactaccaaggaatgcttaaagtggcctgtgctgaagagtgg caagaaagcaggacggagggtgaacactccaaagaggttattaaaccatatgattggacc tatacaacagattataagggaaccttacttggagaatctcttaagttaaaggttgtacct acaacagatcatatagatacagaaaaattgaaagccagagaacagattaagttttttgaa gaagttctcctttttgaggatgaacttcatgatcatggagtttcaagcctgagtgtgaag attcatgttccaccttccctcttcacggaacctaatgaaatatcccagtatttaccaata aaggaagcagtttgtgagaagctaatatttccagaaagaattgatcctaacccagcagac tcacaaaaaagtacacaagtggaataa