GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:41:02

Sequence gi568815597f:110351217_110556348 : 205132 bp : 46.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -  14124  14086   39  0  0   90   81    48 0.179   2.72
 1.05 Intr -  25945  25734  212  2  2   48  105   182 0.992  14.53
 1.04 Intr -  28140  27637  504  1  0   81  110    89 0.687   3.35
 1.03 Intr -  29927  29766  162  0  0   90   98    48 0.969   5.95
 1.02 Intr -  30579  30436  144  1  0   82  117    17 0.940   4.25
 1.01 Init -  31729  31618  112  1  1   59   75    63 0.473   2.47
 1.00 Prom -  34598  34559   40                              -6.36

 2.00 Prom +  38869  38908   40                              -3.06
 2.01 Init +  45232  45307   76  0  1   98   68    96 0.602   8.03
 2.02 Term +  45349  45455  107  2  2   90   37    51 0.535  -1.23
 2.03 PlyA +  45935  45940    6                               1.05

 3.05 PlyA -  46193  46188    6                               1.05
 3.04 Term -  50367  50307   61  2  1   98   44    63 0.620   0.18
 3.03 Intr -  52820  52703  118  0  1   96   92    49 0.672   5.62
 3.02 Intr -  55163  55102   62  1  2   45   81    42 0.512  -2.42
 3.01 Init -  56650  56370  281  1  2  104   94    71 0.604   5.79
 3.00 Prom -  85725  85686   40                              -3.86

 4.00 Prom +  99550  99589   40                              -4.26
 4.01 Init + 100001 100072   72  1  0   78   98    37 0.630   3.32
 4.02 Intr + 102745 102870  126  0  0  130   73   146 0.991  18.18
 4.03 Term + 105016 105135  120  0  0  107   48   123 0.996   8.67
 4.04 PlyA + 106120 106125    6                               1.05

 5.06 PlyA - 106829 106824    6                              -0.45
 5.05 Term - 107478 107173  306  0  0   66   43   131 0.484   1.52
 5.04 Intr - 112507 112345  163  0  1   81   99     2 0.203   0.58
 5.03 Intr - 113152 112984  169  2  1   64   69   137 0.489   8.40
 5.02 Intr - 115153 115055   99  2  0   54  109    78 0.715   6.58
 5.01 Init - 117478 117421   58  1  1   84   95    53 0.984   7.07
 5.00 Prom - 125407 125368   40                              -1.96

 6.00 Prom + 129489 129528   40                              -3.96
 6.01 Init + 129550 129611   62  0  2   72  109    33 0.984   4.68
 6.02 Intr + 131559 131709  151  0  1   92   76   152 0.995  14.56
 6.03 Intr + 132698 132815  118  1  1   97   72   144 0.976  13.74
 6.04 Intr + 133490 133728  239  0  2   -2  103   151 0.434   4.93
 6.05 Intr + 135640 135794  155  0  2   99   12   217 0.905  14.07
 6.06 Intr + 137286 137418  133  0  1   42   94   100 0.198   6.65
 6.07 Intr + 138158 138355  198  1  0  -85   78   317 0.837  12.95
 6.08 Intr + 138956 139054   99  1  0   19   80   189 0.999  11.51
 6.09 Term + 139818 139955  138  2  0  107   38   132 0.886   8.06
 6.10 PlyA + 140034 140039    6                               1.05

 7.04 PlyA - 142332 142327    6                              -0.45
 7.03 Term - 143024 142893  132  2  0   98   35    73 0.137   1.09
 7.02 Intr - 150599 150405  195  2  0   78   47    94 0.206   3.91
 7.01 Init - 153211 153110  102  1  0   64   50    56 0.085  -0.36
 7.00 Prom - 158985 158946   40                              -4.16

 8.07 PlyA - 160116 160111    6                               1.05
 8.06 Term - 167572 166036 1537  0  1   99   43  1872 0.068 173.75
 8.05 Intr - 179109 178990  120  2  0   76   47    73 0.024   1.61
 8.04 Intr - 182240 182180   61  0  1   85   64    67 0.032   1.79
 8.03 Intr - 197675 197601   75  0  0   26   95   127 0.405   6.69
 8.02 Intr - 198218 198140   79  2  1   45   75    49 0.163  -1.58
 8.01 Intr - 200009 199956   54  2  0   97   75    30 0.056   1.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_1|391_aa
MTKTFAIFFVVFQEEFEGTSEQIGWIGSIMSSLRFCAGPLVAIICDILGEKTTSILGAFV
VTGGYLISSWATSIPFLCVTMGLLPGLGSAFLYQVAAVVTTKYFKKRLALSTAIARSGMG
LTFLLAPFTKFLIDLYDWTGALILFGAIALNLVPSSMLLRPIHIKSENNSGIKDKGSSLS
AHGPEAHATETHCHETEESTIKDSTTQKAGLPSKNLTVSQNQSEEFYNGPNRNRLLLKSD
EESDKVISWSCKQLFDISLFRNPFFYIFTWSFLLSQLAYFIPTFHLVARAKTLGIDIMDA
SYLVSVAGILETVSQIISGWVADQNWIKKYHYHKSYLILCGITNLLAPLATTFPLLMTYT
ICFAIFAGGYLALILPVLNKEEMKQADQKKK
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_1|1173_bp
atgaccaagacttttgcaattttctttgtggtctttcaagaagagtttgaaggcacctca
gagcaaattggttggattggatccatcatgtcatctcttcgtttttgtgcaggtcccctg
gttgctattatttgtgacatacttggagagaaaactacctccattcttggggctttcgtt
gttactggtggatatctgatcagcagctgggccacaagtattccttttctttgtgtgact
atgggacttctacccggtttgggttctgctttcttataccaagtggctgctgtggtaact
accaaatacttcaaaaaacgattggctctttctacagctattgcccgttctgggatggga
ctgacttttcttttggcaccctttacaaaattcctgatagatctgtatgactggacagga
gcccttatattatttggagctatcgcattgaatttggtgccttctagtatgctcttaaga
cccatccatatcaaaagtgagaacaattctggtattaaagataaaggcagcagtttgtct
gcacatggtccagaggcacatgcaacagaaacacactgccatgagacagaagagtctacc
atcaaggacagtactacgcagaaggctggactacctagcaaaaatttaacagtctcacaa
aatcaaagtgaagagttctacaatgggcctaacaggaacagactgttattaaagagtgat
gaagaaagtgataaggttatttcgtggagctgcaaacaactgtttgacatttctctcttt
agaaatcctttcttctacatatttacttggtcttttctcctcagtcagttagcatacttc
atccctacctttcacctggtagccagagccaaaacactggggattgacatcatggatgcc
tcttaccttgtttctgtagcaggtatccttgagacggtcagtcagattatttctggatgg
gttgctgatcaaaactggattaagaagtatcattaccacaagtcttacctcatcctctgc
ggcatcactaacctgcttgctcctttagccaccacatttccactacttatgacctacacc
atctgctttgccatctttgctggtggttacctggcattgatactgcctgtactgaacaaa
gaggagatgaaacaagcagaccagaagaaaaag
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_2|60_aa
MAGCRFQALPCGEAAKAWREIERSAAAGLGAKPLTARGRQGWRASLSECGARQAHTHPEL
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_2|183_bp
atggcgggctgcaggttccaagccctgccctgcggggaggcagctaaggcctggcgagaa
atcgagcgcagcgccgctgctggcctgggtgctaagcccctcactgcccggggccggcag
ggctggcgggccagcctctccgagtgcggggcccgccaagcccacacccacccggaactc
tag
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_3|173_aa
MEPGAGHLDGHRAGSPSLRQALCDGSAVMFSSKERGRCTVINFVPLEAPLRSTPRSRQVT
EACGGEGRAVPLGSEPEWSVGGMEATLEQHLEDTMKNPSIVGVLCTDSQGLNLGCRGTLS
DEHAGVISVLAQQAAKLTSDPTDIPVVCLESDNGNIMIQKHDGITVAVHKMAS
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_3|522_bp
atggagccaggtgcaggtcacctcgacggtcaccgcgcggggagcccaagccttcgtcag
gctctgtgcgacggaagcgcagtgatgttttccagtaaagaacgcggacgttgcaccgtg
atcaattttgtccctttggaggcgccgttacggtccacgccccgctcgcgtcaagtgact
gaggcctgtggtggagaaggacgtgccgtgccgctgggttctgagccggagtggtcggtg
ggtgggatggaggcgaccttggagcagcacttggaagacacaatgaagaatccctccatt
gttggagtcctgtgcacagattcacaaggacttaatctgggttgccgcgggaccctgtca
gatgagcatgctggagtgatatctgttctagcccagcaagcagctaagctaacctctgac
cccactgatattcctgtggtgtgtctagaatcagataatgggaacattatgatccagaaa
cacgatggcatcacggtggcagtgcacaaaatggcctcttga
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_4|105_aa
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC
HPGSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_4|318_bp
atgagaggtgccacgcgagtctcaatcatgctcctcctagtaactgtgtctgactgtgct
gtgatcacaggggcctgtgagcgggatgtccagtgtggggcaggcacctgctgtgccatc
agcctgtggcttcgagggctgcggatgtgcaccccgctggggcgggaaggcgaggagtgc
caccccggcagccacaaggtccccttcttcaggaaacgcaagcaccacacctgtccttgc
ttgcccaacctgctgtgctccaggttcccggacggcaggtaccgctgctccatggacttg
aagaacatcaatttttag
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_5|264_aa
MRLSDIGKGESPDDHRGQAAAKTLACQTSSPEIQIQWLWDAAQDPGYFTSTPASGELKHI
REKIVRSFLDILLVYLTEEIPPGNASVVCWLQHVGAIFIIAIVQMGKLRYTETNMSVTDA
QQSLSSRQGAETGKDLLDKRTINHILPVTWTKWSSQDPSTAPQGGSQFRKGSTKVHQAAV
NGTSRKAMNVKGQAMVKMESWTWKGTLYPGPEPETKGESDSRAARRSQVTTAGDNVRKRR
QKQKQKQTKKGPCGGLAWLKWVLE
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_5|795_bp
atgaggctcagtgacattggcaagggtgaaagcccagatgaccacaggggccaggcagct
gctaaaactctagcttgccagacttcctctccagaaatccagatccagtggctctgggat
gcggcacaggaccctgggtacttcacaagcaccccagcttctggggaattgaagcacatc
cgagaaaagattgttcgttccttcctggacatcttattggtttacctcactgaagagatc
cctccaggtaatgctagtgtggtgtgctggttacaacacgtaggtgctatttttattatt
gccattgtacagatggggaaactgaggtacacagaaacaaatatgtctgtgactgacgcg
cagcagagtctatcgagtaggcagggggctgagacaggaaaggatcttttggacaaaagg
accataaatcatatccttcctgtgacctggaccaagtggtcttcccaagacccaagcact
gcccctcagggagggtcacaatttagaaagggctccacaaaagtccaccaggcagcagtc
aatggtacctcgagaaaagctatgaatgtcaaaggacaagccatggtcaaaatggaatcc
tggacgtggaagggaactttatatccaggacctgagcctgaaaccaagggagaaagcgac
tcaagggctgcaagaaggtcccaagtcaccacagcaggtgacaatgtccgtaaaaggaga
caaaaacaaaaacaaaaacaaacaaagaaaggaccctgtgggggcctggcctggcttaag
tgggtcctggagtaa
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_6|430_aa
MRGLVVFLAVFALSEVNAITRVPLHKGKSLRRALKERRLLEDFLRNHHYAVSRKHSSSGV
VASESLTNYLDCQYFGKIYIGTLPQKFTLVFDTGSPDIWVPSVYCNSDACQRIASIEAIF
LQGLLRGSALAGTVSGEKILRAPLTLGLCFRKPPTLRSVQVLHPQNMGKSLSIQYGTGSM
RGLLGYDTVTDLVEVMILLQVSNIVDPHQTVGLSTQEPGDVFTYSEFDGILGLAYPSLAS
EYALRLGFRNDQGSMLTLRAIDLSYYTGSLHWIPMTARILAVHCGQEGPGEGGLDEAILH
TFGSVIIDGVVVACDGGCQAILDTGTSLLVGPGGNILNIQQAIGATAGQYNEFDIDCGRL
SSIPTAVFEIHGKKYPLPPSAYTSQDQGFCTSGFQGDYSSQQWILGNVFIWEYYSVFDRT
NNRVGLAKAV
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_6|1293_bp
atgaggggccttgtggtattccttgcagtctttgctctctctgaggtcaatgccatcacc
agggttcctctgcacaaagggaagtcgctgaggagggccctgaaggagcgcaggctcctg
gaggacttcctgaggaatcaccattatgcagtcagcaggaagcactccagctctggggtg
gtggccagcgagtctctgaccaactacctggattgtcagtactttgggaagatctacatc
gggacccttccccagaagttcaccttggtgtttgatacaggctccccggatatctgggtg
ccctctgtctactgcaacagtgatgcctgtcagagaattgccagtattgaggccatattc
ttacaaggcctgctgaggggttcggccctggctggcaccgtgagtggtgagaagattcta
agggccccactaactctgggtctgtgtttcagaaaaccaccaacgcttcgatccgtccaa
gtcctccacccacagaacatgggcaagtccctgtccatccagtatggcacaggcagcatg
cggggcttgctgggctatgacactgtcaccgacctagtggaggtcatgattctcttgcag
gtctccaacattgtggacccccaccagactgtgggtctgagcacccaggaacctggcgac
gtcttcacctactccgagtttgatgggatcctggggctggcctatccctctcttgcctct
gagtacgcgctgcgccttggtttcaggaatgaccaggggagcatgctcacgctgagggcc
attgatctgtcgtactacacaggctccctgcactggatacccatgactgcaagaatactg
gcagttcactgtggacaggaaggacctggggagggagggctggatgaggccatcttgcat
acctttggaagtgtcatcattgacggcgtggtggtggcctgtgacggtggctgtcaggcc
atcctggacaccggcacctccctgctggtggggcctggcggcaacatcctcaacatccag
caggccattggagccactgcgggccagtacaatgagtttgacatcgactgcgggcgcctg
agcagcattcccacggctgtcttcgagatccacggcaagaagtaccccctgccaccctcc
gcctataccagccaggaccagggcttctgcaccagtggtttccagggtgactatagttcc
cagcagtggatcctggggaatgtcttcatctgggagtattacagtgtctttgacaggacc
aataaccgtgtggggctggcgaaggctgtctga
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_7|142_aa
MFPGLAAATPQWNPPPPMGIEGTPPQSCPMTVAEQARESTSKTESTGFANQTHCFGVFYL
LQASHKIQSTLKTRGLHKGMLTRKQGLLEAILEGYQPQGVGPLHKQTTDVLVAPSQSLLI
HHLAQPSEGLTTETNVDPVPEP
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_7|429_bp
atgttccctggcttagcagcagcgactcctcagtggaacccaccaccgcccatgggcata
gagggcaccccaccacagagctgtcctatgacggtggcagagcaagcaagagagagtaca
agcaagacagaatccacgggctttgcaaaccaaacgcattgctttggtgtcttctatttg
ttacaagcaagtcacaaaatccagtccacactcaagacgagagggttacacaagggcatg
ctgaccaggaagcagggattactggaggccattttggaaggttatcaaccacagggggta
ggccctcttcataagcaaaccacggatgtgctggtggccccttcccaaagcctcctcatt
caccacctggcacagccctcagaaggcctgaccacagagaccaatgtggaccccgtccca
gaaccataa
>gi568815597f:110351217_110556348|GENSCAN_predicted_peptide_8|641_aa
IPCRGEHISGSPVSHIKQTTGGRVKAKAFKGVQNSSSNVTTIGRGQLVLKALTKAVHEKK
TRPGYCQTPAGTDTTVPTSGSSAALASGASRWAEAHRKLFSARQGPVTEETRKAMPDLIS
QSRKVSYEQGMDVCGWKEMEVALVNFDNSDEIQEEPGYATDFDSTSPKGRPGGSSFSNGK
ILISESTNHETAFSKLPGDYADPPGPEPVVLNEGNQRVIINIAGLRFETQLRTLSQFPET
LLGDREKRMQFFDSMRNEYFFDRNRPSFDGILYYYQSGGKIRRPANVPIDIFADEISFYE
LGSEAMDQFREDEGFIKDPETLLPTNDIHRQFWLLFEYPESSSAARAVAVVSVLVVVISI
TIFCLETLPEFREDRELKVVRDPNLNMSKTVLSQTMFTDPFFMVESTCIVWFTFELVLRF
VVCPSKTDFFRNIMNIIDIISIIPYFATLITELVQETEPSAQQNMSLAILRIIRLVRVFR
IFKLSRHSKGLQILGQTLKASMRELGLLIFFLFIGVILFSSAVYFAEVDEPESHFSSIPD
GFWWAVVTMTTVGYGDMCPTTPGGKIVGTLCAIAGVLTIALPVPVIVSNFNYFYHRETEN
EEKQNIPGEIERILNSVGSRMGSTDSLNKTNGGCSTEKSRK
>gi568815597f:110351217_110556348|GENSCAN_predicted_CDS_8|1926_bp
atcccctgccgaggagagcacatctcaggatcacctgtcagccacatcaagcagacaaca
ggaggcagggtaaaggctaaggccttcaagggggtccagaacagcagcagcaatgtaacg
actatagggagagggcagcttgtgctgaaggctcttaccaaggctgtccacgagaagaag
acccgtccaggctactgtcagaccccagctggaactgacactacagtgcccaccagcggc
agctcagctgccctggcatctggagccagcaggtgggcggaggcccatcgtaaactcttc
agtgccaggcaggggccagtcactgaggagaccaggaaagcgatgcctgacttgatcagc
cagtcacgcaaggtctcctatgagcagggaatggatgtgtgtggctggaaagaaatggag
gttgcgctggtcaattttgataattcagatgaaatccaagaagagccaggctatgccaca
gacttcgactcaaccagcccaaaaggccggcctgggggcagctccttctccaacgggaag
atcctcatcagcgaaagcaccaaccatgagacggccttctccaagcttccgggagactat
gctgaccccccagggcctgagccagtggtcctaaatgaaggaaaccagcgggtgatcatc
aacattgctgggctgagatttgagacccagctcagaacccttagtcagttcccagagact
ctcctgggagaccgggaaaaaaggatgcagttctttgactccatgagaaatgagtatttc
tttgatcggaaccggcccagttttgatggaatcctatattattaccaatctggtgggaaa
attcggcgcccagccaatgttcccattgatatctttgctgatgaaatctccttctatgag
ctgggtagtgaggccatggaccagttccgggaggatgaaggcttcatcaaagaccctgaa
acactgctacccaccaatgacatccaccgtcagttctggctcctctttgagtaccctgaa
agttccagcgctgcccgtgctgtggccgtggtctcggtgttggttgtggtcatctccatc
accatcttctgcctggagacactgccagagttccgggaggatagggagctaaaggtggtc
agagaccccaatctcaacatgagcaagacagtcctctcccagaccatgttcaccgaccct
ttcttcatggtggagtctacctgcatcgtgtggttcaccttcgagctggtgctccggttc
gtggtctgccccagcaagactgacttcttcaggaacatcatgaacatcattgacatcatc
tccattatcccctactttgcaactctcatcacagagctagtccaggagacagagccgagt
gcccaacagaacatgtccctggccatcctgaggatcatccgcctggtgagggtcttccgc
atcttcaagctctcgcgccactccaaggggctgcagatcctcgggcaaacactgaaggcg
tccatgcgggagttggggttgctcatcttctttctcttcattggagtcatcctcttctcc
agtgcagtctactttgctgaggtggatgagccagagtcccatttctctagcattcctgat
ggcttctggtgggcagtggtcaccatgacaactgtaggctatggggacatgtgcccgacc
accccaggggggaagattgtgggcactctgtgtgccattgcaggggtcctcaccattgcc
ctccctgtgcctgtcattgtctccaacttcaattacttctaccaccgggagactgagaat
gaagaaaagcagaacatcccaggagaaattgaaagaatcctcaacagtgtaggctcaaga
atgggcagcacagactctcttaataagaccaatggtggctgttccacagagaagtctagg
aaatga