GENSCAN 1.0	Date run: 5-Nov-116	Time: 06:13:56

Sequence gi568815595f:23817860_24019498 : 201639 bp : 41.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1832   1962  131  1  2   60   31   125 0.475   2.57
 1.02 Intr +   4125   4252  128  0  2   82   73    94 0.404   6.70
 1.03 Intr +  24225  24294   70  1  1   60  110    30 0.007  -0.28
 1.04 Term +  42259  42382  124  1  1   19   46   135 0.017  -0.72
 1.05 PlyA +  42912  42917    6                               1.05

 2.00 Prom +  43712  43751   40                              -3.05
 2.01 Init +  43760  44034  275  1  2   43   72   193 0.422   9.69
 2.02 Intr +  55271  55346   76  2  1   64  115    49 0.147   3.80
 2.03 Intr +  60786  60952  167  2  2   26  101   103 0.397   3.24
 2.04 Intr +  61753  61839   87  1  0   93   94    48 0.381   4.07
 2.05 Intr +  69708  69840  133  0  1   53  116   121 0.748  11.13
 2.06 Intr +  76307  76536  230  1  2    8   72   154 0.286   1.54
 2.07 Term +  76979  77177  199  2  1   53   41   128 0.681   0.49
 2.08 PlyA +  78462  78467    6                               1.05

 3.04 PlyA -  79957  79952    6                               1.05
 3.03 Term -  83190  82910  281  1  2   36   37   265 0.767  10.82
 3.02 Intr -  85114  85027   88  1  1   47   36   122 0.512   1.52
 3.01 Init -  89820  89758   63  0  0   69   81    72 0.610   5.80
 3.00 Prom -  96468  96429   40                              -7.15

 4.00 Prom +  98443  98482   40                              -7.45
 4.01 Init + 100001 100172  172  1  1   99  102   209 0.995  23.05
 4.02 Intr + 100581 100717  137  1  2   91   84   171 0.999  16.37
 4.03 Term + 101337 101642  306  2  0   91   34   206 0.998   9.63
 4.04 PlyA + 101672 101677    6                               1.05

 5.03 PlyA - 102018 102013    6                               1.05
 5.02 Term - 108673 108525  149  2  2   64   43   206 0.978  10.78
 5.01 Init - 109695 109629   67  0  1   65   44    26 0.304  -2.81
 5.00 Prom - 112698 112659   40                              -7.55

 6.00 Prom + 115096 115135   40                              -6.65
 6.01 Init + 119102 119227  126  1  0   63   97   132 0.847  11.81
 6.02 Intr + 128346 128623  278  2  2    4   -9   269 0.007   4.19
 6.03 Intr + 136739 136944  206  2  2   88  108    95 0.410   9.42
 6.04 Intr + 138178 138266   89  2  2   96   98    49 0.973   5.47
 6.05 Intr + 141812 141956  145  1  1   82   80    77 0.990   5.23
 6.06 Intr + 144118 144746  629  2  2   50   40   468 0.758  29.20
 6.07 Intr + 147118 147303  186  0  0   48   92    75 0.650   2.86
 6.08 Intr + 149954 150164  211  1  1  113  115   210 0.998  23.76
 6.09 Term + 159364 159560  197  2  2   60   48   114 0.804   1.19
 6.10 PlyA + 159667 159672    6                               1.05

 7.04 PlyA - 160571 160566    6                               1.05
 7.03 Term - 163884 163493  392  1  2   30   37   282 0.872  11.46
 7.02 Intr - 164548 164229  320  0  2   85   97   177 0.628  12.98
 7.01 Init - 171208 171135   74  1  2  115   99    27 0.910   7.17
 7.00 Prom - 177472 177433   40                              -6.85

 8.05 PlyA - 177894 177889    6                               1.05
 8.04 Term - 184306 184154  153  1  0   90   38   104 0.549   2.54
 8.03 Intr - 199076 199015   62  0  2   89   80    86 0.261   5.43
 8.02 Intr - 199499 199355  145  2  1  -49  103   115 0.227  -1.77
 8.01 Intr - 201044 200884  161  0  2   74   93   155 0.355  13.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  99532  99311  222  1  0   71   45   314 0.993  21.33
S.002 Init -  99679  99656   24  1  0   67   79    33 0.811  -0.11
S.003 Term - 133959 133738  222  0  0   60   37   157 0.940   3.73
S.004 Init - 134334 134107  228  0  0   88   63   111 0.918   7.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_1|150_aa
KIVEIAGEIAVTFYSCPLLAVFLSWHPSSFGNFDLPAITEKLWTKERHHEKSAQDEDSGI
WHLEVYTRKSQQRRQKENQESVVLESSESKCGVTFGLTIMGSGISDVRIRLKFRAQPFAA
SLLSQLSVLGFWVLSPWLDEDERSDPLRFL

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_1|453_bp
aaaatagtggaaattgctggggaaattgctgtcacattttacagctgtccattgctggca
gtctttctgagttggcacccaagcagttttggcaattttgaccttcctgcaatcacagag
aagctctggacaaaggaaaggcatcatgagaagagtgcccaggatgaagactcaggcatc
tggcatttagaagtgtacactaggaaaagccagcaaaggagacagaaggaaaatcaagag
agtgtggtattagaaagcagtgaaagtaaatgtggtgtaacttttggtttgactataatg
ggcagtggaatctcagatgttaggatcagactgaaattcagagctcagccctttgctgcc
agcttgctctcacagctgtctgtccttggtttctgggtgctttcaccttggctagatgaa
gatgaacgttctgacccgctcaggtttctgtag

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_2|388_aa
MPRETHWRRNTQVDGRREECTGVGAHWDASRPSTGGMTQSLAGAVGEELGHQAARLQGKT
ISLLAAPSAESCFHSIKPRTHSPSPHVILFFWLWPSHTWCRFRKISQYKCEKMGVKMAHA
NPIVNCSCEGSRLRAPYENLMPDDLLLSPITPTWDCLVAGKQARGLPTDSTLCVCPNIWG
GALIPVRGHQGGLGRGGGQGTSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITFTPEY
PFKPPKTLRVTEYSSQSSGRRRDPKPDFQIVPVKITQDDVTVDNGQQELSKASLRSSSEL
IRSWNHYCLPSCLYIELELPVLQPAPQLLLITELTHGQPALAVTLVIILSDFTGHVDAAS
SNLVSYSLTSPMVSSSLPQPFTTPVVIS

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_2|1167_bp
atgccgagagaaacacattggcgaaggaatacacaggtggatggacgtcgagaggaatgc
actggtgtaggagcacactgggatgccagcaggccatcgactggtggaatgacacaaagt
ttggctggggcagttggagaagagttgggccaccaagcggccagactccaggggaaaacc
atttcccttctggctgccccatctgctgagagctgcttccactcaataaaacctcgcact
cattctccaagcccacatgtgatcctattcttctggctgtggccctctcacacctggtgt
agattcaggaaaatctcacagtacaaatgtgagaaaatgggagtaaaaatggcacacgca
aaccctattgtgaactgctcatgtgagggatctaggttgcgtgctccttatgagaatcta
atgcctgatgatctgctactgtcacccatcacccccacatgggactgtctagttgcagga
aaacaagctcgggggctccccactgattctacattatgcgtctgtcctaacatctgggga
ggtgctctgatacctgtccgtgggcaccagggaggcctgggccgcggcggaggacagggc
accagtgctggtcccaaaggcgataacatctatgaatggagatcaaccattctagggcct
ccaggatccgtgtatgagggtggtgtattctttctcgatatcacttttacaccagaatat
cccttcaagcctccaaagacactgcgcgtgacagaatacagcagtcaaagctcggggagg
agaagagaccccaaaccagattttcagatagtgcccgtaaagattacacaagacgatgtg
acagtagacaatgggcagcaagaactgtcaaaggcctctctgaggagcagcagtgaactc
atcagatcttggaaccactactgtcttccttcctgtctgtacattgagcttgagctccct
gtgctccagccggccccccaactgctcctcatcacagagctgactcatggtcaacctgct
ctcgccgttactctggtcataattcttagtgatttcactggccacgtagatgctgcttcc
agtaacctggtctcttattctttgacttctccaatggtctcatcatccttacctcagcca
ttcaccactcccgtggtcatatcctag

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_3|143_aa
MVKRGSNGNYTFSATDSICTKILQIPLDGDKELAASVLGAERESRRTSTFRMEDCETMED
VYMASVETDRGVKEQLHLYDTRGLQEGVELPKHYFSFADGFVLVYSVNNLESFQRVELLK
KEIDKFKDKKEASGYVKNAKCEL

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_3|432_bp
atggtgaagagaggatctaatggcaactatacgtttagtgccactgactcgatttgcact
aagattctccagattcccctggatggggataaagagctggctgccagtgttttgggagca
gaaagggaaagccggaggacttcaacattcagaatggaagattgcgaaacaatggaagat
gtatacatggcttcagtagaaacagaccgaggagtaaaagaacagttacatctttatgac
accagaggtctacaggaaggcgtggagctgccaaagcattatttttcatttgctgatggc
ttcgttcttgtgtacagtgtgaataaccttgaatcctttcaaagagtggagcttctgaag
aaagaaatcgataagttcaaagacaaaaaagaggcaagtggatatgttaaaaatgccaaa
tgtgaattataa

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_4|204_aa
MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTRPDKARRLGYKAKQGYV
IYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVAEERAGRHCGALRVLNSYW
VGEDSTYKFFEVILIDPFHKAIRRNPDTQWITKPVHKHREMRGLTSAGRKSRGLGKGHKF
HHTIGGSRRAAWRRRNTLQLHRYR

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_4|615_bp
atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgatgtcatgcgc
tttcttctgagggtccgctgctggcagtaccgccagctctctgctctccacagggctccc
cgccccacccggcctgataaagcgcgccgactgggctacaaggccaagcaaggttacgtt
atatataggattcgtgttcgccgtggtggccgaaaacgcccagttcctaagggtgcaact
tacggcaagcctgtccatcatggtgttaaccagctaaagtttgctcgaagccttcagtcc
gttgcagaggagcgagctggacgccactgtggggctctgagagtcctgaattcttactgg
gttggtgaagattccacatacaaattttttgaggttatcctcattgatccattccataaa
gctatcagaagaaatcctgacacccagtggatcaccaaaccagtccacaagcacagggag
atgcgtgggctgacatctgcaggccgaaagagccgtggccttggaaagggccacaagttc
caccacactattggtggctctcgccgggcagcttggagaaggcgcaatactctccagctc
caccgttaccgctaa

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_5|71_aa
MPIKSNAFISFGGVSCRLPGFQKNDTKKKKRKKRKRKEKERKRKKRKKREEEDEEQEEEE
EGGGGRRKKKK

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_5|216_bp
atgccaattaaatcaaatgctttcatcagttttggaggagtgagttgccgtttgccagga
tttcagaagaatgacacaaaaaaaaagaagaggaagaagaggaagaggaaagagaaggag
agaaagaggaagaagagaaaaaagagagaagaggaggacgaggagcaagaggaggaggag
gaaggaggaggaggaagaagaaagaagaagaaataa

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_6|688_aa
MRCTWRIHQWRRAPDKLKLSEELIVEKGPTWVSKSPLSIGDKRALTPQCTGRRTLFSRGD
PKARTPRKPNEPAPGEAGSCIPCRFRGGSAGRGRAAGRRRRLGEGVGLSRRDTLEGALRC
FDPERLPADWVAPPLEGSENSFQSSSSSVPSSPNSSNSDTNGNPKNGDLANIEGILKNDR
IDCSMKTSKSSAPGMTKSHSGVTKFSGMVLLCKVCGDVASGFHYGVHACEGCKGFFRRSI
QQNIQYKKCLKNENCSIMRMNRNRCQQCRFKKCLSVGMSRDAVRFGRIPKREKQRMLIEM
QSAMKTMMNSQFSGHLQNDTLVEHHEQTALPAQEQLRPKPQLEQENIKSSSPPSSDFAKE
EVIGMVTRAHKDTFMYNQEQQENSAESMQPQRGERIPKNMEQYNLNHDHCGNGLSSHFPC
SESQQHLNGQFKGRNIMHYPNGHAICIANGHCMNFSNAYTQRVCDRVPIDGFSQNENKNS
YLCNTGGRMHLVCPMSKSPYVDPHKSGHEIWEEFSMSFTPAVKEVVEFAKRIPGFRDLSQ
HDQVNLLKAGTFEVLMVRFASLFDAKERTVTFLSGKKYSVDDLHSMGAGDLLNSMFEFSE
KLNALQLSDEEMSLFTAVVLVSADRSGIENVNSVEALQETLIRALRTLIMKNHPNEASIF
TKLLLKLPDLRSLNNMHSEELLAFKVHP

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_6|2067_bp
atgcgctgcacttggagaattcaccagtggaggagagctcctgataaactgaagctgagt
gaagagttgattgttgaaaagggacccacctgggtctctaagtctcctctaagtattggc
gacaagcgggcgctgacaccgcagtgcaccggacgccgcacgctcttttcgcgaggtgac
cccaaggcgcggaccccgcgcaaaccaaacgaaccggcgcctggggaggctggtagctgc
ataccttgcagattccgaggaggaagtgcaggacgagggcgtgctgcaggccggaggagg
cgcctcggggaaggcgtggggctttcccgaagggatacgctcgaaggagctctgaggtgc
ttcgatcccgagcgactccccgcagactgggtagcaccgccccttgagggttctgagaat
agtttccagtcctcctcctcttctgttccatcttctccaaatagctctaattctgatacc
aatggtaatcccaagaatggtgatctcgccaatattgaaggcatcttgaagaatgatcga
atagattgttctatgaaaacaagcaaatcgagtgcacctgggatgacaaaaagtcatagt
ggtgtgacaaaatttagtggcatggttctactgtgtaaagtctgtggggatgtggcgtca
ggattccactatggagttcatgcttgcgaaggctgtaagggtttctttcggagaagtatt
caacaaaacatccagtacaagaagtgcctgaagaatgaaaactgttctataatgagaatg
aataggaacagatgtcagcaatgtcgcttcaaaaagtgtctgtctgttggaatgtcaaga
gatgctgttcggtttggtcgtattcctaagcgtgaaaaacagaggatgctaattgaaatg
caaagtgcaatgaagaccatgatgaacagccagttcagtggtcacttgcaaaatgacaca
ttagtagaacatcatgaacagacagccttgccagcccaggaacagctgcgacccaagccc
caactggagcaagaaaacatcaaaagctcttctcctccatcttctgattttgcaaaggaa
gaagtgattggcatggtgaccagagctcacaaggatacctttatgtataatcaagagcag
caagaaaactcagctgagagcatgcagccccagagaggagaacggattcccaagaacatg
gagcaatataatttaaatcatgatcattgcggcaatgggcttagcagccattttccctgt
agtgagagccagcagcatctcaatggacagttcaaagggaggaatataatgcattaccca
aatggtcatgccatttgtattgcaaatggacattgtatgaacttctccaatgcttatact
caaagagtatgtgatagagttccgatagatggattttctcagaatgagaacaagaatagt
tacctgtgcaacactggaggaagaatgcatctggtttgtccaatgagtaagtctccatat
gtggatcctcataaatcaggacatgaaatctgggaagaattttcgatgagcttcactcca
gcagtgaaagaagtggtggaatttgcaaagcgtattcctgggttcagagatctctctcag
catgaccaggtcaaccttttaaaggctgggacttttgaggttttaatggtacggttcgca
tcattatttgatgcaaaggaacgtactgtcacctttttaagtggaaagaaatatagtgtg
gatgatttacactcaatgggagcaggggatctgctaaactctatgtttgaatttagtgag
aagctaaatgccctccaacttagtgatgaagagatgagtttgtttacagctgttgtcctg
gtatctgcagatcgatctggaatagaaaacgtcaactctgtggaggctttgcaggaaact
ctcattcgtgcactaaggaccttaataatgaaaaaccatccaaatgaggcctctattttt
acaaaactgcttctaaagttgccagatcttcgatctttaaacaacatgcactctgaggag
ctcttggcctttaaagttcacccttaa

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_7|261_aa
MANLKLPMRSQLAHKIPETLAFMSWLRSACSPCYWHSLRPWSKVGAKSWGHEQQQETDGF
LGRRRRVPSEAPPSGYRGPECWQLSRQPCRPEWKLVVPFPGPPMAARGPISMHFLLSEAH
KIPRLSQSWGKAPLHLTLHSSVYLILPGCRTRTWDPLNGKAKSCNTNRIETCPLLTTLWV
KERKAAASPSGTSHLGTPQAKAVIPSLEPCGSWHLQPSGHHCIPRCQLGKLLMVHLVQLQ
PRREPAPGDVYPMAAADVSAQ

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_7|786_bp
atggccaatctcaagctaccaatgcgaagtcaactggctcacaaaattcctgaaacattg
gctttcatgagctggctcagaagtgcctgttccccctgctattggcactcactccgacct
tggagcaaagttggggccaagtcctggggtcatgaacagcagcaagagacagacgggttc
ctgggcagaaggaggcgggtccccagtgaggccccaccttcaggctacagagggcctgaa
tgctggcaactgagccgccagccctgcagaccagagtggaaacttgtggtgccttttcca
ggcccacccatggctgcccgtggaccaatcagcatgcacttcctcctctccgaggcccat
aaaatccctaggctgagtcagagctggggaaaagctcctcttcatcttaccctccactca
tctgtgtacctcattcttcctggttgcaggacaagaacttgggacccactgaatggcaaa
gctaaaagttgtaacacaaataggattgaaacatgccccttgctcaccacgctgtgggtg
aaggagagaaaagctgcagccagcccttcagggacgtcacacctgggaacgccccaagcc
aaggctgtgattccctctttggagccctgtggttcctggcatcttcagccttccggccac
cactgcattcccaggtgccagctgggaaagctgctcatggtgcacctggtccagctgcag
cctcgcagagagcctgcacctggagatgtctatcccatggcagcagctgatgtgtctgca
cagtag

>gi568815595f:23817860_24019498|GENSCAN_predicted_peptide_8|173_aa
XVGSRLVSLSTACRANRVEQDQQARAKLKQRRRQPQRFLAGEQHPEDFVIAGVKRAPLES
TPMKEREERIRQKGIQAFDVGPAASVNPMVSSGAKWPIRIMPGKECSNLRESVRGKLQIL
EIQTWAGSYPVAFSDSQAFGFRLNYAAGFPGSPAFRQHITGLLGLQNCVTQFP

>gi568815595f:23817860_24019498|GENSCAN_predicted_CDS_8|522_bp
ngcgtgggatccaggctggtatctctgagcacagcctgcagggccaatcgagtggaacaa
gaccagcaggcgcgagcaaaactcaagcagaggcgccgccagccacagaggtttctggct
ggtgaacaacacccagaggattttgtgatagcaggagtaaagagagcacccttggaatcc
acacctatgaaagagagagaggaaaggatcaggcagaagggaatccaagcctttgatgta
ggtccagcagcctcagttaacccaatggtgagctctggagccaaatggcccatccgaatt
atgccagggaaagaatgttctaacctccgggaaagtgtacgagggaagctccagattctg
gaaattcagacttgggctgggagttacccagttgccttctctgattctcaggcctttgga
ttcagactgaactatgctgctggctttcctggttctccagctttcaggcagcatatcacg
ggacttcttggcctccaaaactgtgtgacccaattcccataa