GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:29:56 Sequence gi568815595r:23793098_24010904 : 217807 bp : 41.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7961 7976 16 1 1 83 94 6 0.553 1.12 1.02 Intr + 12906 12991 86 1 2 100 80 60 0.845 5.02 1.03 Intr + 14140 14324 185 0 2 116 69 252 0.947 23.76 1.04 Intr + 18363 18413 51 0 0 86 109 55 0.185 4.50 1.05 Intr + 22397 22595 199 2 1 106 31 36 0.109 -1.87 1.06 Intr + 26594 26724 131 1 2 60 31 125 0.393 2.57 1.07 Intr + 28887 29014 128 0 2 82 73 94 0.359 6.70 1.08 Intr + 48987 49056 70 1 1 60 110 30 0.006 -0.28 1.09 Term + 67021 67144 124 1 1 19 46 135 0.017 -0.72 1.10 PlyA + 67674 67679 6 1.05 2.00 Prom + 68474 68513 40 -3.05 2.01 Init + 68522 68796 275 1 2 43 72 193 0.422 9.69 2.02 Intr + 80033 80108 76 2 1 64 115 49 0.147 3.80 2.03 Intr + 85548 85714 167 2 2 26 101 103 0.397 3.24 2.04 Intr + 86515 86601 87 1 0 93 94 48 0.381 4.07 2.05 Intr + 94470 94602 133 0 1 53 116 121 0.748 11.13 2.06 Intr + 101069 101298 230 1 2 8 72 154 0.286 1.54 2.07 Term + 101741 101939 199 2 1 53 41 128 0.681 0.49 2.08 PlyA + 103224 103229 6 1.05 3.04 PlyA - 104719 104714 6 1.05 3.03 Term - 107952 107672 281 1 2 36 37 265 0.767 10.82 3.02 Intr - 109876 109789 88 1 1 47 36 122 0.512 1.52 3.01 Init - 114582 114520 63 0 0 69 81 72 0.610 5.80 3.00 Prom - 121230 121191 40 -7.15 4.00 Prom + 123205 123244 40 -7.45 4.01 Init + 124763 124934 172 1 1 99 102 209 0.995 23.05 4.02 Intr + 125343 125479 137 1 2 91 84 171 0.999 16.37 4.03 Term + 126099 126404 306 2 0 91 34 206 0.998 9.63 4.04 PlyA + 126434 126439 6 1.05 5.03 PlyA - 126780 126775 6 1.05 5.02 Term - 133435 133287 149 2 2 64 43 206 0.978 10.78 5.01 Init - 134457 134391 67 0 1 65 44 26 0.304 -2.81 5.00 Prom - 137460 137421 40 -7.55 6.00 Prom + 139858 139897 40 -6.65 6.01 Init + 143864 143989 126 1 0 63 97 132 0.847 11.81 6.02 Intr + 153108 153385 278 2 2 4 -9 269 0.007 4.19 6.03 Intr + 161501 161706 206 2 2 88 108 95 0.410 9.42 6.04 Intr + 162940 163028 89 2 2 96 98 49 0.973 5.47 6.05 Intr + 166574 166718 145 1 1 82 80 77 0.990 5.23 6.06 Intr + 168880 169508 629 2 2 50 40 468 0.758 29.20 6.07 Intr + 171880 172065 186 0 0 48 92 75 0.650 2.86 6.08 Intr + 174716 174926 211 1 1 113 115 210 0.998 23.76 6.09 Term + 184126 184322 197 2 2 60 48 114 0.804 1.19 6.10 PlyA + 184429 184434 6 1.05 7.04 PlyA - 185333 185328 6 1.05 7.03 Term - 188646 188255 392 1 2 30 37 282 0.872 11.46 7.02 Intr - 189310 188991 320 0 2 85 97 177 0.628 12.98 7.01 Init - 195970 195897 74 1 2 115 99 27 0.910 7.17 7.00 Prom - 202234 202195 40 -6.85 8.04 PlyA - 202656 202651 6 1.05 8.03 Term - 210021 209816 206 1 2 29 38 137 0.087 -0.65 8.02 Intr - 211354 211235 120 2 0 128 91 5 0.104 4.35 8.01 Init - 216311 216227 85 2 1 24 68 126 0.525 5.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 124294 124073 222 1 0 71 45 314 0.993 21.33 S.002 Init - 124441 124418 24 1 0 67 79 33 0.811 -0.11 S.003 Term - 158721 158500 222 0 0 60 37 157 0.940 3.73 S.004 Init - 159096 158869 228 0 0 88 63 111 0.918 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_1|329_aa MTQIEARERHEWPGPAAAAAAAAAAARRSQTQREGLFAGWGGGFAMSDDDSRASTSSSSS SSSNQQTEKETNTPKKKESKVSMSKNSKLLSTSAKRIQKELADITLDPPPNCRCSKLALC ILPRSFCEFTLCFSGIPIPPRIGGNGNEMSVQKSQAAHHGSQWKEILFLQQWSVTYLVQK IVEIAGEIAVTFYSCPLLAVFLSWHPSSFGNFDLPAITEKLWTKERHHEKSAQDEDSGIW HLEVYTRKSQQRRQKENQESVVLESSESKCGVTFGLTIMGSGISDVRIRLKFRAQPFAAS LLSQLSVLGFWVLSPWLDEDERSDPLRFL >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_1|990_bp atgacccaaattgaagcgagagagagacacgagtggccaggcccagccgcagccgcagca gcagccgccgcggcggcacggaggagccagacacaaagagaggggctgtttgcggggtgg ggtggggggttcgctatgtcggatgacgattcgagggccagcaccagctcctcctcatct tcgtcttccaaccagcaaaccgagaaagaaacaaacacccccaagaagaaggagagtaaa gtcagcatgagcaaaaactccaaactcctctccaccagcgccaagagaattcagaaggag ctggcggacatcactttagaccctccacctaattgcagatgctccaagttagcactttgt atacttccacgtagtttctgtgaatttacattgtgtttttctggcattccaatacctcct cgaattggtggtaatggtaatgagatgtcagttcagaagtcccaggctgctcatcatggt tcacaatggaaagagatccttttcttgcaacagtggagtgttacatacctcgtccagaaa atagtggaaattgctggggaaattgctgtcacattttacagctgtccattgctggcagtc tttctgagttggcacccaagcagttttggcaattttgaccttcctgcaatcacagagaag ctctggacaaaggaaaggcatcatgagaagagtgcccaggatgaagactcaggcatctgg catttagaagtgtacactaggaaaagccagcaaaggagacagaaggaaaatcaagagagt gtggtattagaaagcagtgaaagtaaatgtggtgtaacttttggtttgactataatgggc agtggaatctcagatgttaggatcagactgaaattcagagctcagccctttgctgccagc ttgctctcacagctgtctgtccttggtttctgggtgctttcaccttggctagatgaagat gaacgttctgacccgctcaggtttctgtag >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_2|388_aa MPRETHWRRNTQVDGRREECTGVGAHWDASRPSTGGMTQSLAGAVGEELGHQAARLQGKT ISLLAAPSAESCFHSIKPRTHSPSPHVILFFWLWPSHTWCRFRKISQYKCEKMGVKMAHA NPIVNCSCEGSRLRAPYENLMPDDLLLSPITPTWDCLVAGKQARGLPTDSTLCVCPNIWG GALIPVRGHQGGLGRGGGQGTSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITFTPEY PFKPPKTLRVTEYSSQSSGRRRDPKPDFQIVPVKITQDDVTVDNGQQELSKASLRSSSEL IRSWNHYCLPSCLYIELELPVLQPAPQLLLITELTHGQPALAVTLVIILSDFTGHVDAAS SNLVSYSLTSPMVSSSLPQPFTTPVVIS >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_2|1167_bp atgccgagagaaacacattggcgaaggaatacacaggtggatggacgtcgagaggaatgc actggtgtaggagcacactgggatgccagcaggccatcgactggtggaatgacacaaagt ttggctggggcagttggagaagagttgggccaccaagcggccagactccaggggaaaacc atttcccttctggctgccccatctgctgagagctgcttccactcaataaaacctcgcact cattctccaagcccacatgtgatcctattcttctggctgtggccctctcacacctggtgt agattcaggaaaatctcacagtacaaatgtgagaaaatgggagtaaaaatggcacacgca aaccctattgtgaactgctcatgtgagggatctaggttgcgtgctccttatgagaatcta atgcctgatgatctgctactgtcacccatcacccccacatgggactgtctagttgcagga aaacaagctcgggggctccccactgattctacattatgcgtctgtcctaacatctgggga ggtgctctgatacctgtccgtgggcaccagggaggcctgggccgcggcggaggacagggc accagtgctggtcccaaaggcgataacatctatgaatggagatcaaccattctagggcct ccaggatccgtgtatgagggtggtgtattctttctcgatatcacttttacaccagaatat cccttcaagcctccaaagacactgcgcgtgacagaatacagcagtcaaagctcggggagg agaagagaccccaaaccagattttcagatagtgcccgtaaagattacacaagacgatgtg acagtagacaatgggcagcaagaactgtcaaaggcctctctgaggagcagcagtgaactc atcagatcttggaaccactactgtcttccttcctgtctgtacattgagcttgagctccct gtgctccagccggccccccaactgctcctcatcacagagctgactcatggtcaacctgct ctcgccgttactctggtcataattcttagtgatttcactggccacgtagatgctgcttcc agtaacctggtctcttattctttgacttctccaatggtctcatcatccttacctcagcca ttcaccactcccgtggtcatatcctag >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_3|143_aa MVKRGSNGNYTFSATDSICTKILQIPLDGDKELAASVLGAERESRRTSTFRMEDCETMED VYMASVETDRGVKEQLHLYDTRGLQEGVELPKHYFSFADGFVLVYSVNNLESFQRVELLK KEIDKFKDKKEASGYVKNAKCEL >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_3|432_bp atggtgaagagaggatctaatggcaactatacgtttagtgccactgactcgatttgcact aagattctccagattcccctggatggggataaagagctggctgccagtgttttgggagca gaaagggaaagccggaggacttcaacattcagaatggaagattgcgaaacaatggaagat gtatacatggcttcagtagaaacagaccgaggagtaaaagaacagttacatctttatgac accagaggtctacaggaaggcgtggagctgccaaagcattatttttcatttgctgatggc ttcgttcttgtgtacagtgtgaataaccttgaatcctttcaaagagtggagcttctgaag aaagaaatcgataagttcaaagacaaaaaagaggcaagtggatatgttaaaaatgccaaa tgtgaattataa >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_4|204_aa MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTRPDKARRLGYKAKQGYV IYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVAEERAGRHCGALRVLNSYW VGEDSTYKFFEVILIDPFHKAIRRNPDTQWITKPVHKHREMRGLTSAGRKSRGLGKGHKF HHTIGGSRRAAWRRRNTLQLHRYR >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_4|615_bp atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgatgtcatgcgc tttcttctgagggtccgctgctggcagtaccgccagctctctgctctccacagggctccc cgccccacccggcctgataaagcgcgccgactgggctacaaggccaagcaaggttacgtt atatataggattcgtgttcgccgtggtggccgaaaacgcccagttcctaagggtgcaact tacggcaagcctgtccatcatggtgttaaccagctaaagtttgctcgaagccttcagtcc gttgcagaggagcgagctggacgccactgtggggctctgagagtcctgaattcttactgg gttggtgaagattccacatacaaattttttgaggttatcctcattgatccattccataaa gctatcagaagaaatcctgacacccagtggatcaccaaaccagtccacaagcacagggag atgcgtgggctgacatctgcaggccgaaagagccgtggccttggaaagggccacaagttc caccacactattggtggctctcgccgggcagcttggagaaggcgcaatactctccagctc caccgttaccgctaa >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_5|71_aa MPIKSNAFISFGGVSCRLPGFQKNDTKKKKRKKRKRKEKERKRKKRKKREEEDEEQEEEE EGGGGRRKKKK >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_5|216_bp atgccaattaaatcaaatgctttcatcagttttggaggagtgagttgccgtttgccagga tttcagaagaatgacacaaaaaaaaagaagaggaagaagaggaagaggaaagagaaggag agaaagaggaagaagagaaaaaagagagaagaggaggacgaggagcaagaggaggaggag gaaggaggaggaggaagaagaaagaagaagaaataa >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_6|688_aa MRCTWRIHQWRRAPDKLKLSEELIVEKGPTWVSKSPLSIGDKRALTPQCTGRRTLFSRGD PKARTPRKPNEPAPGEAGSCIPCRFRGGSAGRGRAAGRRRRLGEGVGLSRRDTLEGALRC FDPERLPADWVAPPLEGSENSFQSSSSSVPSSPNSSNSDTNGNPKNGDLANIEGILKNDR IDCSMKTSKSSAPGMTKSHSGVTKFSGMVLLCKVCGDVASGFHYGVHACEGCKGFFRRSI QQNIQYKKCLKNENCSIMRMNRNRCQQCRFKKCLSVGMSRDAVRFGRIPKREKQRMLIEM QSAMKTMMNSQFSGHLQNDTLVEHHEQTALPAQEQLRPKPQLEQENIKSSSPPSSDFAKE EVIGMVTRAHKDTFMYNQEQQENSAESMQPQRGERIPKNMEQYNLNHDHCGNGLSSHFPC SESQQHLNGQFKGRNIMHYPNGHAICIANGHCMNFSNAYTQRVCDRVPIDGFSQNENKNS YLCNTGGRMHLVCPMSKSPYVDPHKSGHEIWEEFSMSFTPAVKEVVEFAKRIPGFRDLSQ HDQVNLLKAGTFEVLMVRFASLFDAKERTVTFLSGKKYSVDDLHSMGAGDLLNSMFEFSE KLNALQLSDEEMSLFTAVVLVSADRSGIENVNSVEALQETLIRALRTLIMKNHPNEASIF TKLLLKLPDLRSLNNMHSEELLAFKVHP >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_6|2067_bp atgcgctgcacttggagaattcaccagtggaggagagctcctgataaactgaagctgagt gaagagttgattgttgaaaagggacccacctgggtctctaagtctcctctaagtattggc gacaagcgggcgctgacaccgcagtgcaccggacgccgcacgctcttttcgcgaggtgac cccaaggcgcggaccccgcgcaaaccaaacgaaccggcgcctggggaggctggtagctgc ataccttgcagattccgaggaggaagtgcaggacgagggcgtgctgcaggccggaggagg cgcctcggggaaggcgtggggctttcccgaagggatacgctcgaaggagctctgaggtgc ttcgatcccgagcgactccccgcagactgggtagcaccgccccttgagggttctgagaat agtttccagtcctcctcctcttctgttccatcttctccaaatagctctaattctgatacc aatggtaatcccaagaatggtgatctcgccaatattgaaggcatcttgaagaatgatcga atagattgttctatgaaaacaagcaaatcgagtgcacctgggatgacaaaaagtcatagt ggtgtgacaaaatttagtggcatggttctactgtgtaaagtctgtggggatgtggcgtca ggattccactatggagttcatgcttgcgaaggctgtaagggtttctttcggagaagtatt caacaaaacatccagtacaagaagtgcctgaagaatgaaaactgttctataatgagaatg aataggaacagatgtcagcaatgtcgcttcaaaaagtgtctgtctgttggaatgtcaaga gatgctgttcggtttggtcgtattcctaagcgtgaaaaacagaggatgctaattgaaatg caaagtgcaatgaagaccatgatgaacagccagttcagtggtcacttgcaaaatgacaca ttagtagaacatcatgaacagacagccttgccagcccaggaacagctgcgacccaagccc caactggagcaagaaaacatcaaaagctcttctcctccatcttctgattttgcaaaggaa gaagtgattggcatggtgaccagagctcacaaggatacctttatgtataatcaagagcag caagaaaactcagctgagagcatgcagccccagagaggagaacggattcccaagaacatg gagcaatataatttaaatcatgatcattgcggcaatgggcttagcagccattttccctgt agtgagagccagcagcatctcaatggacagttcaaagggaggaatataatgcattaccca aatggtcatgccatttgtattgcaaatggacattgtatgaacttctccaatgcttatact caaagagtatgtgatagagttccgatagatggattttctcagaatgagaacaagaatagt tacctgtgcaacactggaggaagaatgcatctggtttgtccaatgagtaagtctccatat gtggatcctcataaatcaggacatgaaatctgggaagaattttcgatgagcttcactcca gcagtgaaagaagtggtggaatttgcaaagcgtattcctgggttcagagatctctctcag catgaccaggtcaaccttttaaaggctgggacttttgaggttttaatggtacggttcgca tcattatttgatgcaaaggaacgtactgtcacctttttaagtggaaagaaatatagtgtg gatgatttacactcaatgggagcaggggatctgctaaactctatgtttgaatttagtgag aagctaaatgccctccaacttagtgatgaagagatgagtttgtttacagctgttgtcctg gtatctgcagatcgatctggaatagaaaacgtcaactctgtggaggctttgcaggaaact ctcattcgtgcactaaggaccttaataatgaaaaaccatccaaatgaggcctctattttt acaaaactgcttctaaagttgccagatcttcgatctttaaacaacatgcactctgaggag ctcttggcctttaaagttcacccttaa >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_7|261_aa MANLKLPMRSQLAHKIPETLAFMSWLRSACSPCYWHSLRPWSKVGAKSWGHEQQQETDGF LGRRRRVPSEAPPSGYRGPECWQLSRQPCRPEWKLVVPFPGPPMAARGPISMHFLLSEAH KIPRLSQSWGKAPLHLTLHSSVYLILPGCRTRTWDPLNGKAKSCNTNRIETCPLLTTLWV KERKAAASPSGTSHLGTPQAKAVIPSLEPCGSWHLQPSGHHCIPRCQLGKLLMVHLVQLQ PRREPAPGDVYPMAAADVSAQ >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_7|786_bp atggccaatctcaagctaccaatgcgaagtcaactggctcacaaaattcctgaaacattg gctttcatgagctggctcagaagtgcctgttccccctgctattggcactcactccgacct tggagcaaagttggggccaagtcctggggtcatgaacagcagcaagagacagacgggttc ctgggcagaaggaggcgggtccccagtgaggccccaccttcaggctacagagggcctgaa tgctggcaactgagccgccagccctgcagaccagagtggaaacttgtggtgccttttcca ggcccacccatggctgcccgtggaccaatcagcatgcacttcctcctctccgaggcccat aaaatccctaggctgagtcagagctggggaaaagctcctcttcatcttaccctccactca tctgtgtacctcattcttcctggttgcaggacaagaacttgggacccactgaatggcaaa gctaaaagttgtaacacaaataggattgaaacatgccccttgctcaccacgctgtgggtg aaggagagaaaagctgcagccagcccttcagggacgtcacacctgggaacgccccaagcc aaggctgtgattccctctttggagccctgtggttcctggcatcttcagccttccggccac cactgcattcccaggtgccagctgggaaagctgctcatggtgcacctggtccagctgcag cctcgcagagagcctgcacctggagatgtctatcccatggcagcagctgatgtgtctgca cagtag >gi568815595r:23793098_24010904|GENSCAN_predicted_peptide_8|136_aa MKPQQQKTAIRNTKMFVEKKDDKIMHKAGPHRSSAYRQKWKKKVENTHLNHLSQEVAHIT HHFSLYAIVNEHLLSTYYALSATLEAEDAEKLQSISSRSLRFCGVRGKRRKHNSADEGAL KSNWWKPRKCPKGDGG >gi568815595r:23793098_24010904|GENSCAN_predicted_CDS_8|411_bp atgaaaccacagcaacagaaaactgccatccgaaacactaagatgttcgtggaaaaaaag gatgacaagataatgcacaaagcagggcctcacaggtcttctgcatacaggcagaagtgg aaaaagaaagtggagaacacacatcttaaccacctcagccaggaagtggcacacattaca catcatttctcactatatgccattgtcaatgagcatttactgagcacctactatgctcta agtgccacactagaggctgaagatgctgagaaactacaatctatatcctcaagaagctta cggttctgtggagtcagggggaaaaggaggaagcataactcagcagatgaaggagccctt aagtcaaactggtggaaacccagaaagtgtcccaagggagatggtggctga