GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:54:53 Sequence gi568815595f:23707270_23990603 : 283334 bp : 42.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3113 3306 194 2 2 58 -8 155 0.424 1.11 1.02 Intr + 3338 3561 224 0 2 15 44 224 0.535 7.62 1.03 Intr + 32754 32811 58 2 1 88 54 64 0.016 0.54 1.04 Intr + 35216 35389 174 0 0 75 66 68 0.122 2.29 1.05 Intr + 55824 55978 155 1 2 81 105 14 0.177 1.27 1.06 Term + 56170 56277 108 0 0 79 42 138 0.562 5.83 1.07 PlyA + 56280 56285 6 1.05 2.00 Prom + 61314 61353 40 -3.65 2.01 Init + 76079 76109 31 1 1 78 115 19 0.168 3.55 2.02 Intr + 82883 82909 27 0 0 87 86 42 0.063 1.07 2.03 Intr + 98734 98819 86 2 2 100 80 60 0.881 5.02 2.04 Intr + 99968 100152 185 1 2 116 69 252 0.946 23.76 2.05 Intr + 104191 104241 51 1 0 86 109 55 0.185 4.50 2.06 Intr + 108225 108423 199 0 1 106 31 36 0.109 -1.87 2.07 Intr + 112422 112552 131 2 2 60 31 125 0.393 2.57 2.08 Intr + 114715 114842 128 1 2 82 73 94 0.359 6.70 2.09 Intr + 134815 134884 70 2 1 60 110 30 0.006 -0.28 2.10 Term + 152849 152972 124 2 1 19 46 135 0.017 -0.72 2.11 PlyA + 153502 153507 6 1.05 3.00 Prom + 154302 154341 40 -3.05 3.01 Init + 154350 154624 275 2 2 43 72 193 0.422 9.69 3.02 Intr + 165861 165936 76 0 1 64 115 49 0.147 3.80 3.03 Intr + 171376 171542 167 0 2 26 101 103 0.397 3.24 3.04 Intr + 172343 172429 87 2 0 93 94 48 0.381 4.07 3.05 Intr + 180298 180430 133 1 1 53 116 121 0.748 11.13 3.06 Intr + 186897 187126 230 2 2 8 72 154 0.286 1.54 3.07 Term + 187569 187767 199 0 1 53 41 128 0.681 0.49 3.08 PlyA + 189052 189057 6 1.05 4.04 PlyA - 190547 190542 6 1.05 4.03 Term - 193780 193500 281 2 2 36 37 265 0.767 10.82 4.02 Intr - 195704 195617 88 2 1 47 36 122 0.512 1.52 4.01 Init - 200410 200348 63 1 0 69 81 72 0.610 5.80 4.00 Prom - 207058 207019 40 -7.15 5.00 Prom + 209033 209072 40 -7.45 5.01 Init + 210591 210762 172 2 1 99 102 209 0.995 23.05 5.02 Intr + 211171 211307 137 2 2 91 84 171 0.999 16.37 5.03 Term + 211927 212232 306 0 0 91 34 206 0.998 9.63 5.04 PlyA + 212262 212267 6 1.05 6.03 PlyA - 212608 212603 6 1.05 6.02 Term - 219263 219115 149 0 2 64 43 206 0.978 10.78 6.01 Init - 220285 220219 67 1 1 65 44 26 0.304 -2.81 6.00 Prom - 223288 223249 40 -7.55 7.00 Prom + 225686 225725 40 -6.65 7.01 Init + 229692 229817 126 2 0 63 97 132 0.847 11.81 7.02 Intr + 238936 239213 278 0 2 4 -9 269 0.007 4.19 7.03 Intr + 247329 247534 206 0 2 88 108 95 0.410 9.42 7.04 Intr + 248768 248856 89 0 2 96 98 49 0.973 5.47 7.05 Intr + 252402 252546 145 2 1 82 80 77 0.990 5.23 7.06 Intr + 254708 255336 629 0 2 50 40 468 0.758 29.20 7.07 Intr + 257708 257893 186 1 0 48 92 75 0.650 2.86 7.08 Intr + 260544 260754 211 2 1 113 115 210 0.998 23.76 7.09 Term + 269954 270150 197 0 2 60 48 114 0.804 1.19 7.10 PlyA + 270257 270262 6 1.05 8.04 PlyA - 271161 271156 6 1.05 8.03 Term - 274474 274083 392 2 2 30 37 282 0.873 11.46 8.02 Intr - 275138 274819 320 1 2 85 97 177 0.632 12.98 8.01 Init - 281798 281725 74 2 2 115 99 27 0.944 7.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 210122 209901 222 2 0 71 45 314 0.993 21.33 S.002 Init - 210269 210246 24 2 0 67 79 33 0.811 -0.11 S.003 Term - 244549 244328 222 1 0 60 37 157 0.940 3.73 S.004 Init - 244924 244697 228 1 0 88 63 111 0.918 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_1|304_aa XQTRAVLVTLLECKFSEVRDLPYSPLSLVPKQYLAHSNTSVRICGINKRITKQLDENGDI TEESRQRPSSSAFIICNTSQCRMNPVNALTPKTGRLLPALLTDVYIPVSIVTEMPQKPAA GLHPKLNTHKCLETWGDQGQKYSQSNEEIFIEEELYGLGGGKEKQQQPLYTNMPAQLGSL VFGCRSRTNFQVAFRENPGKQFQSGNLLSLNISKTHLEDEIVKLALKTLFSLSSIQRDNC GRKELSCLPDPAHSFFSGQLLLCRGSWEELFLAHAANRAQQARGGANMSPSSSPSAKVHL KHEK >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_1|915_bp nngcaaaccagggcagttctggtcaccctactcgagtgtaagttctctgaggtcagagat ttgccttattcacctctgtctctggtccctaaacagtacctagcacatagtaacacctcg gtacgtatttgtggaataaataagcgaataactaagcagctggatgaaaatggagacata actgaggagtctcgccaaagacctagtagttctgctttcatcatttgcaacacatcacaa tgccgaatgaatcccgtcaatgctttaactcctaagactggccgcctgcttcctgcccta ctcactgatgtttatattcctgtctccatagttactgagatgcctcagaagccagcagct gggttgcatcccaagctgaatacgcacaagtgtctggagacctggggtgatcaaggccag aaatattcacaatctaatgaagaaatattcattgaggaagagctctatggcctgggagga ggaaaagaaaaacaacaacaacctttgtatacaaatatgccagcccaacttggttctctt gtatttggctgcagatcaagaacaaactttcaagtggcatttcgagagaatccaggaaag cagtttcaatctggaaatttgctctccttaaatatttcaaaaacacatttagaagatgaa attgtgaaattggccttgaaaactttattttcgctttcctcaattcaaagggacaattgt ggaaggaaagaactgagctgcttgccagacccagcgcactcctttttctctggacagctg ctgctttgcagagggtcgtgggaggagttatttttagcacacgcagccaacagagctcag caggcccgtggaggcgctaacatgtcccccagttcatctccttctgcaaaggtgcatttg aagcatgaaaaataa >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_2|343_aa MRTGLKCKENVEEDEWRAEARERHEWPGPAAAAAAAAAAARRSQTQREGLFAGWGGGFAM SDDDSRASTSSSSSSSSNQQTEKETNTPKKKESKVSMSKNSKLLSTSAKRIQKELADITL DPPPNCRCSKLALCILPRSFCEFTLCFSGIPIPPRIGGNGNEMSVQKSQAAHHGSQWKEI LFLQQWSVTYLVQKIVEIAGEIAVTFYSCPLLAVFLSWHPSSFGNFDLPAITEKLWTKER HHEKSAQDEDSGIWHLEVYTRKSQQRRQKENQESVVLESSESKCGVTFGLTIMGSGISDV RIRLKFRAQPFAASLLSQLSVLGFWVLSPWLDEDERSDPLRFL >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_2|1032_bp atgaggacaggcctcaaatgtaaggaaaacgtggaggaggatgaatggagagcggaagcg agagagagacacgagtggccaggcccagccgcagccgcagcagcagccgccgcggcggca cggaggagccagacacaaagagaggggctgtttgcggggtggggtggggggttcgctatg tcggatgacgattcgagggccagcaccagctcctcctcatcttcgtcttccaaccagcaa accgagaaagaaacaaacacccccaagaagaaggagagtaaagtcagcatgagcaaaaac tccaaactcctctccaccagcgccaagagaattcagaaggagctggcggacatcacttta gaccctccacctaattgcagatgctccaagttagcactttgtatacttccacgtagtttc tgtgaatttacattgtgtttttctggcattccaatacctcctcgaattggtggtaatggt aatgagatgtcagttcagaagtcccaggctgctcatcatggttcacaatggaaagagatc cttttcttgcaacagtggagtgttacatacctcgtccagaaaatagtggaaattgctggg gaaattgctgtcacattttacagctgtccattgctggcagtctttctgagttggcaccca agcagttttggcaattttgaccttcctgcaatcacagagaagctctggacaaaggaaagg catcatgagaagagtgcccaggatgaagactcaggcatctggcatttagaagtgtacact aggaaaagccagcaaaggagacagaaggaaaatcaagagagtgtggtattagaaagcagt gaaagtaaatgtggtgtaacttttggtttgactataatgggcagtggaatctcagatgtt aggatcagactgaaattcagagctcagccctttgctgccagcttgctctcacagctgtct gtccttggtttctgggtgctttcaccttggctagatgaagatgaacgttctgacccgctc aggtttctgtag >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_3|388_aa MPRETHWRRNTQVDGRREECTGVGAHWDASRPSTGGMTQSLAGAVGEELGHQAARLQGKT ISLLAAPSAESCFHSIKPRTHSPSPHVILFFWLWPSHTWCRFRKISQYKCEKMGVKMAHA NPIVNCSCEGSRLRAPYENLMPDDLLLSPITPTWDCLVAGKQARGLPTDSTLCVCPNIWG GALIPVRGHQGGLGRGGGQGTSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITFTPEY PFKPPKTLRVTEYSSQSSGRRRDPKPDFQIVPVKITQDDVTVDNGQQELSKASLRSSSEL IRSWNHYCLPSCLYIELELPVLQPAPQLLLITELTHGQPALAVTLVIILSDFTGHVDAAS SNLVSYSLTSPMVSSSLPQPFTTPVVIS >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_3|1167_bp atgccgagagaaacacattggcgaaggaatacacaggtggatggacgtcgagaggaatgc actggtgtaggagcacactgggatgccagcaggccatcgactggtggaatgacacaaagt ttggctggggcagttggagaagagttgggccaccaagcggccagactccaggggaaaacc atttcccttctggctgccccatctgctgagagctgcttccactcaataaaacctcgcact cattctccaagcccacatgtgatcctattcttctggctgtggccctctcacacctggtgt agattcaggaaaatctcacagtacaaatgtgagaaaatgggagtaaaaatggcacacgca aaccctattgtgaactgctcatgtgagggatctaggttgcgtgctccttatgagaatcta atgcctgatgatctgctactgtcacccatcacccccacatgggactgtctagttgcagga aaacaagctcgggggctccccactgattctacattatgcgtctgtcctaacatctgggga ggtgctctgatacctgtccgtgggcaccagggaggcctgggccgcggcggaggacagggc accagtgctggtcccaaaggcgataacatctatgaatggagatcaaccattctagggcct ccaggatccgtgtatgagggtggtgtattctttctcgatatcacttttacaccagaatat cccttcaagcctccaaagacactgcgcgtgacagaatacagcagtcaaagctcggggagg agaagagaccccaaaccagattttcagatagtgcccgtaaagattacacaagacgatgtg acagtagacaatgggcagcaagaactgtcaaaggcctctctgaggagcagcagtgaactc atcagatcttggaaccactactgtcttccttcctgtctgtacattgagcttgagctccct gtgctccagccggccccccaactgctcctcatcacagagctgactcatggtcaacctgct ctcgccgttactctggtcataattcttagtgatttcactggccacgtagatgctgcttcc agtaacctggtctcttattctttgacttctccaatggtctcatcatccttacctcagcca ttcaccactcccgtggtcatatcctag >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_4|143_aa MVKRGSNGNYTFSATDSICTKILQIPLDGDKELAASVLGAERESRRTSTFRMEDCETMED VYMASVETDRGVKEQLHLYDTRGLQEGVELPKHYFSFADGFVLVYSVNNLESFQRVELLK KEIDKFKDKKEASGYVKNAKCEL >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_4|432_bp atggtgaagagaggatctaatggcaactatacgtttagtgccactgactcgatttgcact aagattctccagattcccctggatggggataaagagctggctgccagtgttttgggagca gaaagggaaagccggaggacttcaacattcagaatggaagattgcgaaacaatggaagat gtatacatggcttcagtagaaacagaccgaggagtaaaagaacagttacatctttatgac accagaggtctacaggaaggcgtggagctgccaaagcattatttttcatttgctgatggc ttcgttcttgtgtacagtgtgaataaccttgaatcctttcaaagagtggagcttctgaag aaagaaatcgataagttcaaagacaaaaaagaggcaagtggatatgttaaaaatgccaaa tgtgaattataa >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_5|204_aa MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTRPDKARRLGYKAKQGYV IYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVAEERAGRHCGALRVLNSYW VGEDSTYKFFEVILIDPFHKAIRRNPDTQWITKPVHKHREMRGLTSAGRKSRGLGKGHKF HHTIGGSRRAAWRRRNTLQLHRYR >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_5|615_bp atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgatgtcatgcgc tttcttctgagggtccgctgctggcagtaccgccagctctctgctctccacagggctccc cgccccacccggcctgataaagcgcgccgactgggctacaaggccaagcaaggttacgtt atatataggattcgtgttcgccgtggtggccgaaaacgcccagttcctaagggtgcaact tacggcaagcctgtccatcatggtgttaaccagctaaagtttgctcgaagccttcagtcc gttgcagaggagcgagctggacgccactgtggggctctgagagtcctgaattcttactgg gttggtgaagattccacatacaaattttttgaggttatcctcattgatccattccataaa gctatcagaagaaatcctgacacccagtggatcaccaaaccagtccacaagcacagggag atgcgtgggctgacatctgcaggccgaaagagccgtggccttggaaagggccacaagttc caccacactattggtggctctcgccgggcagcttggagaaggcgcaatactctccagctc caccgttaccgctaa >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_6|71_aa MPIKSNAFISFGGVSCRLPGFQKNDTKKKKRKKRKRKEKERKRKKRKKREEEDEEQEEEE EGGGGRRKKKK >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_6|216_bp atgccaattaaatcaaatgctttcatcagttttggaggagtgagttgccgtttgccagga tttcagaagaatgacacaaaaaaaaagaagaggaagaagaggaagaggaaagagaaggag agaaagaggaagaagagaaaaaagagagaagaggaggacgaggagcaagaggaggaggag gaaggaggaggaggaagaagaaagaagaagaaataa >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_7|688_aa MRCTWRIHQWRRAPDKLKLSEELIVEKGPTWVSKSPLSIGDKRALTPQCTGRRTLFSRGD PKARTPRKPNEPAPGEAGSCIPCRFRGGSAGRGRAAGRRRRLGEGVGLSRRDTLEGALRC FDPERLPADWVAPPLEGSENSFQSSSSSVPSSPNSSNSDTNGNPKNGDLANIEGILKNDR IDCSMKTSKSSAPGMTKSHSGVTKFSGMVLLCKVCGDVASGFHYGVHACEGCKGFFRRSI QQNIQYKKCLKNENCSIMRMNRNRCQQCRFKKCLSVGMSRDAVRFGRIPKREKQRMLIEM QSAMKTMMNSQFSGHLQNDTLVEHHEQTALPAQEQLRPKPQLEQENIKSSSPPSSDFAKE EVIGMVTRAHKDTFMYNQEQQENSAESMQPQRGERIPKNMEQYNLNHDHCGNGLSSHFPC SESQQHLNGQFKGRNIMHYPNGHAICIANGHCMNFSNAYTQRVCDRVPIDGFSQNENKNS YLCNTGGRMHLVCPMSKSPYVDPHKSGHEIWEEFSMSFTPAVKEVVEFAKRIPGFRDLSQ HDQVNLLKAGTFEVLMVRFASLFDAKERTVTFLSGKKYSVDDLHSMGAGDLLNSMFEFSE KLNALQLSDEEMSLFTAVVLVSADRSGIENVNSVEALQETLIRALRTLIMKNHPNEASIF TKLLLKLPDLRSLNNMHSEELLAFKVHP >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_7|2067_bp atgcgctgcacttggagaattcaccagtggaggagagctcctgataaactgaagctgagt gaagagttgattgttgaaaagggacccacctgggtctctaagtctcctctaagtattggc gacaagcgggcgctgacaccgcagtgcaccggacgccgcacgctcttttcgcgaggtgac cccaaggcgcggaccccgcgcaaaccaaacgaaccggcgcctggggaggctggtagctgc ataccttgcagattccgaggaggaagtgcaggacgagggcgtgctgcaggccggaggagg cgcctcggggaaggcgtggggctttcccgaagggatacgctcgaaggagctctgaggtgc ttcgatcccgagcgactccccgcagactgggtagcaccgccccttgagggttctgagaat agtttccagtcctcctcctcttctgttccatcttctccaaatagctctaattctgatacc aatggtaatcccaagaatggtgatctcgccaatattgaaggcatcttgaagaatgatcga atagattgttctatgaaaacaagcaaatcgagtgcacctgggatgacaaaaagtcatagt ggtgtgacaaaatttagtggcatggttctactgtgtaaagtctgtggggatgtggcgtca ggattccactatggagttcatgcttgcgaaggctgtaagggtttctttcggagaagtatt caacaaaacatccagtacaagaagtgcctgaagaatgaaaactgttctataatgagaatg aataggaacagatgtcagcaatgtcgcttcaaaaagtgtctgtctgttggaatgtcaaga gatgctgttcggtttggtcgtattcctaagcgtgaaaaacagaggatgctaattgaaatg caaagtgcaatgaagaccatgatgaacagccagttcagtggtcacttgcaaaatgacaca ttagtagaacatcatgaacagacagccttgccagcccaggaacagctgcgacccaagccc caactggagcaagaaaacatcaaaagctcttctcctccatcttctgattttgcaaaggaa gaagtgattggcatggtgaccagagctcacaaggatacctttatgtataatcaagagcag caagaaaactcagctgagagcatgcagccccagagaggagaacggattcccaagaacatg gagcaatataatttaaatcatgatcattgcggcaatgggcttagcagccattttccctgt agtgagagccagcagcatctcaatggacagttcaaagggaggaatataatgcattaccca aatggtcatgccatttgtattgcaaatggacattgtatgaacttctccaatgcttatact caaagagtatgtgatagagttccgatagatggattttctcagaatgagaacaagaatagt tacctgtgcaacactggaggaagaatgcatctggtttgtccaatgagtaagtctccatat gtggatcctcataaatcaggacatgaaatctgggaagaattttcgatgagcttcactcca gcagtgaaagaagtggtggaatttgcaaagcgtattcctgggttcagagatctctctcag catgaccaggtcaaccttttaaaggctgggacttttgaggttttaatggtacggttcgca tcattatttgatgcaaaggaacgtactgtcacctttttaagtggaaagaaatatagtgtg gatgatttacactcaatgggagcaggggatctgctaaactctatgtttgaatttagtgag aagctaaatgccctccaacttagtgatgaagagatgagtttgtttacagctgttgtcctg gtatctgcagatcgatctggaatagaaaacgtcaactctgtggaggctttgcaggaaact ctcattcgtgcactaaggaccttaataatgaaaaaccatccaaatgaggcctctattttt acaaaactgcttctaaagttgccagatcttcgatctttaaacaacatgcactctgaggag ctcttggcctttaaagttcacccttaa >gi568815595f:23707270_23990603|GENSCAN_predicted_peptide_8|261_aa MANLKLPMRSQLAHKIPETLAFMSWLRSACSPCYWHSLRPWSKVGAKSWGHEQQQETDGF LGRRRRVPSEAPPSGYRGPECWQLSRQPCRPEWKLVVPFPGPPMAARGPISMHFLLSEAH KIPRLSQSWGKAPLHLTLHSSVYLILPGCRTRTWDPLNGKAKSCNTNRIETCPLLTTLWV KERKAAASPSGTSHLGTPQAKAVIPSLEPCGSWHLQPSGHHCIPRCQLGKLLMVHLVQLQ PRREPAPGDVYPMAAADVSAQ >gi568815595f:23707270_23990603|GENSCAN_predicted_CDS_8|786_bp atggccaatctcaagctaccaatgcgaagtcaactggctcacaaaattcctgaaacattg gctttcatgagctggctcagaagtgcctgttccccctgctattggcactcactccgacct tggagcaaagttggggccaagtcctggggtcatgaacagcagcaagagacagacgggttc ctgggcagaaggaggcgggtccccagtgaggccccaccttcaggctacagagggcctgaa tgctggcaactgagccgccagccctgcagaccagagtggaaacttgtggtgccttttcca ggcccacccatggctgcccgtggaccaatcagcatgcacttcctcctctccgaggcccat aaaatccctaggctgagtcagagctggggaaaagctcctcttcatcttaccctccactca tctgtgtacctcattcttcctggttgcaggacaagaacttgggacccactgaatggcaaa gctaaaagttgtaacacaaataggattgaaacatgccccttgctcaccacgctgtgggtg aaggagagaaaagctgcagccagcccttcagggacgtcacacctgggaacgccccaagcc aaggctgtgattccctctttggagccctgtggttcctggcatcttcagccttccggccac cactgcattcccaggtgccagctgggaaagctgctcatggtgcacctggtccagctgcag cctcgcagagagcctgcacctggagatgtctatcccatggcagcagctgatgtgtctgca cagtag