GENSCAN 1.0    Date run: 5-Nov-116    Time: 04:24:32

Sequence gi568815592f:118805650_119007611 : 201962 bp : 38.34% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 PlyA -   3113   3108    6                               1.05
 1.13 Term -  10645   9175 1471  0  1   96   33  1064 0.970  90.81
 1.12 Intr -  16518  16254  265  1  1   65   38   198 0.146   8.15
 1.11 Intr -  16908  16616  293  2  2  -19   71   188 0.455   2.25
 1.10 Intr -  20643  20498  146  0  2  116  100   155 0.888  17.66
 1.09 Intr -  21215  21133   83  0  2   65  110   125 0.998  10.84
 1.08 Intr -  22481  22278  204  0  0   44   21   238 0.996  10.95
 1.07 Intr -  23601  23399  203  2  2   80  102   249 0.966  23.51
 1.06 Intr -  33818  33680  139  0  1   52   72   128 0.486   6.20
 1.05 Intr -  34103  33924  180  0  0   28   43   145 0.601   2.92
 1.04 Intr -  41029  40848  182  0  2   86   47   108 0.951   5.09
 1.03 Intr -  50896  50722  175  2  1  107   86   150 0.988  14.78
 1.02 Intr -  55616  55537   80  1  2   76   88   120 0.358   9.08
 1.01 Init -  56576  56506   71  2  2  100    6    89 0.307   0.55
 1.00 Prom -  76111  76072   40                              -3.95

 2.00 Prom +  78893  78932   40                              -3.65
 2.01 Init +  88765  88873  109  0  1   51  113   136 0.541  12.83
 2.02 Intr +  95117  95232  116  0  2   79  110    78 0.987   8.35
 2.03 Intr + 100003 100179  177  0  0   80   90   145 0.999  13.09
 2.04 Term + 101753 101965  213  1  0   97   47   109 0.999   3.95
 2.05 PlyA + 103498 103503    6                               1.05

 3.28 PlyA - 103625 103620    6                               1.05
 3.27 Term - 105737 105679   59  0  2   82   43    45 0.725  -3.73
 3.26 Intr - 106120 106001  120  2  0   89   72    94 0.765   7.45
 3.25 Intr - 107771 107646  126  0  0  112   88    49 0.978   7.03
 3.24 Intr - 112112 111912  201  0  0   81   72   180 0.979  14.04
 3.23 Intr - 116437 116356   82  1  1   88   99    78 0.999   7.19
 3.22 Intr - 118458 118162  297  0  0   -2   99   235 0.410  11.75
 3.21 Intr - 122444 122273  172  1  1   52   60   172 0.788   9.82
 3.20 Intr - 126089 125851  239  2  2   94   82   148 0.552  10.29
 3.19 Intr - 127091 127060   32  0  2   72  107    24 0.622  -0.37
 3.18 Intr - 129195 129082  114  1  0   72   36   113 0.530   4.00
 3.17 Intr - 129338 129242   97  2  1   52  100    90 0.459   5.26
 3.16 Intr - 133990 133895   96  1  0   96   80    32 0.408   2.49
 3.15 Intr - 135050 134847  204  2  0  130    9    90 0.132   3.77
 3.14 Intr - 135221 135194   28  1  1   68   94     9 0.064  -3.30
 3.13 Intr - 139277 139155  123  1  0  101   74    43 0.106   2.98
 3.12 Intr - 156314 156112  203  2  2   60   78   159 0.783   9.26
 3.11 Intr - 159122 159018  105  2  0   37   95    80 0.891   3.09
 3.10 Intr - 161303 161186  118  1  1   93   50    86 0.984   4.85
 3.09 Intr - 168925 168779  147  0  0   22   91   153 0.990   7.43
 3.08 Intr - 169559 169375  185  2  2   80   91   151 0.999  12.26
 3.07 Intr - 170395 170268  128  2  2   62  111   179 0.977  17.08
 3.06 Intr - 173869 173716  154  1  1   89   -4   187 0.997   8.32
 3.05 Intr - 174683 174489  195  2  0   40   89   185 0.787  12.49
 3.04 Intr - 182471 182430   42  2  0   92  101    25 0.090   1.72
 3.03 Intr - 197400 197250  151  2  1   32   91   158 0.975   9.74
 3.02 Intr - 197973 197852  122  0  2   49   56   203 0.998  11.57
 3.01 Intr - 200959 200798  162  1  0   69   76   101 0.846   6.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  73450  73527   78  0  0   92   70    40 0.843   3.71
S.002 Term +  76246  76308   63  0  0   97   37    80 0.823   0.71
S.003 Init - 176920 176870   51  1  0   65   85    17 0.848   0.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:118805650_119007611|GENSCAN_predicted_peptide_1|1163_aa
MALDLGIPALLGAQEAPLPLKAWNSWCLRVFGHHRVPFVQMLVPAVEAAYGLTVTAVKDS GEWNLEAGALVLADAGLCCIDEFNSLKEHDRTSIHEAMEQQTISVAKAGMFNIHCLPPSS PDHQDSRRLLPLAGLLYLLRDPSHDCQLWSPELSGPSVYSPVVNDFDSQVSAGDTQANRV WSGPPANASRPAAEGPVRRRDNKQKGIASTSTKRTSTHKPHPKVTNIKDSREQNWMENEF DKLTEVGFRMWEITNSSELKEHVLTQCKEDKNLEKSLVCKLNTRTTILAATNPKGQYDPQ ESVSVNIALGSPLLSRFDLILVLLDTKNEDWDRIISSFILENKGYPSKSEKLWSMEKMKT YFCLIRNLQPTLSDVGNQVLLRYYQMQRQSDCRNAARTTIRLLESLIRLAEAHARLMFRD TVTLEDAITVVSVMESSMQGGALLGGVNALHTSFPENPGEQYQRQCELILEKLELQSLLS EELRRLERVQQTLHVGELCWHQAGAPLGQSFQRKEQAAIFAVLHPPLVIPQANRVWSGPL ANCSRLVEEGPIRRKTSKQKAKTTTSTKRAPTQKPHPKVISSKIKARAQNWTENEIDELT
EVGFRRWVITNSSKLKEHVLTQCKEAKNLDKRLQELLTGITSLERSVNYLMELKNTAREL HEAYTSISTQINQALQNQSVHQSQPRVLEVETTPGSLRNGPGEESNFRTSSQQEINYSTH IFSPGGSPEGSPVLDPPPHLEPNRSTSRKHSAQHKNNRDDSLDWFDFMATHQSEPKNTVV VSPHPKTSGENMASKISNSTSQGKEKSEPGQRSKVDIGLLPSPGETGVPWRADNVESNKK KRLALDSEAAVSADKPDSVLTHHVPRNLQKLCKERAQKLCRNSTRVPAQCTVPSHPQSTP VHSPDRMLDSPKRKRPKSLAQVEEPAIENVKPPGSPVAKLAKFTFKQKSKLIHSFEDHSH VSPGATKIAVHSPKISQRRTRRDAALPVKRPGKLTSTPGNQISSQPQGETKEVSQQPPEK HGPREKVMCAPEKRIIQPELELGNETGCAHLTCEGDKKEEVSGSNKSGKVHACTLARLAN FCFTPPSESKSKSPPPERKNRGERGPSSPPTTTAPMRVSKRKSFQLRGSTEKLIVSKESL FTLPELGDEAFDCDWDEEMRKKS >gi568815592f:118805650_119007611|GENSCAN_predicted_CDS_1|3492_bp atggctctggacctaggcatccctgcactcttgggggcccaggaagccccccttcccctg aaggcttggaattcctggtgtctccgagtttttgggcaccatcgtgttccctttgttcag atgctggtgcctgcagtagaagctgcttatggtctgacggtaactgctgtaaaagactca ggagaatggaatttggaggctggggcattagttcttgcagatgcgggcctttgctgtatt gatgagttcaatagcctcaaagagcatgataggaccagtatccatgaagcaatggagcaa caaaccataagtgttgctaaggctgggatgtttaatatccactgcttgcctccttcttca cctgatcaccaagactccaggcgactcttgccccttgccggtctcctctatctactccgg gatccttcccatgactgccagctctggtccccggaactttcaggcccctctgtctattct ccagtagtcaatgactttgattcacaagtctctgctggtgatacccaggcaaacagggtc tggagtggacctccagcaaacgccagcagacctgcagcagaggggcctgttagaaggaga gataacaaacagaaaggaatagcatcaacatcaacaaaaaggacgtccacacataaaccc catccgaaggtcaccaacatcaaagactcaagggaacaaaactggatggagaatgagttt gataaattgacagaagtaggcttcagaatgtgggaaataacaaactcctctgagctaaag gagcatgttctaacccagtgcaaggaagataagaaccttgaaaaaagcctcgtgtgcaag ctgaacacaaggaccaccatcctggcagcaacgaaccccaaaggccagtacgacccccag gagtccgtgtctgtgaacattgccctcggcagcccactcttaagtcgatttgacctgatc ctggttttgcttgataccaagaatgaagactgggatcgtatcatttcctcctttatctta gaaaataaaggttacccaagcaaatcagagaagctctggagcatggaaaagatgaaaacc tatttctgcctcataaggaatctgcagcccacactgtctgatgtgggcaatcaggttctt ctccggtactaccagatgcaaaggcagagtgattgccggaacgctgcccggaccaccatt cggctgttggaaagcttgatacgattagcagaagctcatgctcgcctgatgtttcgtgat actgtaactctggaagacgctattacggtggtgtcagtcatggagtcctcaatgcaggga 
ggtgcactgctaggaggtgtgaatgccctccacacttcctttcctgaaaaccctggagag cagtaccagagacagtgtgaacttattctggaaaagctagagctgcagagcctcttgagt gaagagcttagaagacttgaaagggttcagcagacacttcatgtaggagagctctgctgg catcaggctggtgcccctctgggacaaagcttccagaggaaggagcaggcagcaatcttt gctgttctgcaccctccactggtgataccccaggcaaacagggtctggagtggacctcta gcaaactgcagcagacttgtggaagaggggcctattagaagaaaaactagcaaacagaaa gcaaaaacaacaacatcaacaaaaagggcccccacacaaaaaccccatcctaaggtcatc agctcaaagatcaaagcaagggcacaaaactggacagagaatgagattgacgaattgaca gaagtaggcttcagaaggtgggtaataacaaactcctccaagctaaaggagcatgttcta acccaatgcaaggaagctaagaaccttgataaaagattacaggaactgctaactggaata accagtttagaaagaagcgtaaattacctgatggagctgaaaaacacagcacgagaactt catgaagcatacacaagtatcagtactcaaatcaatcaagcgttacagaatcagagtgtg caccaatcccaaccacgggtattggaggtagagactactccaggatccttgagaaatggt ccaggggaagaatcaaacttcagaacttcatcacagcaggaaatcaactatagcacacat atcttctctcctggaggcagccccgagggaagcccagttctagatcccccaccgcatctg gagcctaatagatcaacaagtaggaaacattcagctcagcacaaaaataacagagatgac agtttagattggtttgatttcatggcaactcatcagagtgaacctaaaaacactgttgtt gtgtctcctcatcccaaaacatctggagaaaatatggcttcgaagatctctaacagcaca tctcagggtaaggagaagagtgagccaggccaaaggagcaaagtggacattgggttgctt ccatcaccaggagagacaggtgttccatggagggcagacaatgtggaaagtaacaagaaa aaaaggctagcactagattctgaagcagcagtctctgctgataaaccagactcagtactg actcatcatgtccccaggaacctgcagaagctgtgcaaagagagggcccagaagttgtgc agaaatagcaccagggtgcctgcacagtgcacagtcccttcccatcctcagtccactcct gtacatagcccagacagaatgctggactcacccaaaagaaagagaccgaaatcccttgcg caagtggaagagcctgcaattgaaaatgttaagcctccaggttcccctgtggccaaactg gcaaaatttactttcaagcagaagtcaaaactgatccactcctttgaagatcacagccat gtgtcacctggtgcaactaaaatagcagttcatagtcctaaaatttcccagcgtagaaca agaagagacgcagccttgccggtgaagcgtccaggaaagttaacatctaccccaggaaac cagatctccagtcagccacagggtgagacaaaggaggtgtcgcagcagccaccagagaaa cacggaccaagagagaaggtgatgtgtgcccctgagaagaggattattcagcctgaatta gagcttgggaacgagactgggtgtgctcatcttacttgtgagggagacaaaaaggaagag gtttcaggcagtaataaaagcggcaaggttcatgcctgcacattagccagattggcaaac 
ttctgctttactcccccatcggaatccaaatcaaaatcccctcctcctgaaaggaagaac cgaggtgagagaggcccaagctcccctcctacaaccacagctccaatgcgtgtcagtaaa aggaaatcttttcagctccgtgggtccaccgagaaactgattgtttccaaagaatccctc ttcactttaccagaactaggtgatgaagcatttgattgtgactgggatgaagagatgaga aaaaagtcatag >gi568815592f:118805650_119007611|GENSCAN_predicted_peptide_2|204_aa MAKVQVNNVVVLDNPSPFYNPFQFEITFECIEDLSEDLEWKIIYVGSAESEEYDQVLDSV LVGPVPAGRHMFVFQADAPNPGLIPDADAVGVTVVLITCTYRGQEFIRVGYYVNNEYTET ELRENPPVKPDFSKLQRNILASNPRVTRFHINWEDNTEKLEDAESSNPNLQSLLSTDALP SASKGWSTSENSLNVMLESHMDCM >gi568815592f:118805650_119007611|GENSCAN_predicted_CDS_2|615_bp atggcaaaggttcaggtgaacaatgtagtggtgctggataacccttctcctttctacaac ccgttccagttcgagatcaccttcgagtgcatcgaggacctgtctgaagacttggaatgg aaaattatctatgtgggctctgcagaaagtgaagaatacgatcaagttttagactctgtt ttagtgggtcctgttcccgcaggaaggcatatgtttgtatttcaggctgatgcacctaat ccaggactcattccagatgcagatgcagtaggcgtaactgttgtgctaattacttgtacc tatcgaggacaagaatttattagagttggctattatgtaaataatgaatatactgagaca gaattaagggaaaatccaccagtaaaaccagacttttctaagcttcaaaggaatattttg gcatctaatcccagggtcacaagattccacattaattgggaagataacacagaaaaactg gaagatgcagagagcagtaatccaaatctacagtcacttctttcaacagatgcattacct tcagcatcaaagggatggtccacatcagaaaactcactaaatgtcatgttagaatcccac atggactgcatgtga >gi568815592f:118805650_119007611|GENSCAN_predicted_peptide_3|1233_aa IGHLQDMVRKSEQGLGSAEGLIASLQDSQERLQNELDLTKDSLKETKDALLNVEGELEQE RQQHEETIAAMKEEEKLKVDKMAHDLEIKWTENLRQECSKLREELRLQHEEDKKSAMSQL LQLKDREKNAARDSWQKKVEDLLNQRQSLGEALHKSISNNLEIQLSQSQTSLQQLQAQFT QERQRLTQELEELEEQHQQRHKSLKEAHVLAFQTMEEEKEKEQRALENHLQQKHSAELQS LKDAHRESMEGFRIEMEQELQTLRFELEDEGKAMLASLRSELNHQHAAAIDLLRHNHHQE LAAAKMELERSIDISRRQSKEHICRITDLQEELRHREHHISELDKEVQHLHENISALTKE LEFKGKEILRIRSESNQQIRLHEQDLNKRLEKELDVMTADHLREKNIMRADFNKTNELLK EINAALQVSLEEMEEKYLMRESKPEDIQMITELKAMLTERDQIIKKLIEDNKFYQLELVN RETNFNKVFNSSPTVGVINPLAKQKKKNDKSPTNRFVSVPNLSALESGGVGNGHPNRLDP IPNSPVHDIEFNSSKPLPQPVPPKGPKTFLRSSHPAGLSGFRLVLENVYKESCDVTYYWS VQVFDFFMAQYWVSTSSLKPQAKEPHSVATTTTFPQVVLPGYHQCSPMAKGLLSQLAVNA 
DRPEVNPSGQWALLCLLDSSDSPASGLGRGSTHFNVKSHNLYAIPLPSAQILSPHHVDTA QPSCAGDEPAAWLELSGPFGLGSNAFPATSDTAIRPKSRSRLITSKTIQLRDRTGTGAAR EKSPRSLRDCQAVSAEPSRFPTLPRFKMNSDQVTLVGQVFESYVSEYHKNDILLILKERD EDAHYPVVVNAMTLFETNMEIGEYFNMFPSEVLTIFDSALRSVGIFEEQSEEESTETKGA LYVKKCAEVLIEQEASHKRPYSPVQIWYLILRSVASHGRLVREHIPKTKDVGHFLSVTGT VIRTSLVKVLEFERDYMCNKCKHVFVIKADFEQYYTFCRPSSCPSLESCDSSKFTCLSGL SSSPTRCRDYQEIKIQEQVQRLSVGSIPRSMKVILEDDLVDSCKSGDDLTIYGIVMQRWK PFQQDVRCEVEIVLKANYIQVNNEQSSGIIMDEEVQKEFEDFWEYYKSDPFAGRNVILAS LCPQVFGMYLVKLAVAMVLAGGIQRTDATGTRVRGESHLLLVGDPGTGKSQFLKYAAKIT PRSVLTTGIGSTSAGKSPYHLAFYQDSATGMLN >gi568815592f:118805650_119007611|GENSCAN_predicted_CDS_3|3702_bp attggccacctccaagatatggtaaggaaaagtgaacaaggtcttggctctgcagaagga cttattgctagtcttcaggactcccaggaaaggcttcagaatgagcttgacttgactaaa gacagcctaaaggagaccaaggatgctctattaaatgtggagggtgagctagaacaagaa aggcaacagcatgaagaaacaattgctgccatgaaagaagaagagaagctcaaagtggac aaaatggcccatgacttagaaattaagtggactgaaaatcttagacaagagtgttctaaa cttcgtgaagagttaaggcttcaacatgaagaggataagaagtcagcaatgtctcaactt ttgcagttgaaagatcgagagaaaaatgcagcaagagattcatggcagaagaaagtagaa gatctcttaaaccagcggcagtctctgggtgaagctttgcataaatctatcagcaataat ctggagatacagctttcccagtctcagacttctttgcaacaactgcaagcccagtttacg caagaacgacagcggcttacgcaagagcttgaagaattagaggagcaacatcagcaaaga cacaaatcattaaaagaagcacatgtccttgcatttcaaactatggaagaggaaaaggaa aaggagcaaagagctcttgaaaatcatttacaacagaagcattctgcagagcttcaatca ctaaaagatgcacacagagagtcaatggagggcttccggatagaaatggaacaggaactt cagactcttcggtttgaattagaagatgaaggaaaggctatgcttgcttccttgcgctca gaactcaaccatcaacatgcagctgcaattgatttgttacggcataatcatcatcaagaa ttggcagctgctaaaatggaattagagagaagcatagacatcagcagaagacagagtaag gagcacatatgtagaattacagatctacaagaggaattaagacacagagagcatcacatc tctgaattggataaggaggttcagcaccttcatgagaatataagtgccctaaccaaagaa ctggaatttaaggggaaagaaattctcagaatacgaagtgaatctaaccaacagataagg ttgcatgaacaagatttaaacaagagacttgaaaaagagttggatgtcatgacagcagac cacctcagagagaaaaatatcatgcgggcagattttaataagactaacgagctactcaag gaaataaatgccgctttacaagtgtcattagaagaaatggaagaaaaatatctaatgaga 
gaatcaaaaccagaagatatacagatgattacagaattaaaagccatgcttacagaaaga gaccagatcataaagaaactaattgaggataataagttttatcagctggaattagtcaat cgagaaactaacttcaacaaagtgtttaactcaagtcctactgttggtgttattaatcca ttggctaagcaaaagaagaagaatgataaatcaccaacaaacaggtttgtgagtgttccc aatctaagtgctctggaatctggtggagtgggcaatggacatcctaaccgcctggatccc attcctaattctccagtccacgatattgagttcaacagcagcaaaccacttccacagcca gtgccacctaaagggcccaagacatttttgaggtctagccacccagcagggctatcaggc ttcagactggtgctagaaaatgtctataaagagtcctgtgatgtgacttattattggtct gttcaggttttcgatttcttcatggctcaatattgggtatcgacaagttccctcaagccc caggcaaaggagcctcactctgtggctaccaccactacattcccacaggtagtactgcca ggctaccaccaatgttcacctatggccaaaggtctcctcagtcagcttgcagtgaatgct gacaggcctgaggttaacccttcagggcagtgggctctcctctgcctgctggattcaagt gattctcctgcctctggcttagggcggggaagcactcacttcaacgtaaagtcccataat ctctatgctatccctctcccaagtgcacagattctctctccacaccatgtggacactgcc cagccttcttgcgccggagacgaacccgctgcctggctggagctctctgggcccttcggc ctcgggagcaacgcgttcccggcgacttctgacacagccattcgtcccaagtctcgttcc cgtctcattacttcaaaaactattcaactccgggaccggacaggcacgggagctgcccgg gagaagagccccagaagtctcagagactgccaagctgtttctgcagaacccagtagattt cctacgttacctagattcaagatgaatagcgatcaagttacactggttggtcaagtgttt gagtcatatgtttcggaataccataagaatgatattcttctaatcttgaaggaaagggat gaagatgctcattacccagttgtggttaatgccatgactctgtttgagaccaacatggaa atcggggaatatttcaacatgttccccagtgaagtgcttacaatttttgatagtgcactg cgaagtgtgggcatttttgaggagcagagtgaagaggagagcactgagacaaagggagca ctgtatgtgaaaaagtgcgcagaagtcttgatagaacaggaagccagtcacaaaaggccc tatagtccagtccagatttggtaccttattcttagatctgtggcaagccatggaaggctg gtgagggaacacatacctaaaaccaaggatgtgggacactttttatctgtcactgggaca gtgattcgaacaagtctggtgaaggttctggagtttgagcgggattacatgtgtaacaaa tgcaagcatgtgtttgtgatcaaggctgactttgagcagtattacaccttttgccggcca tcctcgtgtcccagcttggagagctgtgattcctctaaattcacttgcctctcaggcttg tcttcgtctccaaccaggtgtagagattaccaggaaatcaaaattcaggaacaggttcaa aggctatctgttggaagtattccacgatctatgaaggttattctggaagatgacttagtg gatagttgcaaatctggtgatgacctcactatttacgggattgtaatgcaacggtggaag 
ccctttcagcaagatgtgcgctgtgaagtggagatagtcctgaaagcaaattacatccaa gtaaataatgagcagtcctcagggatcatcatggatgaggaggtccaaaaggaattcgaa gatttttgggaatactataagagcgatccctttgcaggaaggaatgtaatattggctagc ttgtgccctcaagtgtttggaatgtatctagtaaagcttgctgtggccatggtgctggct ggtgggattcaaaggactgatgctacaggaacacgggtcagaggagaatctcatctttta ttggttggggatcctggcacagggaaatctcagttcctcaaatatgcagcaaagattaca ccaagatctgtgctgaccacaggaattggatctactagtgcaggaaaaagtccgtatcat ctggcattttatcaagactcagcaactggaatgttaaactaa