GENSCAN 1.0	Date run: 4-Aug-121	Time: 20:39:42

Sequence gi568815595f:98668441_98893758 : 225318 bp : 38.99% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  14805  15491  687  2  0   34   43   384 0.627  24.44
 1.02 PlyA +  16349  16354    6                               1.05

 2.00 Prom +  19095  19134   40                              -4.85
 2.01 Init +  21968  21975    8  1  2  114   68     0 0.326   1.15
 2.02 Term +  30880  31006  127  1  1   83   36   123 0.905   3.37
 2.03 PlyA +  31070  31075    6                               1.05

 3.00 Prom +  41469  41508   40                              -4.95
 3.01 Init +  45622  45713   92  0  2   29   52   138 0.022   4.32
 3.02 Intr +  64433  64571  139  2  1   47   76   103 0.214   4.55
 3.03 Intr +  64662  65151  490  2  1   87  113   165 0.197  10.55
 3.04 Intr + 101885 102058  174  0  0    9   84   116 0.051   2.29
 3.05 Intr + 105480 105543   64  1  1   89   89    67 0.202   3.76
 3.06 Intr + 112739 112829   91  2  1   73   51    47 0.197  -1.42
 3.07 Intr + 114145 114569  425  0  2   88  -26   324 0.029  12.74
 3.08 Intr + 116505 116600   96  0  0   77   94    65 0.155   4.21
 3.09 Intr + 119596 119782  187  1  1   74   92   109 0.505   8.67
 3.10 Intr + 119886 120023  138  2  0   89   91    81 0.996   8.24
 3.11 Intr + 123401 123553  153  1  0   92   85    66 0.952   6.05
 3.12 Term + 128005 128037   33  0  0  120   43    25 0.258  -2.19
 3.13 PlyA + 128187 128192    6                               1.05

 4.10 PlyA - 129359 129354    6                               1.05
 4.09 Term - 131401 130932  470  2  2   82   49   430 0.998  32.65
 4.08 Intr - 132276 132139  138  1  0   -2   93   160 0.209   7.01
 4.07 Intr - 133209 133160   50  2  2  106   93    47 0.047   4.41
 4.06 Intr - 135932 135701  232  0  1   46   72    82 0.001  -1.69
 4.05 Intr - 143114 143028   87  0  0   49   91    82 0.874   3.62
 4.04 Intr - 144042 143892  151  0  1   50   98   130 0.982   9.01
 4.03 Intr - 149453 149329  125  0  2   75   93   132 0.474  11.78
 4.02 Intr - 150977 150762  216  0  0   90   42   151 0.424   8.25
 4.01 Init - 164937 164709  229  0  1   59   37   149 0.058   5.48
 4.00 Prom - 175225 175186   40                              -4.65

 5.00 Prom + 188124 188163   40                              -2.85
 5.01 Init + 188481 188574   94  2  1   30   73    75 0.641   0.80
 5.02 Intr + 188684 188848  165  0  0   27   44   133 0.465   1.81
 5.03 Term + 189415 190058  644  2  2   51   42   296 0.617  14.64
 5.04 PlyA + 190060 190065    6                               1.05

 6.00 Prom + 192740 192779   40                              -6.15
 6.01 Init + 193346 193847  502  1  1   62   72   196 0.752  10.63
 6.02 Term + 194199 194704  506  1  2   36   36   154 0.388  -1.38
 6.03 PlyA + 195188 195193    6                               1.05

 7.04 PlyA - 195975 195970    6                               1.05
 7.03 Term - 198795 198086  710  1  2   75   47   159 0.077   3.18
 7.02 Intr - 213327 213100  228  1  0   84   63   183 0.966  12.32
 7.01 Intr - 224166 224057  110  2  2   93   96    14 0.057   1.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 114145 114606  462  0  0   88   42   344 0.939  23.87
S.002 Sngl - 136572 136150  423  0  0   73   48   194 0.948   9.94
S.003 Sngl - 198868 198086  783  1  0   82   47   188 0.899   9.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_1|228_aa
MDQAEEKMSEIEDQLNEMKCEDKIREKRMKSNNQNLQDIWDYVKRPNLRFTGVPETDGEN
GIKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRRVTPRHIIFIFTKVEMKEKM
LIAAREKDQVTHRGKPIRLTADLSAETLQARREWGPIFNILKEKNFQHRISYPAKLSFIS
KGEIKSFTDKQMVRDFVTTRPTLQELLKEALNMERKNQYQPLQKHIKL
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_1|687_bp
atggatcaagcagaagaaaagatgtcagagattgaagatcaacttaatgagatgaagtgt
gaagacaagattagagaaaaaagaatgaaaagcaacaaccaaaacctacaagatatatgg
gactatgtgaaaagaccaaacctacgttttactggtgtgcctgaaactgacggggagaat
ggaatcaagttggaaaacactcttcaggatattatccaggagaacttccccaacctagca
agacaggccaacattcaaattcaggaaatacagagaacaccacaaagatactcctcgaga
agagtaaccccaagacacataatcttcatattcaccaaggttgaaatgaaggaaaaaatg
ttaatagcagccagagagaaagatcaggttacccacagagggaagcccatcagactaaca
gcagatctctctgcagaaaccctacaagccagaagagagtgggggccaatattcaacatt
cttaaagagaagaattttcaacacagaatttcatatccagccaaactaagcttcataagc
aaaggagaaataaaatcctttacagacaagcaaatggtgagagattttgtcaccaccagg
cctaccttgcaagagctcctgaaggaagcactaaatatggaaaggaaaaaccagtaccag
ccactgcaaaaacatatcaaattgtaa
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_2|44_aa
MPREYLLDDKRCVSQPPLSPSQQLINLQEQSCQSKFYLLRGTLN
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_2|135_bp
atgcccagggaatacctgctggatgacaagagatgtgtgtcccaaccaccgctgtcaccc
agccaacagctgatcaaccttcaggagcagagctgtcagagcaagttttacctgctcaga
ggcactctgaactaa
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_3|693_aa
MSVGSGGVVAVLSCLVPGPKMIRAGEYRAGERRERGSSRLEQRPLRSRSPVRLCGRRSWA
SAGHSCRPASLRVCTARPLRLRVGGGDTERCLPLHISCQHQGGGVRRRKGTMGAGTREPA
GVRRSHRREGTRGSWSPSEGREGRAASRVSPGAVGAERICSGTLFCRIAGEVVQLGSQEE
CPAPRSLLFSCKRVCTRGGLRGLEGSGLLPYKVFRETLFPSTMELDLTKQAPRQFRRKED
GAHKGLTYAKNREELEMTLNSVICRVPRIPRQGSMEPVQVRCFCGVRCRCASADVAGGAE
YFRLALSKLQSCDLFDEFDKDMDEAGNYHSQQTNTGTENQTLHVLTHKWEHCPWGHPNQY
SNPAGLSSENISNFNNTLRYHIAGISKTAASREEANGVSIDHVQPPSGLIINKESEVYKM
LQEKQELNEPLKQSTSFLILQEILESEIKGDLNNPQDSEVLLSSVYYSVAASIGNAQKVP
MCDKCGPGIVGIIPCKKCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRR
TTFRLFYPESVFSDPIHNDPNTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLI
YKPYQIRILDPFIIRTAAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFK
YNFSDLKSPLHYYGNATMSLMNKVTMQSQGKGV
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_3|2082_bp
atgtcagtgggctctggaggtgtagtggccgtcctgagttgcctggtacctggccccaag
atgattagagcaggggaatatcgggcaggagagcgccgcgagagaggcagcagccggctg
gagcagcggcccctcaggtctcggagcccggtgcgcctctgcggtcgtcgctcctgggcc
tcggcgggtcactcttgccggccggcttcgctgcgggtttgcactgcccggcctctgagg
ctcagggttgggggtggggacacggagcgctgtttgcccctccacatatcctgccagcac
caaggaggtggcgtgcggaggcgtaaagggaccatgggcgctgggacccgcgagccagcc
ggcgtgcgccgcagccaccgccgagagggcacccggggatcctggagccccagtgaaggg
cgggagggacgcgcagcctcgcgggtaagcccaggagccgtgggggccgagagaatctgc
tctgggaccctcttttgtcgaatcgccggcgaggttgtgcaattgggaagccaagaggag
tgtccagctcctcgttcactattgttttcttgtaaaagggtctgtacgcgaggagggttg
aggggtctggaaggttctggtctcttaccgtataaagtttttcgagaaaccctttttccc
tccacaatggaactagacctaacaaaacaagctccgaggcagtttaggagaaaggaggac
ggagcacacaagggcctaacttatgccaagaatagagaagagttagaaatgactctcaat
agtgtgatttgtagagtcccccggatcccaaggcaagggtctatggaacctgttcaggtc
aggtgtttctgtggtgttcgctgccggtgtgcgtcagcagatgtggcaggaggagcggaa
tattttcgacttgctctttcaaaactgcagagttgtgatctctttgatgagtttgacaag
gacatggatgaagctggaaactatcattctcagcaaactaacacaggaacagaaaaccaa
acactgcatgttctcactcataagtgggagcactgtccctggggtcatccaaaccagtac
agtaacccagctggcctctcttctgaaaacatctccaacttcaacaatactttgagatac
catattgctggtatttcaaaaactgctgccagcagggaggaggccaatggcgtgtccata
gatcatgttcagcctccaagtggtctcataatcaacaaagaatctgaagtttacaagatg
cttcaggagaaacaggagttgaatgaacccctgaaacagtctacctctttcctgattttg
caggaaatcctggagtctgagataaaaggggatctcaacaaccctcaggattcagaagtg
ttactcagctcagtctattactcagtggctgcatcaattggaaatgctcagaaggtgccc
atgtgtgacaaatgtggtcctggcattgtaggcatcataccctgtaaaaagtgtgtggtg
gttggtaatggaggagttttgaagaataagacattaggagaaaaaatcgactcctatgat
gtaataataagaatgaataatggtcctgttttaggacatgaagaagaagttgggagaagg
acaaccttccgacttttttatccagaatctgttttttcagatcctattcacaatgaccct
aatacgacagtgattctcactgcttttaagccacatgatttaaggtggctgttggaattg
ttgatgggtgacaaaataaacactaatggtttttggaagaaaccagccttaaacctgatt
tataaaccttatcaaatccgaatattagatcctttcattatcagaacagcagcttatgaa
ctgcttcattttccaaaagtgtttcccaaaaatcagaaacctaaacacccaacaacagga
attattgccatcacattggcgttttacatatgtcacgaagttcacctagctggttttaaa
tacaacttttctgacctcaagagtcctttgcactactatgggaatgccaccatgtctttg
atgaataaggtgacaatgcagagccagggcaaaggtgtttaa
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_4|565_aa
MAKPNSRKLRINNKTIQELADKIASAKKNLTDLIELKNTLQEFHNPIASINSRIDHAEER
ILELEDWLSEIRESDKRCYGTLGMESGVIADPQITASSVLEWTDHTGQENSWKPKKARLK
KPGPPWAAFATDEYQWLQIDLNKEKKITGIITTGSTMVEHNYYVSAYRILYSDDGQKWTV
YREPGVEQDKIFQGNKDYHQDVRNNFLPPIIARFIRVNPTQWQQKIAMKMELLGCQFIPK
GRPPKLTQPPPPRNSNDLKNTTAPPKIAKEIQTTIREYYKHLYANKLENLEEMDKFLDTY
TLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFIAEFYQRKKKTEGTYDLPYW
DRAGWWKGMKQFLPAKAVDHEETPVRYSSSEVNHLSPREVTTVLQADSAEYAQPLVGGIV
GTLHQRSTFKPEEGKEAGYADLDPYNSPGQEVYHAYAEPLPITGPEYATPIIMDMSGHPT
TSVGQPSTSTFKATGNQPPPLVGTYNTLLSRTDSCSSAQAQYDTPKAGKPGLPAPDELVY
QVPQSTQEVSGAGRDGECDVFKEIL
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_4|1698_bp
atggcaaaacccaattcaaggaaactaagaattaacaataaaacaatacaggagctggca
gataaaatagccagtgcaaaaaagaatctaactgacctgatagagctgaaaaacacacta
caagaatttcacaacccaattgcaagtattaacagcagaatagaccacgctgaggaaaga
atcttggaacttgaagactggctctctgaaataagagagtcagacaaaagatgttatgga
acactggggatggagtctggtgtgatcgcggatcctcaaataacagcatcatctgtgctg
gagtggactgaccacacagggcaagagaacagttggaaacccaaaaaagccaggctgaaa
aaacctggaccgccttgggctgcttttgccactgatgaataccagtggttacaaatagat
ttgaataaggaaaagaaaataacaggcattataaccactggatccaccatggtggagcac
aattactatgtgtctgcctacagaatcctgtacagtgatgatgggcagaaatggactgtg
tacagagagcctggtgtggagcaagataagatatttcaaggaaacaaagattatcaccag
gatgtgcgtaataactttttgccaccaattattgcacgttttattagagtgaatcctacc
caatggcagcagaaaattgccatgaaaatggagctgctcggatgtcagtttattcctaaa
ggtcgtcctccaaaacttactcaacctccacctcctcggaacagcaatgacctcaaaaac
actacagcccctccaaaaatagccaaagaaatacaaactaccatcagagaatactacaaa
cacctctatgcaaataaactagaaaatctagaagaaatggataaattcctggacacatac
accctcccaagactaaaccaggaagaagttgaatccctgaatagaccaataacaggctct
gaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagacggattc
atagccgaattctaccagagaaagaaaaaaactgaaggcacctatgacttaccttactgg
gaccgggcaggttggtggaaaggaatgaagcagtttcttcctgcaaaagcagtggaccat
gaggaaaccccagttcgctatagcagcagcgaagttaatcacctgagtccaagagaagtc
accacagtgctgcaggctgactctgcagagtatgctcagccactggtaggaggaattgtt
ggtacacttcatcaaagatctacctttaaaccagaagaaggaaaagaagcaggctatgca
gacctagatccttacaactcaccagggcaggaagtttatcatgcctatgctgaaccactc
ccaattacggggcctgagtatgcaaccccaatcatcatggacatgtcagggcaccccaca
acttcagttggtcagccctccacatccactttcaaggctacggggaaccaacctccccca
ctagtgggaacttacaatacacttctctccaggactgacagctgctcctcagcccaggcc
cagtatgataccccgaaagctgggaagccaggtctacctgccccagacgaattggtgtac
caggtgccacagagcacacaagaagtatcaggagcaggaagggatggggaatgtgatgtt
tttaaagaaatcctttga
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_5|300_aa
MAKKIQLTYKCVQNWWVLGLTDFKNEAADPRRVKLQTFVVSVTALKAARLELFIPPGGFV
VSLASGVKLQTFAVSVTAHKGSVDPKTRHKGSPSPHQTQEPSWLHPVDPAPGLQVELPAS
PAPCARTPQPLGGQWDWVPWSRGRCSSGRLGPRSSPRWGAGSGMAGCRSRALPHGEAAKA
QRKVTAAAGPGAKHLTAWGWQGQLATPSVGPAEPTHTQNSHWPASAVCSPSSRLRLSLHT
YPQAEGAGSGLGQPRKGLPQCSSRLKGSSSAAKVGAQAEEVPRASEACEGCQHAVTSHKY
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_5|903_bp
atggcaaagaagattcaactgacatataagtgtgtccagaattggtgggttcttggtctc
actgacttcaagaacgaagccgcagaccctcgcagagtgaagctgcagaccttcgtggtg
agtgttacagctcttaaggcggcacgtctggagttgttcattcctcctggtgggttcgtg
gtctcgctggcttcaggagtgaagctgcagacctttgcggtgagtgttacagctcataaa
ggcagtgtggacccaaagactagacataaaggttctccaagtccccaccagactcaggag
cccagctggctgcacccagtggatcccgcaccggggctgcaggtggagctgcctgccagt
cccgcgccgtgcgcccgcactcctcagcccttgggtggtcaatgggactgggtgccatgg
agcagggggcggtgctcgtcggggaggcttgggccacgcagcagcccacggtggggggca
ggctcaggcatggcgggctgcaggtcccgagccctgccccacggggaggcagctaaggcc
cagcgaaaagtcacagcagctgctggcccaggtgctaagcacctcactgcctggggctgg
cagggccagctggccactccgagtgtggggcctgccgagcccacacacacccagaactcg
cactggcccgcaagcgccgtgtgcagccccagttcccgcctgcgcctctccctccacacc
tacccgcaagctgagggagccggctccggccttggccagcccaggaaggggctcccacag
tgcagcagcaggctgaagggctcctcaagtgctgccaaagtgggagcccaggcagaggag
gtgccgagagcgagcgaggcctgcgagggctgccagcacgctgtcacctctcataagtat
taa
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_6|335_aa
MDTLTSQLKELEKQEQTHSKASRRQEITKIREELKEIGTQKNLQKINESRRWFFEKINKI
DRLLARLIKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKQENLEEMDKFLD
MHTLPRLNQEEVESLNRPITGSEIEAIIAYQPKKVQDQTDSQPNSTRDAEKGFDKIQQPF
ILKTLNKSCIDETCLKIIRVIYDKPTANITLNGQKLEAFPLKTGAKQGCPLSLLLFHIVL
EVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIIYAQNLLKLISNFSKVSGYK
INVQKSQAFLYTNNRQTESQIMSELPFTTASKRIK
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_6|1008_bp
atggacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacattcaaaa
gctagcagaaggcaagaaataactaagatcagagaagaactgaaggagatagggacacaa
aaaaaccttcaaaaaatcaatgaatccaggagatggttttttgaaaagatcaacaaaatt
gatagattgctagcaagactaataaagaaaagagagaagaatcaaatagacgcaataaaa
aatgataaaggggatatcaccaccgatcccacagagatacaaactaccatcagagaatac
tacaaacacctctacgcaaataaacaagaaaatctagaagaaatggataagttcctggac
atgcacaccctcccaagactaaaccaggaagaagttgaatccctgaacagaccaataaca
ggctctgaaattgaggcaataatagcctaccaaccaaaaaaagtccaggaccagacagat
tcacagccaaattctaccagagatgcagaaaaggggtttgacaaaattcaacagcctttt
atcctaaaaactctcaataaatcatgtattgatgagacgtgtctcaaaataataagagtt
atttatgacaaacccacagccaatatcacactgaatgggcaaaaactggaagcattccct
ttgaaaactggtgcaaaacagggatgccctctctcactactcctatttcatatagtattg
gaagttctggccagggcaatcaggcaggagaaagaaataaagggtattcagttaggaaaa
gaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatc
atctacgcccaaaatcttcttaagctgataagcaacttcagcaaagtctcaggatacaaa
atcaatgtacaaaaatcacaggcattcttatataccaataacagacaaacagagagccaa
atcatgagtgaactcccattcacaactgcttcaaagagaataaaataa
>gi568815595f:98668441_98893758|GENSCAN_predicted_peptide_7|349_aa
XLAFSLWKESWLLAIPKAYGFNTEKFCIFPIVQGPGKGDGCGHTVLGPESGTLTSINYPQ
TYPNSTVCEWEIRVKMGERVRIKFGDFDIEDSDSCHFNYLRIYNGIGVSRTEIELEKTTL
KFIWNQKRAGIAKSILSQKNKAGGIRLPDFKLYYKATVTKTAWYWDQNRDIDQWNRTEPS
EIIPHIYNHLIFDKPDKNKKWGKDSLFNKWFWENWLAICKKLKLDPFLTPYAKTNSRWIK
DLNVRPKTIKTLEENLGSTIQDIGMGKDFMSETPKAMATKAKIDKWDLIKLKSFCTAKET
TIRVNRQPTEWEKIFAIYSSDKGLILRIYKELKQIYKKKTTPSTSGQMI
>gi568815595f:98668441_98893758|GENSCAN_predicted_CDS_7|1050_bp
nntctggctttctctttgtggaaggaatcatggctgctggcaatcccgaaggcttacggt
tttaacacggagaaattctgtattttccctattgtacaaggtccaggaaagggtgatgga
tgtggacacactgtactaggccctgagagtggaacccttacatccataaactacccacag
acctatcccaacagcactgtttgtgaatgggagatccgtgtaaagatgggagagagagtt
cgcatcaaatttggtgactttgacattgaagattctgattcttgtcactttaattacttg
agaatttataatggaattggagtcagcagaactgaaatagaattggaaaaaactacttta
aagttcatatggaaccaaaaaagagctggcattgccaagtcaatcctaagccaaaagaac
aaagcgggaggcatcaggctacctgacttcaaactatactacaaggctacagtaaccaaa
acagcatggtactgggaccaaaacagagatatagaccaatggaacagaacagagccctca
gaaataataccacacatctacaaccatctgatctttgacaaacctgacaaaaacaagaaa
tggggaaaggattccctatttaataaatggttctgggaaaactggctagccatatgtaaa
aagctgaaactggatcccttccttacaccttatgcaaaaactaattcaagatggattaaa
gacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcagtaccatt
caggacataggcatgggcaaggacttcatgtctgaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaact
accatcagagtgaacaggcaacctacagaatgggagaaaatttttgcaatctactcatca
gacaaagggctaatattgagaatctacaaagaactcaaacaaatttacaagaaaaaaaca
accccatcaacaagtgggcaaatgatatga