GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:44:53 Sequence gi568815588f:97219522_97420290 : 200769 bp : 47.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 257 252 6 -0.45 1.10 Term - 1083 878 206 1 2 80 38 123 0.342 4.03 1.09 Intr - 9704 9626 79 2 1 63 74 25 0.306 -2.28 1.08 Intr - 10353 10243 111 0 0 55 46 117 0.644 4.78 1.07 Intr - 15794 15696 99 2 0 68 100 45 0.651 4.01 1.06 Intr - 24638 24447 192 2 0 79 51 96 0.419 4.69 1.05 Intr - 36883 36797 87 1 0 79 100 8 0.587 1.27 1.04 Intr - 40107 39881 227 1 2 70 82 157 0.942 11.00 1.03 Intr - 44108 43899 210 0 0 82 105 141 0.999 14.08 1.02 Intr - 46604 46339 266 1 2 68 53 138 0.670 5.46 1.01 Init - 70863 70706 158 0 2 75 25 168 0.115 8.58 1.00 Prom - 79152 79113 40 -2.86 2.02 PlyA - 80590 80585 6 1.05 2.01 Sngl - 90437 90075 363 2 0 102 42 217 0.729 14.58 2.00 Prom - 92549 92510 40 -4.76 3.00 Prom + 97871 97910 40 -7.36 3.01 Sngl + 99933 100772 840 2 0 101 41 1038 0.956 96.15 3.02 PlyA + 102283 102288 6 1.05 4.04 PlyA - 102316 102311 6 1.05 4.03 Term - 115119 114350 770 1 2 98 48 962 0.300 86.77 4.02 Intr - 118287 118230 58 0 1 73 68 22 0.148 -2.84 4.01 Init - 123361 123278 84 1 0 53 58 127 0.641 7.02 4.00 Prom - 130794 130755 40 -6.96 5.36 PlyA - 132290 132285 6 1.05 5.35 Term - 132885 132784 102 0 0 88 42 90 0.843 2.68 5.34 Intr - 137675 137612 64 1 1 115 75 80 0.833 8.12 5.33 Intr - 139098 139016 83 0 2 43 80 157 0.784 8.84 5.32 Intr - 139489 139422 68 2 2 133 115 81 0.754 13.92 5.31 Intr - 141097 141025 73 1 1 113 100 120 0.999 14.68 5.30 Intr - 144382 144333 50 2 2 93 82 81 0.997 6.40 5.29 Intr - 146712 146587 126 1 0 55 86 160 0.968 13.15 5.28 Intr - 147100 146925 176 0 2 78 94 263 0.999 25.48 5.27 Intr - 147388 147221 168 0 0 101 94 329 0.997 33.86 5.26 Intr - 147709 147520 190 2 1 50 17 290 0.686 16.74 5.25 Intr - 147932 147803 130 2 1 68 23 65 0.741 -1.73 5.24 Intr - 150061 149904 158 2 2 69 89 237 0.945 21.63 5.23 Intr - 150753 150646 108 1 0 121 94 113 0.982 15.46 5.22 Intr - 151039 150934 106 1 1 18 100 102 0.993 4.19 5.21 Intr - 151275 151195 81 0 0 74 105 72 0.889 7.33 5.20 Intr - 151560 151402 159 0 0 82 72 270 0.990 24.98 5.19 Intr - 152645 152552 94 1 1 72 83 144 0.822 12.27 5.18 Intr - 153282 153215 68 0 2 2 92 84 0.999 -2.10 5.17 Intr - 153679 153525 155 2 2 60 99 175 0.444 15.59 5.16 Intr - 154216 154054 163 1 1 116 98 225 0.999 25.85 5.15 Intr - 154373 154309 65 0 2 83 110 75 0.738 7.64 5.14 Intr - 159893 159772 122 1 2 102 97 94 0.990 11.84 5.13 Intr - 160249 160107 143 1 2 79 80 84 0.966 5.85 5.12 Intr - 161392 161278 115 0 1 100 113 64 0.994 10.55 5.11 Intr - 161962 161865 98 1 2 70 94 69 0.936 4.51 5.10 Intr - 165736 165650 87 1 0 106 81 112 0.470 12.47 5.09 Intr - 166472 166374 99 2 0 83 89 109 0.975 10.81 5.08 Intr - 168858 168731 128 1 2 85 77 150 0.960 14.00 5.07 Intr - 169103 168968 136 2 1 110 59 125 0.948 11.94 5.06 Intr - 171018 170902 117 0 0 75 100 148 0.998 15.36 5.05 Intr - 171323 171218 106 1 1 127 40 118 0.999 11.12 5.04 Intr - 174239 174163 77 2 2 63 101 102 0.971 7.31 5.03 Intr - 176780 176697 84 2 0 80 105 81 0.975 9.02 5.02 Intr - 181013 180784 230 0 2 74 75 344 0.989 29.29 5.01 Init - 181710 181572 139 0 1 99 70 143 0.870 13.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:97219522_97420290|GENSCAN_predicted_peptide_1|544_aa MQEQWQAFSTIGSGNGSLTGSGAQQTPCQIRRDGSQQRDGSQRRVCDSGKQQCDAICSFV ICNDSSLRGQPIIFNPDFFVEKLRHEKPEIFTELVVSNITRLIDLPGTELAQLMGEVDLK LPGGAGPASGFFRSLMSLKRKDLRVEGLFRVPGNSVRQQILRDALNNGTDIDLESGEFHS NDVATLLKMFLGELPEPLLTHKHFNAHLKIADLMQFDDKGNKTNIPDKDRQIEALQLLFL ILPPPNRNLLKLLLDLLYQTAKKQDKNKMSAYNLALMFAPHVLWPKNVTANDLQENITKL NSGMAFMIKHSQKLFKDDLDLIASCHTKSFQLAKSQKRNRVDSCPHQEETQHHTEEALRE LFQHVHDMPESAKKKQLIRQFNKQSLTQTPGREPSTSQVQKRARSRSFSGLIKRKVLGNQ MMSEKKKKNPTPESVAIGELKGTSKENRNLLFSGSPAVTMTPTRLKWSEGKKEGKKAPKG QLVLLTVKKGTRSFKMKHGETLKVTCGKLGDAKKREWRGQEGHFEKQEEDQNAEYGTEHI IINH >gi568815588f:97219522_97420290|GENSCAN_predicted_CDS_1|1635_bp atgcaggaacaatggcaagcctttagcacgatcgggagtggcaatgggagtctcactgga tcaggagcacagcagacaccctgccagatccggagggatgggagtcagcagcgggatggg agtcagcggcgggtctgcgacagcggcaaacagcagtgtgatgccatctgcagttttgtg atctgcaatgattcttcccttcgaggtcagcccattatctttaatcctgacttttttgtg gagaaactccgacatgagaaacctgagattttcactgagttggtggtcagcaatatcaca aggctcatcgatttacctggaactgagttggctcagctgatgggggaagtggaccttaag ttgcctggcggggctggcccagcatcaggattcttccggtctctcatgtctctcaagcga aaggacttgcgagtagagggtttgtttagagtaccgggtaatagtgtccgacagcagatt ttaagggatgctctcaataatggaactgacattgacttggaatcaggggaatttcactca aatgatgttgccactttgctgaagatgtttctaggagagttgccggagcctctgctgaca cataaacacttcaatgcacacctcaaaatcgctgatttgatgcagtttgatgataaagga aacaagaccaatataccagacaaggaccggcaaattgaggctctccagttgctcttcctc attctccctcctcctaatcgtaatttgctgaagttattgcttgatctcctataccagaca gcaaagaaacaagacaagaacaagatgtcagcctataaccttgcccttatgtttgcaccc catgtcctgtggccaaaaaatgtcactgcaaatgaccttcaggagaatatcacaaagtta aacagtgggatggcttttatgattaaacactcccagaaactttttaaggatgaccttgac ctcatagcttcatgtcatactaagtcctttcagctggcaaagtctcagaaacggaaccgg gtagattcctgccctcaccaggaggagacccagcaccatacggaagaggcactgagagag ctgtttcaacacgttcatgatatgccagagtcagcaaagaagaaacaacttattagacag tttaataagcaatcattgacccagacaccagggcgagaaccttctacttcccaggtacaa aagagggctcgttcgcgctccttcagtgggcttattaagcggaaggtcctgggaaatcag atgatgtcagaaaagaaaaagaagaaccctactccagaatctgtggccattggtgaattg aagggaaccagcaaagaaaataggaacttattattttctggctctccagctgtcacgatg acaccaacaagattgaagtggtctgaagggaagaaagaggggaaaaaagcccccaagggc cagcttgtattactcactgtgaagaaaggaaccaggtccttcaaaatgaagcatggagag actcttaaagttacatgtgggaaactgggtgatgccaagaaaagggagtggcgggggcaa gagggacattttgaaaaacaggaagaggaccaaaatgctgagtacggtactgagcacata atcattaatcactga >gi568815588f:97219522_97420290|GENSCAN_predicted_peptide_2|120_aa MPPKFDPNKIKVVYLRCTGDEVGATSSLAPKISPLVLSLKKVGDAIAKATGDWKGLGITV KLNIQNRRAQIEVVPSASALIIKVLKEPPRDRRNRNTLNTVGISLLMRSSTLFDRGVTNL >gi568815588f:97219522_97420290|GENSCAN_predicted_CDS_2|363_bp atgccgccgaagttcgaccccaacaagattaaagtcgtatacctgaggtgcaccggggat gaagttggtgccacttcttcactggcccccaagatcagccccctggttctgtctctaaaa aaggttggtgatgccattgccaaggcaacaggtgactggaaaggcctggggattacagtg aaactgaacattcagaacagacgggcccagattgaggtggtgccttctgcctctgccctg atcatcaaagttctcaaggaaccaccaagagacagaagaaacagaaacacattaaacaca gtgggaatatcacttttgatgagatcatcaacattgtttgacagaggcgtcaccaatctt tag >gi568815588f:97219522_97420290|GENSCAN_predicted_peptide_3|279_aa MPCRREEEEEAGEEAEGEEEEEDSFLLLQQSVALGSSGEVDRLVAQIGETLQLDAAQHSP ASPCGPPGAPLRAPGPLAAAVPADKARSPAVPLLLPPALAETVGPAPPGVLRCALGDRGR VRGRAAPYCVAELATGPSALSPLPPQADLDGPPGAGKQGIPQPLSGPCRRGWLRGAAASR RLQQRRGSQPETRTGDDDPHRLLQQLVLSGNLIKEAVRRLHSRRLQLRAKLPQRPLLGPL SAPVHEPPSPRSPRAACSDPGASGRAQLRTGDGVLVPGS >gi568815588f:97219522_97420290|GENSCAN_predicted_CDS_3|840_bp atgccgtgccggagggaggaggaagaggaagccggcgaggaggcggagggggaggaagag gaggaggacagcttcctcctactgcagcagtcagtggcgctgggcagctcgggcgaggtg gaccggctggtggcccagatcggcgagacgctgcagctggacgcggcgcagcacagcccg gcctcgccgtgcgggcccccgggggcgccgctgcgggccccggggcccctggctgcggcg gtgccggcggacaaggccaggtccccggcggtgccgctgctgctgccgcccgcgttggcg gagactgtgggcccggcgccccctggggtcctgcgctgcgccctgggggaccgcggccgc gtgcggggccgcgctgcgccctactgcgtggccgagctcgccacaggccccagcgcgctg tccccactgccccctcaggccgaccttgatgggcctccgggagctggcaagcagggcatc ccgcagccgctgtcgggtccgtgccggcgaggatggctccggggcgccgccgcctcccgc cgcctgcagcagcgacgcgggtcccaaccagaaacccgcacaggcgacgacgacccgcac cggcttctgcagcagctagtgctctctggaaacctcatcaaggaggccgtgcgaaggctt cattcgcgacggctgcagttacgtgcaaagcttccccaacgcccgctcctgggacctctg tcggccccggtgcatgaacccccttcgcctcgcagccctcgcgcggcctgcagtgaccct ggcgcctccgggagggcgcagctcagaactggcgacggcgttcttgtgcctggcagctaa >gi568815588f:97219522_97420290|GENSCAN_predicted_peptide_4|303_aa MRSCDPSVDEISAGSEEILGVYLQESEMPGIIIQKGGKHYHFNSTEIPPRPRATKPPAKP AQLRVPGRGAMPCRREEEEEAGEEAEGEEEEDDSFLLLQQSVTLGSSGEVDRLVAQIGET LQLDAAQDSPASPCAPPGVPLRAPGPLAAAVPADKARPPAVPLLLPPASAETVGPAPSGA LRCALGDRGRVRGRAAPYCVAEVAAGPSALPGPCRRGWLRDAVTSRRLQQRRWTQAGARA GDDDPHRLLQQLVLSGNLIKEAVRRLQRAVAAVAATGPASAPGPGGGRSGPDRIALQPSG SLL >gi568815588f:97219522_97420290|GENSCAN_predicted_CDS_4|912_bp atgaggtcatgtgatccctctgtggatgagatcagcgctggctctgaggagatccttggt gtctacctgcaggaatctgaaatgccaggcatcatcatacagaaaggaggaaaacattac cattttaactctactgaaatccctccgcgcccgcgggcaaccaagcccccagcgaagccc gcacagctccgggtgccaggacggggggccatgccgtgccggagggaggaggaagaggaa gccggcgaggaggcggagggggaggaagaggaggacgacagcttcctcctgctgcagcag tcggtgacgctgggcagctcgggcgaggtggaccggctggtggcccagatcggcgagacg ctgcagctggacgcggcgcaggacagcccggcctcgccgtgcgcgcccccgggggtgccg ctgcgggccccggggcccctggctgcggcggtgccggcggacaaggcccggcccccggcg gtgccgctgctgctgccgcccgcttcggctgagacggtgggcccggcgccctctggggcc ctgcgctgcgccctaggggaccgcggccgcgtgcgcggacgcgctgcgccctactgcgtg gcggaggtcgccgcaggccccagcgcgctgccggggccgtgccggcgaggatggctcagg gacgcggtcacctcccgccgcttgcagcagcgccgatggacccaagccggggcacgcgcc ggcgacgacgacccgcatcggctcctccagcagctcgtgctctcgggaaacctcatcaag gaagccgtgcggagactccaacgagccgtcgccgcggttgcagccacgggccccgcaagc gcccctgggcccgggggaggccgcagcggacctgaccgcattgccctgcagccctcaggc tccttgctctga >gi568815588f:97219522_97420290|GENSCAN_predicted_peptide_5|1355_aa MGRSGKLPSGVSAKLKRWKKGHSSDSNPAICRHRQAARSRFFSRPSGRSDLTVDAVKLHN ELQSGSLRLGKSEAPETPMEEEAELVLTEKSSGTFLSGLSDCTNVTFSKVQRFWESNSAA HKEICAVLAAVTEVIRSQGGKETETEYFAALMTTMEAVESPESLAAVAYLLNLVLKRVPS PVLIKKFSDTSKAFMDIMSAQASSGSTSVLRWVLSCLATLLRKQDLEAWGYPVTLQVYHG LLSFTVHPKPKIRKAAQHGVCSVLKGSEFMFEKAPAHHPAAISTAKFCIQEIEKSGGSKE ATTTLHMLTLLKDLLPCFPEGLVKSCSETLLRVMTLSHVLVTACAMQAFHSLFHARPGLS TLSAELNAQIITALYDYVPSENDLQPLLAWLKVMEKAHINLEILKECVAPHMADIGSVTS SASGPAQSVAKMFRAVEEGLTYKFHAAWSSVLQLLCVFFEACGRQAHPVMRKCLQSLCDL RLSPHFPHTAALDQAVGAAVTSMGPEVVLQAVPLEIDGSEETLDFPRSWLLPVIRDHVQE TRLGFFTTYFLPLANTLKSKAMDLAQAGSTVESKIYDTLQWQMWTLLPGFCTRPTDVAIS FKGLARTLGMAISERPDLRVTVCQALRTLITKGCQAEADRAEVSRFAKNFLPILFNLYGQ PVAAGDTPAPRRAVLETIRTYLTITDTQLVNSLLEKASEKVLDPASSDFTRLSVLDLVVA LAPCADEAAISKLYSTIRPYLESKAHGVQKKAYRVLEEVCASPQGPGALFVQSHLEDLKK TLLDSLRSTSSPAKRPRLKCLLHIVRKLSAEHKEFITALIPEVILCTKEVSVGARKNAFA LLVEMGHAFLRFGSNQEEALQCYLVLIYPGLVGAVTMVSCSILALTHLLFEFKGLMGTST VEQLLENVCLLLASRTRDVVKSALGFIKVAVTVMDVAHLAKHVQLVGVIACISGVVQGFK ERIYVKYQVLLRRLKLSSCEGAQACVLKEAGLDHAASLGWTAVKGAQHGRLASLKAWCSV RQMEAIGKLSDDMRRHFRMKLRNLFTKFIRKFGFELVKRLLPEEYHRVLVNIRKAEARAK RHRALSQAAVEEEEEEEEEEEPAQGKGDSIEEILADSEDEEDNEEEERSRGKEQRKLARQ RSRAWLKEGGGDEPLNFLDPKVAQRVLATQPGPGRGRKKDHGFKVSADGRLIIREEADGN KMEEEEGAKGEDEEMADPMEDVIIRNKKHQKLKHQKEAEEEELEIPPQYQAGGSGIHRPV AKKAMPGAEYKAKKAKGDVKKKGRPDPYAYIPLNRSKLNRRKKMKLQGQFKGLVKAARRG SQEFNAITRCQTVCKGVFRKCRDDYSMFYFRYTLL >gi568815588f:97219522_97420290|GENSCAN_predicted_CDS_5|4068_bp atgggtcgctcgggaaagttgccttctggtgtctcagctaagttgaagcgctggaagaaa ggccacagcagcgacagcaaccccgccatctgccgccaccgtcaggccgcccgcagccgc ttcttcagccggccgtcaggaaggagtgacctgacagtcgatgctgtgaagttacataat gagctgcagtcagggtccttgcgcttgggcaaaagcgaagccccggagacgcccatggaa gaagaggcggagctggttctcaccgagaagtcctcgggtaccttcctgagtggcctttcc gactgcacaaacgtcaccttcagcaaagtacagcgcttctgggagtccaactcggctgcc cacaaggagatctgtgctgttctggctgctgtcactgaggtgattcgctcccagggaggg aaggagacggagactgagtacttcgctgctctgatgacaacaatggaagcagtggagtcc ccggagtccctggccgccgttgcttacctgctgaaccttgtcctgaagcgtgttcccagc cctgtgcttattaagaagttctctgatacctccaaagccttcatggatatcatgtcagct caggccagcagcggctccacctctgtcctccgatgggtcctttcctgcctggccaccctt ctgcggaagcaagacctggaggcctggggctaccccgtgacccttcaggtgtaccatggg ctgctgagcttcacggtgcatcccaagcccaagatccggaaggctgcccagcatggagta tgctcagtcctcaagggcagtgaattcatgtttgaaaaggcccctgcccatcatcctgct gccatttccactgccaagttctgcatccaggagattgagaagtctggaggctccaaggag gccaccaccacgctgcacatgctgacgctgctgaaggacctgctgccctgcttcccggaa ggcctggtgaagagctgcagtgagactctcctcagggtcatgaccttgagccatgtgctg gtgacagcctgtgccatgcaggcctttcacagcctcttccacgccaggcctggcctgagc accctgtcagcagagctcaacgcccagatcatcacggccctgtacgactatgttcccagt gagaatgatttacaacccctgctagcctggcttaaggtcatggagaaagcccacatcaac ctggagatcctgaaggaatgcgtggctccccacatggctgacattggctccgtgacctcc tcggcctcaggccctgcccaatctgttgccaagatgttcagggcagtggaggagggcctg acgtacaaattccatgcggcctggagctccgtgttgcagctgctgtgtgtcttcttcgag gcgtgtgggagacaggcccaccctgtgatgaggaagtgcctccagtccctgtgtgacctg cgcctctcccctcatttcccccacacggcggctcttgaccaggcagtgggggctgcggtg accagtatgggacctgaggtggtgctgcaggctgtgcctttggaaattgatggctctgag gagactctggatttcccacggagctggctgctgcctgtcatccgagaccatgttcaggaa acgcgacttggttttttcaccacctacttcttgcccctggctaacaccctgaagagcaaa gccatggacctggctcaggcaggcagcacagtggaatctaagatctacgacacactccag tggcagatgtggacactcctgcctgggttctgcacaaggcctacagatgtggccatctcc ttcaaagggctggcacggacgctgggcatggccatcagcgagcgtccagacctgagggtc accgtgtgccaggccctgcgcaccctcatcaccaagggctgccaggcagaggctgaccgt gctgaagtgagtcgctttgccaagaactttctgccgatcctcttcaacctgtatgggcag cccgtggcagccggggacactccagcccctcgccgggctgtgctggaaaccatcagaact tacctcaccatcactgacactcagttggtgaacagtctcctggaaaaagccagtgagaag gtgctcgaccctgccagctctgactttaccagattgtctgtcctggacctggtcgtggcc ttggctccgtgtgctgacgaagctgccatcagtaagctatactccaccatccggccctac ctagagagcaaggcccacggggtgcagaagaaggcctaccgagtgctggaggaggtgtgt gccagtcctcagggccccggggccctcttcgtgcagagccacctggaggacctgaagaag acactgctggactcgctgcggagcacctcctcacccgccaagaggccccgtttgaagtgc ctcctacacatcgtgaggaagctctcagctgaacacaaggagttcatcactgccctcatc ccagaggtgatcctgtgcaccaaggaggtgtcggtgggcgcacggaagaacgcttttgca ctgctcgtggagatgggccatgctttcctaaggtttggctcgaaccaggaagaggccctg cagtgctacctcgtcctgatctaccctggcctggtgggcgcggtgaccatggtcagctgc agcatcctggccctgacccacctccttttcgagtttaaaggtctgatggggaccagtaca gtggagcagctgctggagaatgtgtgcctgcttctggcctcccgcacccgtgacgtggtc aagtctgcactgggcttcatcaaggtggcagtgactgtcatggacgtggcgcacctggcc aaacatgtgcagctggtgggggtgatagcatgtatctcaggggtagtccaaggattcaag gagaggatatatgtcaagtaccaagtgctgttgagaagactaaagctctcatcctgcgag ggtgctcaggcctgtgtcctgaaggaggctgggcttgatcatgctgcgtccctgggctgg acagcagtaaagggggcgcagcatggaaggctcgcctccctgaaagcctggtgctctgtc cggcagatggaagccattgggaagctttcagatgacatgcggcggcacttccgcatgaag cttcggaacctgttcaccaagttcatccgcaagtttggatttgagctggtgaaaaggctg ttgcccgaggagtaccacagagtcctggtcaacatccggaaagctgaggcccgggccaag aggcaccgagccctgagccaggctgccgtggaggaggaagaagaggaggaggaggaggag gagcccgcccagggcaaaggtgacagcattgaggagattttagctgactcagaggacgag gaggacaatgaggaggaggaaagaagccgaggcaaggagcagcggaagctggcacgacag aggagccgggcatggctgaaagagggcggtggggacgagcccctcaacttcctggatccc aaggtggcccaacgagtcctggccacgcagccagggccaggccggggcaggaagaaggac cacggcttcaaggtgagcgccgatggccggctgatcataagggaggaggcagacggcaac aagatggaggaagaggaaggtgccaaaggcgaagatgaagagatggctgacccaatggaa gatgtgatcatcaggaataaaaagcaccagaagctcaagcaccagaaagaggctgaggag gaggagctggagataccccctcagtaccaagctggaggctctggcattcatcgccctgtg gccaagaaggctatgcctggggctgaatacaaggccaagaaagcaaaaggtgatgtgaag aagaaaggccggccggatccctatgcctacatccccctcaacagaagcaagctcaaccgc aggaagaagatgaagctgcagggacagttcaaaggcctggtgaaggctgcccggcgaggt tcccaggaatttaatgccatcactagatgccagaccgtgtgcaaaggagtatttagaaaa tgccgtgatgactactctatgttctattttcgatatacattattgtga