GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:02:17 Sequence gi568815589r:93852167_94052948 : 200782 bp : 47.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9621 9730 110 1 2 85 105 53 0.774 6.70 1.02 Intr + 14152 14230 79 0 1 67 77 11 0.355 -2.98 1.03 Intr + 18565 18609 45 2 0 119 92 32 0.801 5.08 1.04 Term + 18874 18971 98 2 2 82 43 61 0.273 -0.87 1.05 PlyA + 20951 20956 6 1.05 2.00 Prom + 23613 23652 40 -5.86 2.01 Init + 27998 28214 217 1 1 100 44 149 0.396 8.84 2.02 Term + 37426 37505 80 2 2 74 50 69 0.152 -0.37 2.03 PlyA + 37846 37851 6 1.05 3.05 PlyA - 41247 41242 6 1.05 3.04 Term - 46835 46372 464 0 2 84 54 194 0.325 10.82 3.03 Intr - 49087 48994 94 1 1 76 81 40 0.254 1.64 3.02 Intr - 50749 50690 60 1 0 108 35 53 0.098 0.93 3.01 Init - 57462 57313 150 0 0 81 108 63 0.706 5.67 3.00 Prom - 59690 59651 40 -2.16 4.00 Prom + 60929 60968 40 -6.36 4.01 Init + 66580 66591 12 0 0 37 97 15 0.243 -2.55 4.02 Intr + 68092 68196 105 0 0 62 47 149 0.986 8.61 4.03 Intr + 68878 68994 117 0 0 119 41 118 0.987 10.86 4.04 Intr + 71870 71954 85 1 1 92 62 23 0.362 -0.51 4.05 Term + 73889 74067 179 0 2 73 48 110 0.603 3.35 4.06 PlyA + 75095 75100 6 1.05 5.00 Prom + 82293 82332 40 -4.86 5.01 Init + 96358 96428 71 0 2 81 103 141 0.992 13.43 5.02 Intr + 96925 97096 172 1 1 83 70 12 0.483 -1.15 5.03 Term + 97626 97754 129 2 0 63 43 95 0.329 0.68 5.04 PlyA + 98154 98159 6 -1.75 6.05 PlyA - 99482 99477 6 1.05 6.04 Term - 100162 99998 165 1 0 76 55 222 0.802 15.62 6.03 Intr - 100647 100563 85 2 1 86 73 97 0.408 7.82 6.02 Intr - 101021 100730 292 0 1 103 91 387 0.999 36.69 6.01 Init - 102980 102758 223 2 1 99 79 383 0.982 35.22 6.00 Prom - 110124 110085 40 -3.46 7.00 Prom + 113275 113314 40 -6.66 7.01 Init + 116949 117196 248 2 2 73 93 107 0.142 5.23 7.02 Intr + 123677 123866 190 2 1 71 92 19 0.047 0.19 7.03 Term + 140501 140605 105 1 0 102 54 72 0.458 3.61 7.04 PlyA + 140742 140747 6 1.05 8.03 PlyA - 143995 143990 6 1.05 8.02 Term - 147518 147400 119 0 2 82 39 90 0.675 2.20 8.01 Init - 149890 149731 160 1 1 100 65 35 0.645 2.11 8.00 Prom - 168658 168619 40 -2.46 9.03 PlyA - 169422 169417 6 1.05 9.02 Term - 171110 170477 634 1 1 17 36 298 0.351 11.26 9.01 Init - 171421 171240 182 1 2 45 69 73 0.414 -0.24 9.00 Prom - 171884 171845 40 -4.96 10.07 PlyA - 172053 172048 6 1.05 10.06 Term - 173231 173073 159 2 0 75 35 153 0.980 6.64 10.05 Intr - 178372 178165 208 0 1 64 72 126 0.683 7.68 10.04 Intr - 178638 178461 178 1 1 70 69 137 0.922 8.98 10.03 Intr - 186039 185923 117 1 0 94 78 55 0.640 5.54 10.02 Intr - 192989 192906 84 0 0 81 47 54 0.186 0.39 10.01 Init - 194280 193140 1141 0 1 86 60 139 0.242 5.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_1|110_aa XPVASAHYWLRSLAILACFDPPSTTLWGRLFAMKDAEVTSRKHLLCALRPPKPPILNLYP PHHGGDTEIQRVAVTQLAGSLRQRPYLVGPVRAQYPAEYQELTGQLHSYE >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_1|333_bp ngacctgtggcatctgcccactactggctgcggagcttggccatcctggcatgttttgat cctccctctacaaccctctggggcagactgtttgctatgaaggatgcagaggtcacttcc aggaagcatctcctctgcgccctccgacctcctaaacccccaatcctgaatctatatccc ccacatcatggtggggacactgagatccagagagtagctgtgacacagctggcaggttct ctgagacagagaccttatctggtgggtcctgtccgtgcccagtacccagccgagtatcag gagctgacagggcagctgcactcttatgagtaa >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_2|98_aa MPERPHYPLRGGLLRSLSLPDECPPPPCSTAPSPIYHPRTEECRRMVRDWQAAPPVTPVQ DPLGEASWAPESAVRELTNTEEMMIKRGILDGVLEQKR >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_2|297_bp atgcctgagcgcccccactaccccctgcgcggtgggctcctgcgcagcctgagcctcccc gacgagtgcccccccccaccctgctccacggcgcccagtcccatctaccacccaaggact gaggagtgcaggcgcatggtgcgggactggcaggcagctccacctgtgaccccagtgcag gatccactgggtgaagccagctgggctcctgagtctgcagtgcgagagctgactaataca gaagagatgatgattaaacgtgggatactggatggggttctggaacagaaacgttag >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_3|255_aa MAVLCGGLGATWWLRTCWGRGGAISTVPPGDCVDHSEAQEGTKAAKLTEQLCSPRYPVPC AKVTQQEAGRPLPSDICQVEKDRAHLFPCSSKWPADASTEQGDCGGPGKGCQPQAREHLC RQEPLNLLCDSGATKFFPMGEAASYTLCCQGPRSGWETSHEQLLLLSGHGSLVHGLLWPL LLPLEESPITITSLAIHIVGGGDWAVSGCHVALSATSSLETDRSPGLVLRPDHAQVVFDR KSELPADRCDGRSAF >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_3|768_bp atggctgtgctgtgtggtggcttgggagctacatggtggctgagaacctgttggggcagg ggtggtgctatctccacagtcccacctggagactgtgtggaccactctgaggctcaggag ggaacaaaagctgccaagctcacagagcaactctgcagcccccgctatcctgttccttgt gctaaagtcacccagcaagaagctggcaggcctctgccatcagacatctgccaggtggag aaagatagggcccatctcttcccttgctctagcaaatggccagccgacgccagcacagaa caaggagactgtggtggtccagggaagggctgtcagccccaagcaagagagcacctctgc agacaggagcccctcaacctgttgtgtgacagtggtgccaccaagttcttccccatgggg gaagccgcctcctacaccctgtgctgccagggcccccgctctgggtgggaaacgagccac gagcagctgctcctgctctctgggcacggctctctggtccacggtttgctctggcccctc cttcttcccctagaagagtctcctatcaccatcacctcactggccattcatatcgtgggc ggtggggactgggctgtctctgggtgccacgtagccctgtcagccacttccagcctggag acagacaggagcccagggcttgttttacgccctgaccatgctcaggttgtcttcgacagg aagtcagagctcccagcagaccgctgtgatggccgcagtgctttctga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_4|165_aa MEDSKTAKENTHKSPLTGARSLEELGFAVIKLIRYMCGKVLESEGLADTSMAWRHMGGQE LSMCKDSGEQRAMLEQLQVQPPIVIENAPSLSLCGLQGTIQALGCPGTVLDSGYSHRQDK SVTAHTATTLCVKEEPPAHSDHDSVHDSDDTQVSQQRHGDSHVHE >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_4|498_bp atggaggactcgaagacagccaaggaaaacacgcacaagtccccactgaccggtgcacgc agccttgaagaattaggtttcgcagtaattaagctcatccggtacatgtgcggcaaggta ctggagtctgagggtctggcggacacctccatggcctggaggcacatgggtggacaggag ctgtccatgtgcaaggattctggggagcagcgggcaatgctggagcaacttcaggtgcag ccaccaattgtcattgagaacgcccccagcctcagcctatgtggtttacaagggactatc caggccctgggatgcccaggcactgtgctagactcaggatacagccatagacaagataaa agcgtgactgctcacacagccaccacactgtgtgtgaaggaagagccaccagcccactca gatcatgactctgttcatgacagtgatgacacgcaggtgtcccagcagagacacggggac agccatgtccatgaatga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_5|123_aa MGRPRSRLRLRLSSRCGGCAWSPRTLGMRRDLCIERKRNGIQGTFPSSPGPEMKDATAAC RQRDNLSARWLGRSGTQGGRQSPGPTKTTRSRTARHLGKMPKIRKAANLLKVAFLGFSKT GEP >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_5|372_bp atggggcgcccccgctcgcgactacgcctgcgactgtcctcgcgctgcgggggctgtgcg tggagcccgcggactttggggatgagaagagacttgtgcatagagcggaagaggaatggg atccaagggacctttccttcaagtcccgggcctgaaatgaaggatgcgactgctgcctgc aggcagagagacaacctcagcgctcggtggctggggcgcagtggcacgcagggcgggcgc cagagccctggtcctacaaagaccactaggtccagaacagcaaggcaccttgggaaaatg ccaaagattcgcaaggcagctaatctgctgaaggttgccttcctcggatttagtaaaacc ggagagccctga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_6|254_aa MQRPGEPGAARFGPPEGCADHRPHRYRSFMIEEILTEPPGPKGAAPAAAAAAAGELLKFG VQALLAARPFHSHLAVLKAEQAAVFKFPLAPLGCSGLSSALLAAGPGLPGAAGAPHLPLE LQLRGKLEAAGPGEPGTKAKKGRRSRTVFTELQLMGLEKRFEKQKYLSTPDRIDLAESLG LSQLQVKTWYQNRRMKWKKIVLQGGGLESPTKPKGRPKKNSIPTSEQLTEQERAKDAEKP AEVPGEPSDRSRED >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_6|765_bp atgcagcggccgggggagccgggcgccgcgcgcttcggcccgcccgagggctgcgcggac caccggccgcaccgctatcgcagcttcatgattgaggagatcctcacggagccacccggg cccaagggcgccgcgcccgcagccgccgctgccgcggcgggcgagctgctgaagttcggc gtgcaggcgctgctggcggcgcggcccttccacagccacctggccgtgctgaaggccgag caggcggcggtgttcaagttcccactggcgccgctgggctgttcagggctgagctctgcg ttgctggcggcagggcccgggctgcccggcgccgcgggtgcgccacacctgccgctcgag ttgcagctccgcgggaagctggaggcggcaggccctggggagccaggcaccaaagccaag aaggggcgtcggagccgcactgtgttcaccgagctgcagctgatgggcctggagaaacgc ttcgagaagcagaagtacctttccacgccggacagaatagatcttgctgagtccctgggc ctgagccagttgcaggtgaagacgtggtaccagaatcggaggatgaagtggaagaaaata gtgctgcagggcggcggcctggagtctcccaccaagcccaaggggcggcccaagaagaac tcaattccaacgagcgagcagcttactgagcaggagcgcgccaaggatgcagagaaaccg gcggaggtgccgggcgagcccagcgacaggagccgcgaggactga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_7|180_aa MTPGQRPHQEVGTAAAAAHMPRIPSTVGAHSGSTCFPNQQQSEPESLWQESDGRGHRVRP DVWGGCELGAPEACAFWSIHAARYTLLVSSHTRKIQGHTSPRAPNATLQLCGISKAASRR NLGFPVCDRKAGFKSSQGPCQPWHYWRTQHSILPWKQRQQPSPDTKPAGTLILDIQPPDL >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_7|543_bp atgacccctgggcagaggcctcaccaagaggtggggacagcagctgcggcggcccatatg ccaaggataccatccacagttggagcacattcagggtccacttgcttccctaaccagcag caaagtgaaccagagtctctttggcaggaatcagatggccgtggccaccgggtgcggcct gacgtctggggtgggtgcgagttgggggctcccgaggcctgcgccttctggagcatccat gcagcccgatacacattactggtctcctcccacacaaggaaaattcagggccacacaagc cctagagccccgaatgccactttgcagctttgtggcattagcaaagctgcttcacggcgg aacctcggtttccctgtctgtgacaggaaggctggattcaagagttctcaaggcccctgc cagccctggcattattggaggacacagcattcaatactaccttggaagcagcgacagcaa ccctcaccagacaccaagcctgctggcaccttgatcttggacatccagcctccagatctg tga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_8|92_aa MESEDGLGPALALICSAELLATGKWEASNPAASPWLIVSQVAAVSDHYDSNDNFCEDHQK WFAVTQQGQWYAFTVLPQATLVLLLSAIIQSK >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_8|279_bp atggagtctgaggatgggttgggtcctgccctcgcgctcatatgcagtgctgagctcctt gctactgggaaatgggaggctagtaacccagcagcatccccgtggttgattgtgagccaa gtggctgctgtgtctgaccattatgacagtaatgataacttttgtgaggaccaccagaag tggtttgctgttacccaacagggccaatggtatgcctttacggtgttgccacaggctaca ttagttctgctgctgtctgccataatacagtctaagtga >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_9|271_aa MKAEIKMFFETNENKDTTYHNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTDIQTTIREYYKHLYANKLENLEE MDKFLNTYTLQRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAKFYQRYKEEL VPFLLKLFQSTEKEGILPNSFYEASSILIPKPGRDTMKKENFRPIALMNIDAKILNKILA NQIQQHIKKLIHHDQVGSSLECKAGSIYTNQ >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_9|816_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccat aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa aagatcaacaaaattgatagaccactagcaagactaataaagaagaaaagagagaagaat caaatagacacaataaaaaatgataaaggggatatcaccaccgatcccacagatatacaa actaccatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaagaa atggataaattcctcaacacatacaccctccaaagactaaaccaggaagaagttgaatct ctgaatagaccaattacaggctctgaaattgaggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccaaattctaccagaggtacaaggaggaactg gtaccattccttctgaagctattccaatcaacagaaaaagagggaatcctccctaactca ttttatgaggccagcagcatcctgataccaaagccaggcagagacacaatgaaaaaagag aattttagaccaatagccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggctcatccctg gaatgcaaggctggttcaatatacacaaatcaataa >gi568815589r:93852167_94052948|GENSCAN_predicted_peptide_10|628_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNRKRACIAKSIPSQKNKAGGIMLPD FKLYYKATVTKTAWYWYQSRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTTYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDISMSKDF MSKTPKAMATKAKIEKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRIY NELKQIYKKKTNNPVKKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYHL TPVRMAIIKKSGNNRCWRGCGETGTLLHCWWDCKLVQPLWKSVWRFLWVLDLELPFDPAI PLLGIYPKDYKSCCYKDTCTRIWMKMETIILSKPLQGQKTKHRVFSLIGDAFVFIGVCVR ESNGLTCLLPEETMEIMLSLMLASLFHARPPANVSRHTPHVTCAGQWRPEADRDESGGSG HFRTGAGGEGTVRDAEGMARVRKGDPKVEKSEWRKSQESPGAPRTEPSDWPPLRARFLCC SSCFYWSYRPNSYGVAAPGGSSRPQVVQRNPRSRIWERSSSPATEQSWTENDFDELREEG FRRLNYSELQEEIQTKGKEVKNFEKNLD >gi568815589r:93852167_94052948|GENSCAN_predicted_CDS_10|1887_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccgatg actttcttcacagaattggaaaaaactactctaaagttcatatggaaccgaaaaagagcc tgcattgccaagtcaatcccaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactgtactataaggctacagtaaccaaaacagcatggtactggtaccaaagcaga gatatagaccaatggaacagaacagagccctcagaaataatgccacacatctacaactat ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca acttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaactctagaagaaaacctaggcaataccattcaggacataagcatgagcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgaaaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttataagaaaaaaacaaacaaccctgtcaaaaagtgggcaaag gatatgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctggagaggatgt ggagaaacaggaacacttttacactgttggtgggactgtaaactagttcaaccattgtgg aagtcagtgtggcgattcctctgggttctagatctagaattaccatttgacccagccatc ccattactgggtatatatccaaaggattataaatcatgctgctataaagacacatgcaca cggatatggatgaagatggaaaccatcattctcagcaaaccattgcagggacaaaaaacc aaacaccgcgtgttctcactcataggtgatgcctttgtttttattggcgtctgtgtaaga gaatccaacggcctgacctgtcttctgcccgaagagaccatggagattatgctcagtctt atgcttgcctcccttttccacgcgcgcccgcccgccaatgtcagccgccacacgccgcac gtgacgtgtgctggccaatggaggcctgaagcggaccgtgacgagagcggtggctctggg cattttcgaacgggcgcgggcggtgagggaaccgtgagggacgctgaggggatggcgcgg gtgcggaaaggggaccctaaggtcgagaagagtgaatggaggaagagccaggagagccct ggagcaccgcggacggaacccagcgactggccgcctctccgggcgcgcttcctgtgttgt tcctcctgcttctactggtcctatcgtcccaactcttacggcgtcgcggccccaggtggt tcgtcgcggccccaggtggttcagagaaatccgaggagcagaatctgggaacgcagttcc tcaccagcaacggaacaaagctggacggagaatgactttgacgagttgagagaagaaggc ttcagacgattaaactactccgagctacaggaggaaattcaaaccaaaggcaaagaagtt aaaaactttgaaaaaaatttagactaa